Goal
On my platform, users can add past job titles (e.g., “Project Manager”). For filtering and analysis, that’s too broad you could be an IT Project Manager, Construction Project Manager, etc.
In Sweden, we have SSYK (similar to the US SOC – Standard Occupational Classification or the globally used ISCO – International Standard Classification of Occupations ), which categorizes job roles into detailed sections.
“Project Manager” alone is meaningless, but if a user writes “I was a project manager at a construction company,” I want to send that title + context to ChatGPT via API and map it to the correct SSYK code, returning a structured JSON. Example:
{
"raw_title": "Project Manager",
"ssyk_hirarki_1": "5",
"ssyk_hirarki_2": "54",
"ssyk_hirarki_3": "541",
"ssyk_hirarki_4": "5419",
"ssyk_occupation": "Project Manager, Construction"
}
Problem
I know how to make API calls, but I can’t figure out how to attach the SSYK JSON file (≈15,000 lines) so that the model can reference it.
I’ve used the following API call to analyze documents (like invoices), but it seems to only accept .pdf. I tried .json and .txt, but no luck.
{
"model": "gpt-4o-mini",
"temperature": 0.7,
"text": { "format": { "type": "json_object" } },
"input": [
{
"role": "user",
"content": [
{ "type": "input_file", "file_url": <file_url> },
{ "type": "input_text", "text": <instruction> }
]
}
]
}
Question
How do I attach the SSYK JSON to the prompt so that the model can actually read it and look up the right category? Do i upload it beforehand somewhere to openAI?
Is there a reliable way to handle large reference lists (like 15k job codes) without converting everything to PDF?
Payment 
I’d be happy to pay someone if they can point me in the right direction and help me get this working. I can share screenshots (but not my editor, my business partner isn’t comfortable with that).
Some poeple i look up to that might be able to help out here: @adamhholmes @boston85719 @Jici @georgecollier
