Data Engineer, OneDegree AI

OneDegree Group

Job Description

We are seeking a skilled and motivated individual to join our team as a Data Engineer. The successful candidate will assist in research, support model training, and optimize machine learning solutions. The role requires organizing relevant data, analyzing test results, and fine-tuning models. Additionally, the candidate will be responsible for analyzing and interpreting large datasets to support our B2B AI large language model (LLM) Verification solution.

OneDegree Tech Blog: https://medium.com/onedegree-tech-blog

-

How to apply

Please apply for this position through 👉 https://grnh.se/3d78328e4us

This helps us process your application faster.
*Please submit your CV in English. Thank you.
 

-

Requirements

  1. Proficient in data warehousing and data preprocessing concepts, including data cleaning, data preparation, tokenization, and related data tasks.
     
  2. Skilled in tokenization and transforming raw data into structured formats suitable for machine learning models.
     
  3. Proficient in Python (including NumPy, SciPy, and pandas) and familiar with OpenAI technologies.
     
  4. Experience with SQL and database design (e.g., SQL, NoSQL, and vector databases):
    • Understanding of database design principles, including both SQL and NoSQL databases.
    • Familiarity with vector databases and their application in AI and machine learning contexts.
       
  5. Experience with OpenAI, LangChain, and RAG architecture:
    • Hands-on experience with OpenAI technologies and integrating them into machine learning workflows.
    • Knowledge of LangChain and RAG (Retrieval-Augmented Generation) architecture, and their implementation in practical projects.
       
  6. Demonstrated ability to analyze large datasets, using statistical and machine learning techniques to derive insights.
     
  7. Strong team collaboration and self-learning abilities:
    • Proven ability to work effectively in a team environment, collaborating with colleagues to achieve common goals.
    • Self-motivated with a strong desire to continuously learn and stay updated with the latest industry trends and technologies.

-

Remote Work Type

Partially remote

Preferred Qualifications

  1. Familiarity with machine learning frameworks (e.g., Keras, TensorFlow, PyTorch) is a plus:
    • Experience using popular machine learning frameworks such as Keras, TensorFlow, or PyTorch.
       
  2. Familiarity with NLU/NLG/NLP architectures (e.g., BERT, Transformers) is a plus:
    • Knowledge of Natural Language Understanding (NLU), Natural Language Generation (NLG), and Natural Language Processing (NLP) architectures.

  3. Understanding of the Java programming language and its application in data engineering and machine learning projects is a plus.

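Employee Benefits

Statutory Benefits

Labor insurance, National Health Insurance, annual leave, labor pension, marriage leave

Other Benefits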

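Work well, rest well

  • Annual leave from your first day: 15 days in the first year (prorated by start date)
  • 5 days of fully paid sick leave and 3 days of fully paid menstrual leave per year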

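Grow together, keep improving

  • Subsidies for attending conferences and external training (full-time employees)
  • Certification subsidy (full-time employees)
  • Study groups on diverse topics such as frontend, backend, SRE, and blockchain (open to all staff)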

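We work hard, and we live fully too

  • Health checkup subsidy (full-time employees)
  • Club subsidies: various sports clubs, a board game club, a video game club, and the "What Are We Doing This Week" club
  • Regularly restocked snack and drink cabinets, an espresso machine, and a sparkling water maker
  • Comfortable open-plan office, a 5-minute walk from MRT Taipei 101 station
  • Flexible working hours and flexible remote work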

Salary Range

NT$ 700,000 - 980,000 (annual salary)