# great learning experience
# opporutity to work in a central location
# good exposure on blockchain platform
about the company
Our client is a Consulting company helps transform businesses with its blockchain-based POS solution.
about the job
You’ll be responsible for providing high-quality datasets, including but not limited to the following tasks:
...
1. Obtain publicly available internet data through analysis, simulation, reverse engineering, etc.
2. Improve the quality and quantity of data collection through technical means.
3. Participate in data cleaning for large NLP models, including but not limited to data format conversion, content extraction, etc.
4. Deeply understand the data content, analyze data characteristics, continuously optimize data cleaning rules, and improve the quality of output data.
skills and experience required
- Master's degree or higher in Computer Science or Artificial Intelligence, with 2 years or more of relevant work experience preferred.
- Familiar with various transformer models and multimodal models, with experience in multimodal model development preferred.
- Experience in one or more fields such as web scraping, data preprocessing, and big data is preferred.
- Understanding of frameworks such as TensorFlow, PyTorch, and Keras.
- Proficient in Python programming, familiar with Linux, and capable of using Shell scripts to solve daily problems.
- Familiar with multi-threading, multi-processing, and network programming, familiar with at least one of the following tools: Scrapy, Spark, Xpath, Css-Selector, or having relevant experience in the NLP field is preferred.
Fulfill at least 3 of the 5 criteria below:
• Proficient in one of the following languages: Python, Java, C++, R, Julia.
• Familiar with datasets, including annotation.
• Familiar with one of the libraries: TensorFlow, PyTorch, and/or Keras.
• Familiarity with transformer models, tweaking, or deploying.
• It helps if you are also familiar with traditional models such as CNN, RNN, GNN.
To apply online please use the 'apply' function
(EA: 94C3609/ R1324990 )