Data Scientist
We are looking for a data scientist who can work on a cross-functional team. We are on a mission to organize the world’s financial data to provide crucial insights.
What You’ll Do:
Write server-side and back-end processes for high-volume data analytics and processing.
Identify, research, invent and prototype new methods of analyzing and integrating financial data.
Investigate and discover new data sources.
Develop ways to automate discovery and integration of new data sources.
Analyze, transform, structure and crunch financial data that is growing in size, diversity and complexity.
Work on ways to use NLP to interpret open-ended queries and translate them into relevant computations.
Research and invent ways to take unstructured data sources, find the key entities and events involved, and organize and store these insights.
Use NLP techniques to summarize events.
What we look for:
Expertise in at least one core programming language, such as Python (IPython, NumPy, SciPy, Pandas).
Experience with machine learning, natural language processing and identifying training sets.
Strong background in NLP.
Strong statistical knowledge and experience applying analysis to real data.
Understanding of algorithms, data structures and design patterns.
Ability to read a method from a blog or a PhD paper (or just invent a new one), and quickly prototype and evaluate its usefulness.
Effective coding, documentation and communication habits.
Technologies we like:
Statistical programming languages like R and Python, specifically NumPy, SciPy and Pandas.
Spark and Hadoop.
Data stores like MySQL, ElasticSearch, Neo4j and various NoSQL alternatives.
Ubuntu Linux, hosted on Amazon AWS and Google Cloud.
Development tools like Git and Jenkins.