• Use

    Big data

    to achieve insanely great results

      Aggregate

    Still not collecting data or have it scattered across multiple data centers? We can help you finally integrate it in one place and uncover new opportunities.

      Analyze

    Struggling to understand what is going on with your data? We can help you find actionable insights and build advanced predictive models to drive your company up.

      Win

    Busy with client relationship management, fraud, risks and other routine? Let intelligent software do it for you. Focus on things that matter. Create value.

  • go

    data driven

    ALREADY IN MAY 2014


    Retail Intensive

      Next big event

    Speaker: Elon Musk

    Time: 3.00 pm (EST)

    Date: 15th of May 2014

    Topic: Next Big Thing in Data-driven World

    Banking Intensive

      Next big event

    Speaker: Elon Musk

    Time: 3.00 pm (EST)

    Date: 15th of May 2014

    Topic: Next Big Thing in Data-driven World




Services

Helping You Act On Your Data And Get Things Done.

To guarantee an exceptional quality of our services, we attract the best experts in the field for each problem leveraging our unique network of contacts and partnerships with the top universities and research labs all over the world.

Data Strategy

We have exposure to numerous data problems and innovative solutions. We know what works and what does not, when to play and when to stay. Our executive advising services help organization leaders make right decision about data having 24/7 access to our experts.

Full Cycle R&D

Research challenges is our passion. If you have a data problem, contact DATASTARS. Coming from the top international PhD programs and having partnerships with the world class research labs, we guarantee deep thinking and exceptional results.

Data Enrichment

We help our clients enrich thier data with the data from external sources using modern data fusion techniques and partnerships with the leading international data suppliers. External data leads to increased profitability and new disruptive business models.

Data Visualization

It is very hard to make informed data-driven decisions when you cannot see a big picture and do not understand your data. We develop interactive visualizations to bring the understanding back to life. Our visualizations are bridges enabling exploration and novel thought-provoking insights.

Data Prototyping

If you are short of peopled who can turn your data-driven idea into action in a short period of time, contact DATASTARS. Our world class data scientists and software engineers have over 200 years of cumulative programming experience and can easily ship you a product for a feasibility study.

Big Data Training

Data is a new pillar in a business model. However, there are very few people, who know how to work with data and create value. To help our clients cross this gap, we designed a laser focused series of corporate trainings Machine Learning, Big Data, and Big Data for Executives, that will bring their teams up to speed.

To guarantee an exceptional quality of our services, we attract the best experts in the field for each problem leveraging our unique network of contacts and partnerships with the top universities and research labs all over the world.

Products

Disruptive data solutions helping you win faster and lead bolder.

Three of our core principles are pushing for perfection, engaging intellectually, and challenging the status quo. We follow these principles with our hearts and minds creating products our clients love.

STarget

STarget is a sentiment analysis technoldetection of polarity of texts written in a language at multiple levels of granularity.

portfolio-thumb-1

Opinosis

A novel graph-based summarization framework that generates concise abstractive summaries of highly redundant texts.

portfolio-thumb-5

[job]Snipper

Technology to fully automatically synthesize highly informative structured snippets from unstructured job postings, and conduct job search at a semantic level.

portfolio-thumb-5

Cambridge SEREC

A breakthrough SEarch and REComendation engine that allows to combine multiple unstructured heterogeneous data sources and work with them in a unified way.

portfolio-thumb-5

RichRelevance

Partnered with RichRelevance we represent in Russia global leader in omnichannel personalization with the most innovative brands and retailers.

portfolio-thumb-3

Gravity R&D

Partnered with RichRelevance we represent in Russia global leader in omnichannel personalization with the most innovative brands and retailers.

portfolio-thumb-3

Parboiled2

Parboiled2 is a Scala 2.10.3+ library enabling lightweight and easy-to-use, yet powerful, fast and elegant parsing of arbitrary input text.

portfolio-thumb-5

Core Expertise

Behind The Awesome Production

We are not in the business of making canned presentations and speeches. We help our clients solve real data problems. With our world class expertise in machine learning, risk intelligence, recommender systems, social network analysis, text mining, time series analysis, information retrieval, outlier detection, algorithmic trading, and game optimization we approach each problem in a unique way and select the best technique to achieve maximal results.

Machine Learning

  • Supervised learning
  • Regression analysis
  • Model evaluation

Risk Intelligence

  • Credit scoring
  • Social credit scoring
  • Anti/Clieck-fraud
  • Web/Social anti-spam
  • Outlier/Anomaly Detection

Text Mining

  • Linear and hierarchical text classification
  • Categorization
  • Topic modeling
  • Snippets generation

Opinion Mining

  • Reviews summarization
  • Comments summarization
  • Sentiment analysis

Information Extraction

  • Named entity extraction
  • Web boilerplate detection
  • Web scraping
  • Query parsing

Recommender Systems

  • Email targeting
  • Product recommendations
  • Group recommendations

Social Network Analysis

  • User profiling
  • Information diffusion
  • Influencers detection
  • Viral marketing optimization
  • Community detection

Time Series Analysis

  • Time Series Forecasting
  • Trend detection
  • Change point detection
  • Quality control
  • Supply chain optimization

Customer Intelligence

  • Loyalty management
  • Retention prediction
  • Client base segmentation

Data Fusion

  • Artificial intelligence
  • Trading Strategies

Game Optimization

  • Bottlenecks detection
  • Activity personalization

Search Ranking

  • Text ranking
  • Entity ranking
  • Opinion-based product ranking

Technologies we love.

Our Research Publications

1. [under review] N. Spirin, J. He, M. Develin, M. Boucher, K. Karahalios, Large Scale Analysis of People Search Patterns in Online Social Network, 37th ACM SIGIR Conference (SIGIR2014)
2. [digital archive] N. Spirin, M. Eslami, J. Ding, P. Jain, B. Bailey, K. Karahalios, Strategies for Crowdsourcing Design Examples Search, Arxiv, 2014
3. N. Spirin, M. Eslami, J. Ding, P. Jain, B. Bailey, K. Karahalios, Searching for Design Examples with Crowdsourcing, 23rd International WWW Conference (WWW2014)
4. N. Spirin and K. Karahalios, Unsupervised Approach to Generate Informative Structured Snippets for Job Search Engines, 22nd International WWW Conference (WWW2013)
5. N. Spirin and J. Han, Survey on Web Spam Detection: Principles and Algorithms, SIGKDD Explorations, Dec 2011
6. J. Tedesco and N. Spirin, Efficiently Retrieving Relevant Pages for Fully-Qualified Entities, University of Illinois at Urbana-Champaign Technical Report, Dec 2012
7. N.V. Spirin and K.V. Vorontsov, Application of Monotonic Correction to Web Search, In Proceeding of 8th International Conference on Intellectualization of Information Processing (IIP2010)
8. N.V. Spirin and K.V. Vorontsov, Learning to Rank with Nonlinear Monotonic Ensemble, 10th International Workshop on Multiple Classifier Systems (MCS2011)
9. N. Surovenko and N. Spirin, Methods for Keywords Extraction and Text Classification, In Proceeding of 54th Conference on Problems in Fundamental and Applied Sciences
10. A. Artemov and N. Spirin, Application of Machine Learning for Automated Information Extraction from OCR-ed Documents, In Proceeding of 54th Conference on Problems in Fundamental and Applied Sciences
11. E. Severenkov and N. Spirin, Application of Crowdsourcing for Scientific Research, In Proceeding of 54th Conference on Problems in Fundamental and Applied Sciences (Best Student Paper Award)
12. N. Spirin, Object-oriented Web Search with Logical and Numerical Constraints, In Proceeding of 54th Conference on Problems in Fundamental and Applied Sciences

Clients

From The Best To The Best

To guarantee an exceptional quality of our services, we attract the best experts in the field for each problem leveraging our unique network of contacts and partnerships with the top universities and research labs all over the world. To guarantee an exceptional quality of our services, we attract the best experts in the field for each problem leveraging our unique network of contacts and partnerships with the top universities and research labs all over the world. To guarantee an exceptional quality of our services, we attract the best experts in the field for each problem leveraging our unique network of contacts and partnerships with the top universities and research labs all over the world.

Testimonial

  • Misquotation is, in fact, the pride and privilege of the learned. A widely- read man never quotes accurately, for the rather obvious reason that he has read too widely.

    Hesketh Pearson, CEO Baju Buathik
  • If you wish success in life, make perseverance your bosom friend, experience your wise counselor, caution your elder brother and hope your guardian genius.

    Joseph Addison, Developer Machine

About

A Premier Data Science Firm With The Deep Research Roots.

To guarantee an exceptional quality of our services, we attract the best experts in the field for each problem leveraging our unique network of contacts and partnerships with the top universities and research labs all over the world.

Our Team

  • Data Scientists

    12
  • Big Data Engineers

    5
  • UX Researchers

    1
  • Front-end Developers

    2

Our Principles

  1. Challenge the status quo
  2. Grow quickly
  3. Act with intergrity
  4. Engage intellectually
  5. Give our best to a client
  6. Push for perfection
  7. Move fast
  8. Believe in what you say
  9. Own our decisions and advices
  10. Ask hard questions constantly

Our partners.

1

  We attract the best experts in the field for each problem.

From the very beginning it became clear to us that in order to solve the most complex client problems, it is crucial to get the best minds involved in the problem solving process. Therefore, we invest in our relationship building process a significant part of our time by participating in academic and industry conferences, releasing pro bono research reports, and thought provoking blog posts and magazine articles. As of know, we have strategic partnerships with 4 universities in Europe, Asia, and North America. The same applies to our core talent sourcing strategy. Our consultants, data scientists, and engineers have experience working for the most innovative companies in the world, like Facebook, Microsoft, Microsoft Research, EMC, IBM, Yandex, Ebay, Yahoo, LinkedIn, and many others.

2

  We solve problems when nobody else can.

Mathematics talent is rare. Even more rare when a mathematician can code, give presentations, and explain her/his deep insights clearly and simply. It is almost impossible to get such a talent excited about an ordinary problem and fit her/him in a legacy organization. Fortunately for you, over the years we have accumulated such a talent under one umbrella being a honeypot with the complex client problems, playful work environment, and professional freedom. Together we work in flow and achieve synergistic effects allowing us to tackle previously untapped problems. Finally, we are hackers and can figure out a solution to any problem.

3

  We know the state-of-the-art and develop the one ourselves.

Our culture is built around constant evolution and learning. If existing tools and instruments aren't enough to solve a specific client problem, we build our own having the true engineering mindset. To keep our minds sharp and skills up-to-date, every week we have internal research seminars, presentations, and tutorials. Every month we invite industry experts to give keynote speeches and share their problem solving best practices and technological know-how. Several times a year we participate in academic and industry conferences. Finally, we arrange webinars, workshops, and conferences in partnership with the major universities and other leading data-driven companies.

4

  We do the impossible to make sure our clients are happy.

7-day workweek is a norm for us. Our leaders have been doing it for over a decade. We can easily wake up at 4am to overcome the timezone differences and barriers of a distributed work. If our client has a critical deadline, we take it with the full responsibility and deliver on time. One time we stayed 78 hours awake in a row with just a 4 hour sleep break in between to make it happen!

Stay in touch Get updates for events and articles on big data

Careers

Building the team of the brightest data minds.

Since we are very selective in our talent sourcing strategy, we don't have any open positions. Instead we are constantly crawling the web and professional social networks for the greatest data minds that can augment our expertise and have the highest possible impact for their talent. DATASTARS was founded with the idea that the brightest PhD graduates should keep working on the problems where they have the most expertise rather than doing laborious and unintelligent work in legacy corporations. We operate more like a research lab rather than a typical company. Therefore, we guarantee that if you impress us with your skills, energy, and attitude, you will have fun doing the things you are the best at in a team of similarly talented!

Send an email with your CV and links to professional profiles and publications to careers@datastars.ru

Contact Us

Share your story for us.


Email: contact@datastars.ru
Tel: +1 (650) 846-1600


Sed blandit augue vitae augue scelerisque bibendum. Vivamus sit amet liberoturpis, non venenatis urna. In blandit, odio convallis suscipit.

© 2014 DATASTARS