• Principal Data Scientist

    Location US-NC-Raleigh
    Posting date 1 week ago (11/9/2018 9:12 AM)
    Job ID 65999
    Category Information Technology
  • Company description

    At Red Hat, we connect an innovative community of customers, partners, and contributors to deliver an open source stack of trusted, high-performing solutions. We offer cloud, Linux, middleware, storage, and virtualization technologies, together with award-winning global customer support, consulting, and implementation services. Red Hat is a rapidly growing company supporting more than 90% of Fortune 500 companies.

    Job summary

    The Red Hat Enterprise Data and Analytics team is looking for an experienced data scientist to join us in Raleigh, NC. In this role, you will support both departmental and enterprise teams with insights gained from analyzing company data. You’ll use large datasets to find opportunities to optimize our offerings and processes and use models to test the effectiveness of different courses of action. You’ll need to have solid experience using a variety of data mining and data analysis methods and data tools, building and implementing models, using and creating algorithms, and creating and running simulations. You’ll need to have a proven ability to achieve business results with your data-based insights. As a Principal Data Scientist, you’ll need to be comfortable working with a wide range of stakeholders and functional teams. You’ll also need to have a passion for discovering solutions hidden in large datasets and working with stakeholders to improve business outcomes.

    Primary job responsibilities

    • Work with stakeholders throughout the organization to identify opportunities for leveraging company data to promote business solutions
    • Mine and analyze data from company databases to guide optimization and improvement of product development and business strategies
    • Assess the effectiveness and accuracy of new data sources and data gathering techniques
    • Develop custom data models and algorithms to apply to data sets
    • Use predictive modeling to increase and optimize business outcomes
    • Coordinate with different functional teams to implement models and monitor outcomes
    • Develop processes and tools to monitor and analyze model performance and data accuracy

    Required skills

    • Solid problem-solving skills with an emphasis on product development
    • Experience using statistical computer languages (R, Python, SQL, etc.) to manipulate data and draw insights from large datasets
    • Experience working with and creating data architectures
    • Knowledge of a variety of machine learning (ML) techniques (clustering, decision tree learning, artificial neural networks, etc.) and their real-world advantages and drawbacks
    • Knowledge of advanced statistical techniques and concepts (regression, properties of distributions, statistical tests and proper usage, etc.) and experience with applications
    • Excellent written and verbal communication skills for coordinating across teams
    • Desire to learn and master new technologies and techniques
    • 5-7 years of experience manipulating datasets and building statistical models
    • Master’s or PhD degree in statistics, mathematics, computer science, or another quantitative field
    • Coding knowledge and experience with multiple languages
    • Knowledge of and experience with statistical and data mining techniques like GLM, regression, random forest, boosting, trees, text mining, social network analysis, etc.
    • Experience querying databases and using statistical computer languages like R, Python, SQL, etc.
    • Experience using Redshift, S3, and Spark
    • Experience creating and using advanced ML algorithms and statistics, including regression, simulation, scenario analysis, modeling, clustering, decision trees, neural networks, etc.
    • Experience with distributed data and computing tools like MapReduce, Hadoop, Hive, Spark, Gurobi, MySQL, etc.
    • Experience visualizing and presenting data for stakeholders using Tableau, SAP BusinessObjects (BOBJ), D3, ggplot, etc.


    Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, uniformed services, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.


    Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.

     
