Jane Street Market Prediction

Click the links below to see project

Part1 - EDA
Part2 - Predict
Part3 - Blog post

Kaggle Competition. This project includes exploratory data analysis(notebooks) and prediction scripts.

Problem: 130 anonymized features. Predict whether taking each trading opportunities will result in profit for a whole year.

Dataset: 12GB of Real world financial markets.

anonymized set of features, feature_{0...129}, representing real stock market data.
each row in the dataset represents a trading opportunity, for which you will be predicting an action value: 1 to make the trade and 0 to pass on it.
each trade has an associated weight and resp, which together represents a return on the trade.
date column is an integer which represents the day of the trade, while ts_id represents a time ordering.
in addition to anonymized feature values, you are provided with metadata about the features in features.csv. cred: https://www.kaggle.com/c/jane-street-market-prediction/data

Modules used: numpy, pandas, tensorflow, tqdm, random, datatable, sklearn, gc, seaborn, matplotlib, plotly, defaultdict

INSTRUCTION although I don't recommend you to run this since this is specifically for Jane Street ANONYMIZED FEATURES competition..

input folder - Put the feature.csv, train.csv from https://www.kaggle.com/c/jane-street-market-prediction and imputed_df.csv from (https://www.kaggle.com/louise2001/janestreetimputeddata).
src folder - scripts are stored here. Just run python train.py to clean and train and save model to models folder. submit.py is to submit for Kaggle competition.
model folder - the models are saved here output folder - the output cleaned data are saved here notebook folder - notebooks used for EDA and feature engineering and training are here.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.vs		.vs
input		input
notebooks		notebooks
src		src
.gitattributes		.gitattributes
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Jane Street Market Prediction

Click the links below to see project

Problem: 130 anonymized features. Predict whether taking each trading opportunities will result in profit for a whole year.

Dataset: 12GB of Real world financial markets.

Modules used: numpy, pandas, tensorflow, tqdm, random, datatable, sklearn, gc, seaborn, matplotlib, plotly, defaultdict

INSTRUCTION although I don't recommend you to run this since this is specifically for Jane Street ANONYMIZED FEATURES competition..

About

Releases

Packages

Languages

leejaeka/Jane-Street-Market-Competition

Folders and files

Latest commit

History

Repository files navigation

Jane Street Market Prediction

Click the links below to see project

Problem: 130 anonymized features. Predict whether taking each trading opportunities will result in profit for a whole year.

Dataset: 12GB of Real world financial markets.

Modules used: numpy, pandas, tensorflow, tqdm, random, datatable, sklearn, gc, seaborn, matplotlib, plotly, defaultdict

INSTRUCTION although I don't recommend you to run this since this is specifically for Jane Street ANONYMIZED FEATURES competition..

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages