Grow Your Network
Making your profile informative by adding a biography, linking your social media accounts as such.
Receive information about data science job opportunities and get connected with head hunters.
For Educational Purposes
If you would like to use SIGNATE for educational purposes at universities and companies, please contact us.
Fill in the required fields on temporary registration information input screen.
- Input username with 2 to 20 characters.
- Usernames must be unique.
- Input password with 6 to 20 alphanumeric characters.
You will receive a six-digit code after filling in the required fields in .
- The code is valid for 30 minutes.
- If by any chance you don’t receive the code, please check your spam folder or email settings and contact us from here.
Enter the six-digit code you received in  and complete the email address verification.
- If you have closed the code input screen, please click the link [To reissue your confirmation code, click here] in the email in  to show it again.
When the email address verification is completed in , you will be redirected to the Profile Details input screen. Please fill in the required fields to complete registration.
- Date of birth is required when you forget your password.
You can choose any competitions that are likely to suit your skills and interests!
When you try to download data for the first time in a competition, the terms of the competition will be displayed.
Please read through and agree to the terms.
- You must agree to the terms of each competition to participate in it.
- There are some competitions that require a conclusion of NDA contract and identity verification.
If you have any questions about data and terms of the competition after taking a look at overview of dataset, please contact us.
- Please do not use the dataset for any purpose other than the competition and redistributing data is prohibited.
- There are some competitions where cloud data analysis environment is provided, which means participants cannot download dataset to their local environment.
Confirm the Rules
Please read through the competition rules so that your model won't violate them. Click here to see an example of the competition rules. For any questions about the rules, please contact us.
Manage Analysis Results
It would be helpful to reproduce the result when you become a preliminary winner. As there's a chance to win due to disqualification as such, don't be discouraged with bad scores!
Overfitting happens when a model learns the detail and noise in the training data to the extent that it negatively impacts its performance on new data, which results in a drop in the final ranking.
Please keep in mind to create a practical model as each competition has its own goal such as achieving business challenges, solving social problems, and sharing results of studies.
Once you have downloaded dataset, get started with figuring out the characteristics of the data. It is important to confirm the number of records in data, category of data (continuous, discrete, string), percentage of missing data, distribution of data, and task type (regression, classification, multi-class, multi-label).
After figuring out the characteristics of the data, it's time to start exploratory data analysis. It is good to analyze to find such as the relationship between the variables, which variables are closely related with the objective variable, and what kind of features seems to be created to gain insight for prediction accuracy improvement, by using the methods like clustering, correlation analysis, and other various approaches.
Once the exploratory data analysis is done, let's move on to the next step: feature engineering and creating a model. Evaluate your prediction accuracy based on the data used to train your model to estimate how good prediction it gives. Then submit the prediction on the test set to SIGNATE to see how it actually performs.
Keep engineering features that are likely to contribute to your model improvement and submit the prediction to confirm whether it works or not.
Make good use of techniques such as cross validation, bootstrapping, feature selection, and hyper parameters tuning to avoid overfitting.
Don't forget to manage your analysis results so that you can reproduce them later.
Keep improving model
It's good to see how good other competitors models are and compare with yours. As the competition end date approaches, more and more submissions are made and the competition tends to become hot and intense. Keep improving your model so you can maintain your ranking.
Submission limits per day
Every competition has a limit to the number of submissions per day. Once you have reached the limit of the day, give your brain and machine a break and let's wait for the limit getting reset at 0:00.
Final ranking is determined by the private leaderboard
The test set is split into a public and private part in the competition. During the competition, scores on the leaderboard are computed based solely on a fraction of the test set. We use the rest of the set to evaluate and determine the final ranking after the competition. Leaderboard automatically switches to the final evaluation the moment competition ends, where we sometimes see the leaderboard ranking shaken up.
The purpose of the competition for the participants is to produce the model that predicts best on the test set whose objective variable is disclosed.
* Trying to figure out the true value of objective variables on the test set is regarded as cheating which is prohibited. When a participant is found to have cheated in the competition, he or she will be disqualified.
Submission score is automatically computed based on the evaluation criteria to be reported on the leaderboard along with its ranking.
Scores reported on the leaderboard during the competition are temporary ones and after the competition, they will be switched into final evaluation ones.
The leaderboard will be switched to display the final ranking once the competition is closed.
Preliminary winners are required to provide their code and documentation. Please make sure that your model can reproduce the final ranking score and does not violate the competition rules. The documentation should be well written and polished.
Preliminary winners are required to submit source code and documentation of their model to receive the prize money. Please see the guidelines below for further details.
Transferring intellectual property rights
In order to receive the prize money, the intellectual property rights in your model are required to be transferred to the competition sponsor. Please note that use of the model and publishing its approach in the blog is not allowed once the rights are transferred. If you have any questions, please contact us.
We will send a notification email to preliminary competition winners.
- We will notify in a week after determining their final ranking.
- Please check your spam folder if the email is not sent to your inbox.
Following our team's instruction, submit your model documentation by the deadline.
- We would appreciate it if you could provide us with well-arranged code and document as it helps us to proceed the verification process smoothly.
- Please make sure that your submitting model is what you used to generate the final evaluation result.
After we receive your model submission, we will confirm whether the model violates the competition rules and can reproduce its final evaluation result.
- Please note that our team will contact you when in unexpected situations during the model verifying process.
- To proceed the process smoothly, please provide your model documentation in as much detail as possible.
Here are examples of required information:
- Suggestions obtained from analysis and modeling
- A procedure to reproduce the result
For more details, click here.
Almost done! By concluding a contract of such as transferring rights, you will acquire the right to accept the payment of the prize.
- After entering into this agreement, in principle, the payment of the prize money will be completed within 2 months.