Sansan offers a digital business card organizing service for corporates, aiming for promoting work style reforms by transforming each business opportunity into an asset and making best use of it. The service is based on a system that converts the images of scanned text into machine-encoded text. During the scanning process, in order to achieve the secure and effective data import, the business card is fragmented into as much pieces as possible to be scanned. Even though the optical character recognition technology enables the conversion process, it's still way too difficult to achieve the low error rate.
In order to break through the current circumstances, Sansan presents you a challenge to create an algorithm that automatically recognizes and labels the items on the business card.
※The card data is originally generated for the competition, therefore any similarity to actual persons or organizations is purely coincidental.
Way to represent an item area
An item area is represented by rectangle by specifying the coordinates of the both upper-left and bottom-right points.
The algorithm should label an appropriate tag on each items from among the following 9 tags: company name, full name, title, address including post number, phone number, fax number, mobile number, email address, and website.
Submission file format is csv.
Determination of the final rankings
1. During the competition, scores on the leaderboard are computed based solely on a fraction of the test set. We use the rest of the set to evaluate and determine the final ranking after the competition.
2. If you have the same score with the other participants, the rank will be placed on a first-submit.
Preliminary winners are asked to provide the following items:
- Model source code
- Pre-trained model
- Model documentation*1 (Execution environment, execution time**1, random seed value*2, feature descriptions and importances*3, modeling approach, insights, and so on)
*1: Give a description of each of the following parts: preprocessing, training, and predicting
*2: Should be fixed in the source code
*3: If available
- Questionnaire (motivation for the competition, elapsed time, opinions and feedbacks on the competition, and so on)
3. When a preliminary winner falls under any of the following, it will lose its eligibility to claim the prize:
- Does not respond to the contact within the prescribed period
- Unable to reproduce the result
- Does not meet the terms or rules
4. When a person is disqualified, its runner-up will be eligible for winning the prize.
Premise
Please keep in mind to create a practical model as each competition, in principle, has its own goal such as achieving business challenges, solving social problems, and sharing results of studies.
One account per individual
・You may have one account per participant. Creating a team is not permitted in this competition.
・Submissions can be made up to 5 times per day.
No private sharing
・Sharing code or data with other participants is not permitted.
・You may open source your code.
Data limitations
・Using external data is not permitted.
・Using pre-trained models is permitted as long as they are open sourced.
・Manual labeling on the test set is not permitted.
・Training models with target labels modified is not permitted.
・Manually overwriting location information in the train set is not permitted.
Other limitations
・You may use any softwares as long as they are open sourced and free.
・Modeling process should be divided into 3 parts and a source code for each part has to be executable independently.
1. Preprocessing
Define functions for both train and test set.
2. Training
Define functions that generate such as pre-trained model, feature importances, and results of cross validation.
3. Predicting
Define functions that generate predictions by importing input variables and pre-trained models.
Data
This competition uses data from Sansan.
Software
You need to register Big Data University - Data scientist Workbench in order to use the software provided in the competition.
https://www.courseduck.com/programming/python/
Disclosure policy
Model *1 | Public |
Analysis results *2 | Public |
Restricted : Using in a limited range from research, education to seminars, where many unspecified people cannot access
*2 The insights obtained using the information and data provided, or the solutions including scripts and processed data such as summary statistics
In order to participate in the Competition, you are required to agree to these Terms, in addition to the Terms of Use of SIGNATE.JP Site (hereinafter referred to as the “Terms of Use”). You should participate in the Competition after reading carefully and agreeing to these Terms. If you agree, these Terms, the matters that are added to these Terms as "additional matters", the Terms of Use and other terms and conditions that you have agreed to shall be binding on the relevant parties as integral documents.
Article 1. Definitions
1.For the purpose of these Terms, the following terms shall be defined as follows:
(1)"Site" means the website "SIGNATE (https://signate.jp)” on which the Competitions are posted.
(2)"Competition" means any competition on AI development or data analysis on the Site as held by the Host.
(3)"Host" is the host(s) of the Competitions.
The Host may be SIGNATE, Inc. (hereinafter referred to as the “Company”) or the Company’s client companies, affiliated companies, schools or organizations, etc. (hereinafter referred to as the “Client(s)”).
(4)"Participant(s)" means the member(s) who participate in a Competition.
(5)"Submissions" means, collectively, the analysis and prediction results and reports, etc. as submitted in the Competition.
(6)"Final Submissions" means the Submissions that are specified by a Participant on the prescribed page in the Site by the time of completion of a Competition.
(7)"Winner Candidate" means the Participant who has received a notice from the Company that he/she is nominated as a winner candidate.
(8)"Submissions for Final Judgment" means the analysis and prediction model and learning data, etc. as submitted by a Winner Candidate pursuant to the instructions of the Company.
(9)"Final Judgment" means the acceptance inspection and judgment, including reproducibility verification, by the Company for the Final Submissions and Submissions for Final Judgment of a Winner Candidate.
(10)"Winner" means the Winner Candidate who is informed by the Company that he/she has won a prize.
2.Unless otherwise defined in these Terms, the terms used in these Terms that are defined in the Terms of Use shall have the same meaning as defined in the Terms of Use.
Article 2. Competition
1.A member who desires to participate in a Competition shall be required to agree to these Terms and to satisfy the conditions for participation as specified in each such Competition.
Any person who is not a member shall not participate in any Competition.
2.Participants shall participate in each Competition in the manner as advised by the Company and shall be obligated to comply with the rules as prescribed in each Competition.
3.Participants may submit the Submissions for the assignment of each Competition during the period of such Competition and submit a proposal on the method of solving the problem to the Host by the end of the period of the said Competition.
4.Participants may submit the Final Submissions in the form specified in each Competition by the time specified by the said Competition.
5.The Final Submissions as submitted shall be evaluated by the evaluation method as specified in each Competition and the final rank order shall be determined based on such evaluation.
6.Any Participant may, as a general rule, check the evaluation results of the Participant him/herself and each of the other Participants on the Site for the Submissions that may be evaluated quantitatively.
7.Participants shall be liable or otherwise responsible for their own Submissions, including their legality.
8.Participants shall not submit any Submissions that have no direct relationship to each Competition.
9.Unless otherwise provided for, Participants shall not directly communicate to, consult with, make a request to, solicit or take any other actions with the Host in respect of the matters related to a Competition during the period of the said Competition.
10.Any Participant who has uncertainty or questions about any Competition shall make sure to contact the Company or its designee through the procedures prescribed by the Company as posted on the Site.
11.The Company shall not be obligated to pay any remuneration or other consideration other than those prescribed in the following Article for any act of the Participants as prescribed in paragraphs hereof.
Article 3 Reward and Vesting of Rights
1.Unless otherwise provided for, any Participant shall satisfy the following requirements in order to be entitled to receive a reward in any Competition that offers a reward:
(1)To be a winner;
(2)To agree to transfer to the Host and the relevant transferee of rights in such Competition all transferable rights, such as copyrights, rights to obtain patents and know-how, etc. in and to all analysis and prediction results, reports, analysis and prediction model, algorithm, source code and documentations for the model reproducibility, etc., and the Submissions contained in the Final Submissions and Submissions for Final Judgment (including the rights as prescribed in Article 27 and Article 28 of the Copyright Act and the rights to obtain patents; hereinafter referred to as the "Rights");
(3)To agree that any relevant transferee of rights exclusively has the right to use the know-how contained in the Final Submissions and Submissions for Final Judgment for its own business and other purpose without any restriction;
(4)To agree not to exercise moral rights to the Rights against the relevant transferee of rights;
(5)To enter into an agreement for the transfer of the Rights with the relevant eligible transferee of rights, including the agreement to the matter in the preceding three (3) items and other reasonable provisions;
(6)To have the personal identity of such Participant verified by the Company.
(7)Not to breach any provision of these Terms and the Terms of Use.
2.Any Winner Candidate shall, after having received a notice from the Company that he/she is nominated as a winner candidate, submit the Submissions for Final Judgment on or before the designated date and communicate the matters requiring confirmation or response in relation to the Final Submissions and the Submissions for Final Judgment to the Company on or before the designated date, in accordance with the instructions of the Company.
The Company shall carry out the final judgment based on such matters requiring confirmation or response.
If the Company receives no confirmation or response satisfactory to the Company on or before the designated date, the Company may exclude such Winner Candidate from the subject of the final judgment.
3.If the Company considers that the Final Submissions or Submissions for Final Judgment need to be amended or modified, or there occur any additional matters requiring confirmation, in the course of the final judgment, any Winner Candidate shall take action or make response in relation to the matters that require amendment, etc. or the detailed information on the matters requiring confirmation, on or before the designated date in accordance with the instructions of the Company.
If the Company receives no action or response satisfactory to the Company on or before the designated date, the Company may exclude such Winner Candidate from the final judgment.
4.The Company shall determine the Winner through the final judgment and inform the Winner to that effect.
Article 4 Confidentiality
1.Any Participant shall treat any information and data that they receive from the Company in relation to each Competition (hereinafter referred to as the "Company-Provided Information") as confidential information and shall not disclose the same to any third party and use the same for any purpose other than for such Competition and purpose specified by the Company separately; provided,
however, that the confidential information shall not include any information that falls under any of the following items:
(1)Information that is known to the public at the time of the disclosure;
(2)Information that is already possessed by the Participant at the time of the disclosure (only in the case where such Participant may demonstrate such fact by reasonable means);
(3)Information that becomes known to the public without the fault of the Participant after the disclosure;
(4)Information that is independently developed by the Participant without reference to any information as disclosed (except for those Submissions of the person eligible for a prize which are evaluated); or
(5)Information that is rightfully disclosed by any third party having a right to do so without the obligations of confidentiality (only in the case where such Participant may demonstrate such fact by reasonable means).
2.Any Participant shall delete or return to the Company the Company-Provided Information immediately after the completion of each Competition.
3.Any Winner shall handle his/her Final Submissions and Submissions for Final Judgment in the same manner as prescribed in paragraph 1 hereof.
4.If there is any separate arrangement in relation to the confidential information in each Competition, the provisions of such arrangement shall prevail over the provisions of these Terms.
5.If any dispute occurs between the Host or other third party and the Company due to the breach by any Participant of the provisions of this Article and such other party makes any claim against the Company, such Participant shall compensate for any damage, loss, expenses (including, but not limited to, attorneys’ fees), lost profits and lost revenues, etc. incurred by the Company.
6.The provisions of this Article shall survive the termination of the relevant Competition or the Participant’s completion of the procedures for withdrawal from the service of the Company, with respect to the Company-Provided Information and the Winner’s Final Submissions and Submissions for Final Judgment for a period of five (5) years thereafter.
Article 5 Prohibited Acts of Participants
1.The Company shall prohibit Participants from engaging in any of the following acts in any Competition:
(1)An act of cracking, cheating, spoofing other misconduct;
(2)An act of directly communicating to, consulting with, making a request to, soliciting or responding to solicitation or other activities to other Participants or the Host (other than the Company) without the involvement of the Company;
(3)Any profitmaking activities using the Competition (including solicitation or scouting activities, and use for a third party in educational business, etc.) without the prior approval of the Company in writing or any other manner specified by the Company;
(4)Transfer, offering as collateral or other disposition of the status as a Participant or the rights or obligations as a Participant (except with the prior written consent of the Company); and
(5)Any other act in breach of the Terms of Use.
2.If the Company deems that a Participant engages in any of the prohibited acts as prescribed in the preceding paragraph, the Company may, without prior notice to the Participant, disqualify the Participant from the Competition in which the Participant participates, temporarily suspend the Participant from using the service of the Company, withdraw the Participant’s membership, claim damages from the Participant or take any other measures deemed necessary by the Company.
Article 6. Change, Discontinuation or Termination of Provision of Services under These Terms
1.The Company may change or temporarily suspend the services provided by the Company under these Terms without prior notice to the members.
2.Upon one (1) month prior notice to the members, the Company may suspend for a long period of time or terminate the services provided by the Company under these Terms.
3.The Company shall not be liable for any results or damage arising from the measures taken by the Company under this Article.
Article 7 Modification of Terms
1.The Company may modify, add or delete any provisions of these Terms from time to time without the approval of the members.
Enforced on April 1, 2018