本コンペティションは、依頼主である株式会社飯田産業(以下、「クライアント」といいます。)による人財発掘を目的としています。このため、本コンペティションの参加者には、会員登録時の入力情報に追加して、個人情報をご入力いただきます。本コンペティションにおける参加者の情報は、株式会社SIGNATEからクライアントに対して提供されます。これに伴い、当該クライアント又は株式会社SIGNATEから、参加者に対して、採用又は業務委託に関する情報提供又はオファーに関する連絡がなされる場合があります。

お知らせ
2019/06/11 ルールから、参加資格「日本語をネイティブに話せる方」を削除しました。
2019/06/13 よくある質問をFAQに追加しました。
2019/07/31 FAQを更新しました。


背景

土地の販売価格は様々な要因によって決定します。土地そのものの特性はもちろん、面している道路などの周辺環境や駅からの距離といった利便性なども関係しています。
今回のコンペでは、埼玉県における過去の土地の販売実績データを用い、土地の販売価格を予測するアルゴリズムの作成に挑戦していただきます。



タスク説明

データは大きく「現場」「号棟」の2種類に分かれています。「現場」データには、"PJ番号"で管理される区画毎に土地の情報が記録されています。この「現場」データで管理される区画に対して、1つ以上の号棟が割り当てられ、その情報が「号棟」データに記録されています。(下図参照)

今回は、この「号棟」単位の販売価格を予測していただきます。



【本コンペ特記事項】
本コンペティションは、依頼主である株式会社飯田産業(以下、「クライアント」といいます。)による人財発掘を目的としています。このため、本コンペティションの参加者には、会員登録時の入力情報に追加して、個人情報をご入力いただきます。本コンペティションにおける参加者の情報は、株式会社SIGNATEからクライアントに対して提供されます。これに伴い、当該クライアント又は株式会社SIGNATEから、参加者に対して、採用又は業務委託に関する情報提供又はオファーに関する連絡がなされる場合があります。


In order to participate in the Competitions, you are required to agree to these Terms, in addition to the Terms of Use of SIGNATE.JP Site (hereinafter referred to as the "Terms of Use"). You should participate in the Competition after reading carefully and agreeing to these Terms. If you agree, these Terms, the matters that are added to these Terms as "additional matters", the Terms of Use and other terms and conditions that you have agreed to shall be binding on the relevant parties as integral documents.


Article 1. Definitions

1.For the purpose of these Terms, the following terms shall be defined as follows:

(1)"Site" means the website "SIGNATE (https://signate.jp)" on which the Competitions are posted.
(2)"Competition" means any competition on AI development or data analysis on the Site as held by the Host.
(3)"Host" is the host(s) of the Competitions. The Host may be SIGNATE, Inc. (hereinafter referred to as the "Company") or the Company’s client companies, affiliated companies, schools or organizations, etc. (hereinafter referred to as the "Client(s)").
(4)"Participant(s)" means the member(s) who participate in a Competition.
(5)"Submissions" means, collectively, the analysis and prediction results and reports, etc. as submitted in the Competition.
(6)"Final Submissions" means the Submissions that are specified by a Participant on the prescribed page in the Site by the time of completion of a Competition.
(7)"Winner Candidate" means the Participant who has received a notice from the Company that he/she is nominated as a winner candidate.
(8)"Submissions for Final Judgment" means the analysis and prediction model and learning data, etc. as submitted by a Winner Candidate pursuant to the instructions of the Company.
(9)"Final Judgment" means the acceptance inspection and judgment, including reproducibility verification, by the Company for the Final Submissions and Submissions for Final Judgment of a Winner Candidate.
(10)"Winner" means the Winner Candidate who is informed by the Company that he/she has won a prize.
2.Unless otherwise defined in these Terms, the terms used in these Terms that are defined in the Terms of Use shall have the same meaning as defined in the Terms of Use.

Article 2. Competition

1.A member who desires to participate in a Competition shall be required to agree to these Terms and to satisfy the conditions for participation as specified in each such Competition. Any person who is not a member shall not participate in any Competition.
2.Participants shall participate in each Competition in the manner as advised by the Company and shall be obligated to comply with the rules as prescribed in each Competition.
3.Participants may submit the Submissions for the assignment of each Competition during the period of such Competition and submit a proposal on the method of solving the problem to the Host by the end of the period of the said Competition.
4.Participants may submit the Final Submissions in the form specified in each Competition by the time specified by the said Competition.
5.The Final Submissions as submitted shall be evaluated by the evaluation method as specified in each Competition and the final rank order shall be determined based on such evaluation.
6.Any Participant may, as a general rule, check the evaluation results of the Participant him/herself and each of the other Participants on the Site for the Submissions that may be evaluated quantitatively.
7.Participants shall be liable or otherwise responsible for their own Submissions, including their legality.
8.Participants shall not submit any Submissions that have no direct relationship to each Competition.
9.Unless otherwise provided for, Participants shall not directly communicate to, consult with, make a request to, solicit or take any other actions with the Host in respect of the matters related to a Competition during the period of the said Competition.
10.Any Participant who has uncertainty or questions about any Competition shall make sure to contact the Company or its designee through the procedures prescribed by the Company as posted on the Site.
11.The Company shall not be obligated to pay any remuneration or other consideration other than those prescribed in the following Article for any act of the Participants as prescribed in paragraphs hereof.

Article 3. Reward and Vesting of Rights

1.Unless otherwise provided for, any Participant shall satisfy the following requirements in order to be entitled to receive a reward in any Competition that offers a reward:

(1)To be a winner;
(2)To agree to transfer to the Host and the relevant transferee of rights in such Competition all transferable rights, such as copyrights, rights to obtain patents and know-how, etc. in and to all analysis and prediction results, reports, written explanations on analysis and prediction model, algorism, source code and reproduction method, etc., and the Submissions contained in the Final Submissions and Submissions for Final Judgment (including the rights as prescribed in Article 27 and Article 28 of the Copyright Act and the rights to obtain patents; hereinafter referred to as the "Rights");
(3)To agree that any relevant transferee of rights exclusively has the right to use the know-how contained in the Final Submissions and Submissions for Final Judgment for its own business and other purpose without any restriction;
(4)To agree not to exercise moral rights to the Rights against the relevant transferee of rights;
(5)To enter into an agreement for the transfer of the Rights with the relevant eligible transferee of rights, including the agreement to the matter in the preceding three (3) items and other reasonable provisions;
(6)To have the personal identity of such Participant verified by the Company.
(7)Not to breach any provision of these Terms and the Terms of Use.

2.Any Winner Candidate shall, after having received a notice from the Company that he/she is nominated as a winner candidate, submit the Submissions for Final Judgment on or before the designated date and communicate the matters requiring confirmation or response in relation to the Final Submissions and the Submissions for Final Judgment to the Company on or before the designated date, in accordance with the instructions of the Company. The Company shall carry out the final judgment based on such matters requiring confirmation or response. If the Company receives no confirmation or response satisfactory to the Company on or before the designated date, the Company may exclude such Winner Candidate from the subject of the final judgment.
3.If the Company considers that the Final Submissions or Submissions for Final Judgment need to be amended or modified, or there occur any additional matters requiring confirmation, in the course of the final judgment, any Winner Candidate shall take action or make response in relation to the matters that require amendment, etc. or the detailed information on the matters requiring confirmation, on or before the designated date in accordance with the instructions of the Company. If the Company receives no action or response satisfactory to the Company on or before the designated date, the Company may exclude such Winner Candidate from the final judgment.
4.The Company shall determine the Winner through the final judgment and inform the Winner to that effect.

Article 4. Confidentiality

1.Participants shall treat any information, data, or content transmitted through the service where they receive from the Company in relation to each Competition (hereinafter referred to as the "Company-Provided Information") as confidential information and shall not disclose the same to any third party and use the same for any purpose other than for such Competition and purpose specified by the Company separately; provided, however, that the confidential information shall not include any information that falls under any of the following items:

(1)Information that is known to the public at the time of the disclosure;
(2)Information that is already possessed by the Participant at the time of the disclosure (only in the case where such Participant may demonstrate such fact by reasonable means);
(3)Information that becomes known to the public without the fault of the Participant after the disclosure;
(4)Information that is independently developed by the Participant without reference to any information as disclosed (except for those Submissions of the person eligible for a prize which are evaluated); or
(5)Information that is rightfully disclosed by any third party having a right to do so without the obligations of confidentiality (only in the case where such Participant may demonstrate such fact by reasonable means).

2.Any Participant shall delete or return to the Company the Company-Provided Information immediately after the completion of each Competition.
3.Any Winner shall handle his/her Final Submissions and Submissions for Final Judgment in the same manner as prescribed in paragraph 1 hereof.
4.If there is any separate arrangement in relation to the confidential information in each Competition, the provisions of such arrangement shall prevail over the provisions of these Terms.
5.If any dispute occurs between the Host or other third party and the Company due to the breach by any Participant of the provisions of this Article and such other party makes any claim against the Company, such Participant shall compensate for any damage, loss, expenses (including, but not limited to, attorneys’ fees), lost profits and lost revenues, etc. incurred by the Company.
6.The provisions of this Article shall survive the termination of the relevant Competition or the Participant’s completion of the procedures for withdrawal from the service of the Company, with respect to the Company-Provided Information and the Winner’s Final Submissions and Submissions for Final Judgment for a period of five (5) years thereafter.

Article 5. Prohibited Acts of Participants

1.The Company shall prohibit Participants from engaging in any of the following acts in any Competition:

(1)An act of cracking, cheating, spoofing other misconduct;
(2)An act of directly communicating to, consulting with, making a request to, soliciting or responding to solicitation or other activities to other Participants or the Host (other than the Company) without the involvement of the Company;
(3)Any profitmaking activities using the Competition (including solicitation or scouting activities, and use for a third party in educational business, etc.) without the prior approval of the Company in writing or any other manner specified by the Company;
(4)Transfer, offering as collateral or other disposition of the status as a Participant or the rights or obligations as a Participant (except with the prior written consent of the Company); and
(5)Any other act in breach of the Terms of Use.

2.If the Company deems that a Participant engages in any of the prohibited acts as prescribed in the preceding paragraph, the Company may, without prior notice to the Participant, disqualify the Participant from the Competition in which the Participant participates, temporarily suspend the Participant from using the service of the Company, withdraw the Participant’s membership, claim damages from the Participant or take any other measures deemed necessary by the Company.

Article 6. Change, Discontinuation or Termination of Provision of Services under These Terms

1.The Company may change or temporarily suspend the services provided by the Company under these Terms without prior notice to the members.
2.Upon one (1) month prior notice to the members, the Company may suspend for a long period of time or terminate the services provided by the Company under these Terms.
3.The Company shall not be liable for any results or damage arising from the measures taken by the Company under this Article.

Article 7. Modification of Terms

1.The Company may modify, add or delete any provisions of these Terms from time to time without the approval of the members.

Enforced on April 1, 2018
Last updated on January 18, 2019

評価関数
・平均絶対誤差率「MAPE(Mean Absolute Percentage Error)」を使用します。
・評価値は0以上の値をとり、精度が高いほど小さな値となります。




最終順位の決定
1. コンテスト最終日までの評価(暫定評価)は評価用データセットの一部で評価し、コンテスト終了後の評価(最終評価)は評価用データセットの残りの部分で評価します。
  スコアボードはコンテスト終了時に自動的に最終評価に切り替わり、それを元に最終順位を決定します。このため、開催中と終了後では順位が大きく変動する場合もあります。
2. スコアが同値の場合は、早い日時でご応募いただいた参加者を上位とします。
3. 入賞候補者の方には順位確定の際に下記の情報を提出していただきます。
・予測モデルのソースコード
・学習済モデル
・予測結果の再現の為の手順書(前処理部分、学習部分、予測部分が分かるよう明記)
・実行環境(OSのバージョン、使用ソフトウェア及び解析手法) 
・乱数シード(Random Forest等の乱数を利用した手法の場合)
・各説明変数の予測モデルへの寄与度(寄与度の算出が可能な手法を用いた場合)
・データの解釈、工夫点、モデリングから得られる示唆等
4. 再現性検証期間中、入賞候補者及び、その提出モデルが下記いずれかに該当する場合は懸賞の獲得資格を失います。
・事務局からの手続き上の連絡・要求に対して指定された期限内に対応しない
・参加条件やルールを満たしていない
・モデルの予測結果を再現できない


総合ランキング
・本コンペは、総合ランキング(スコア・メダル)の対象です。

心構え
・企業課題の達成、社会問題の解決、研究成果の共有等、大前提となる目的に合わせ、実用性を意識したアプローチで臨んでください。

システムの利用
・利用アカウントは1人につき1つまで。チームでの参加はできません。

情報の取り扱い
・他の参加者と本コンペの予測に関連するデータ・ソースコードを共有する行為は禁止です。

データの利用
・配布するデータ以外のデータを用いてモデルを学習することは原則禁止します。
 ただし、現場データ内の住居表示に対応する緯度・経度に限り利用可能とし、その場合に利用するツールは、オープン且つ無料なものに限定します。

実装方法
・モデルの学習に利用するツールは、オープン且つ無料なもの(python, R 等)に限定します。
・ソースコードは、以下のように前処理、学習、予測、の3つに分け、それぞれを実行すれば処理が進むように実装してください。

①preprocess
 提供データを読み込み、データに前処理を施し、モデルに入力が可能な状態でファイル出力するモジュール。
 get_train_dataやget_test_dataのように、学習用と評価用を分けて前処理を行う関数を定義。
 ※渡す情報として学習用データと評価用データを混在させてもよいですが、get_train_dataで返す結果は前処理された学習用データ、get_test_dataで返す結果は前処理された評価用データのように、処理の内容は独立させてください。
②train
 ①で作成したファイルを読み込み、モデルを学習するモジュール。
 学習済モデルや特徴量、クロスバリデーションの評価結果を出力する関数等を定義。
③predict
 ①で作成したテストデータ及び②で作成したモデルを読み込み、予測結果をファイルとして出力するモジュール。

Q. データ定義書で定義されていない値がいくつか入っていたり、住居表示にも一部抜けがあるようです。

A. 今回、できるだけ生の状態に近い形でデータを提供しており、データ入力時の不備がある場合もそのまま残しています。お手数ですが、必要に応じて、名寄せや補完等のデータクレンジング処理を行ってください。また、データを加工した場合は、元データに対して何をどう変更したかを、ソースコードの前処理部分に記録するようお願いいたします。なお、データクレンジング処理を行う場合も、配布するデータ内の情報のみを利用してください。


Q. 号棟データで、土地面積 (tc_mseki) より建物面積 (tt_mseki)の方が大きいケースがありますが、なぜですか?

A.建物面積は、延べ床面積が記載されています。このため、例えば、2階建てなら1階(50㎡)+2階(40㎡)の合計面積(90㎡が記載)が記載されるため、土地面積より大きくなることがあります。なお、扱っている家は全て戸建て(マンションなどではない)となります。


Q. タイトルには「土地の販売価格の推定」とありますが、号棟の区画に建物が建っている場合は、土地の価格に加えて建物の価格も含まれますか?

A.建物がある場合、予測対象は、土地・建物を合わせた販売価格、となります。誤解を与える表記となっており申し訳ございません。


Q. 現場データで、引渡の状態(hw_status)が"更地"となっているにも関わらず、建物が建っている号棟があるようですが。

A.本項目は、飯田産業様が外部から更地の状況で購入したということを表しております。このため、その後に建物が建つことがあります。


Q. 現場データの「販売種別(建売)」「販売種別(土地売り)」のフラグの数と、対応するpj_noの号棟データの建物有りの号棟と建物が無い号棟の数が合わないケースがあるようです。

A.販売種別は計画値が記入されており、稀に実際の結果とマッチしないケースがあります。


Q. 住居表示以外の項目、例えば駅名から緯度経度を取得し、モデルの学習に利用できますか?

A.住居表示以外の項目から取得した緯度経度の利用は禁止とします。


Q. 特徴量生成に一般常識は利用できますか?

A.配布するデータ内の名義尺度を一般常識に基づき別の名義尺度に再割当てするような、一般的な特徴量生成手法は利用可能です。判断に迷う場合はお問い合わせください。