-
Notifications
You must be signed in to change notification settings - Fork 1
/
data.html
executable file
·149 lines (133 loc) · 8.24 KB
/
data.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<meta name="description" content="">
<meta name="author" content="">
<link rel="shortcut icon" href="img/sheep1.jpeg">
<title>Yang Yang</title>
<!-- Bootstrap core CSS -->
<link href="dist/css/bootstrap.min.css" rel="stylesheet">
<!-- Custom styles for this template -->
<link href="jumbotron.css" rel="stylesheet">
<!-- Just for debugging purposes. Don't actually copy this line! -->
<!--[if lt IE 9]><script src="../../assets/js/ie8-responsive-file-warning.js"></script><![endif]-->
<!-- HTML5 shim and Respond.js IE8 support of HTML5 elements and media queries -->
<!--[if lt IE 9]>
<script src="https://oss.maxcdn.com/libs/html5shiv/3.7.0/html5shiv.js"></script>
<script src="https://oss.maxcdn.com/libs/respond.js/1.4.2/respond.min.js"></script>
<![endif]-->
</head>
<body>
<div class="navbar navbar-inverse navbar-fixed-top" role="navigation">
<div class="container">
<div class="navbar-header">
<button type="button" class="navbar-toggle" data-toggle="collapse" data-target=".navbar-collapse">
<span class="sr-only">Toggle navigation</span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
</button>
<!-- <img height="50" src="img/sheep2.jpeg" align="left" hspace="6" style="margin-left:-6p;margin-right:20px"> -->
<a class="navbar-brand" href="#">Yang Yang (杨洋)</a>
</div>
<div class="navbar-collapse collapse">
<ul class="nav nav-pills pull-right">
<li class="active"><a href="index.html">Home</a></li>
</ul>
<!--
<form class="navbar-form navbar-right" role="form">
<div class="form-group">
<input type="text" placeholder="Email" class="form-control">
</div>
<div class="form-group">
<input type="password" placeholder="Password" class="form-control">
</div>
<button type="submit" class="btn btn-success">Sign in</button>
</form>
-->
</div><!--/.navbar-collapse -->
</div>
</div>
<a name="housing_price"></a>
<div class="container">
<div class="page-header">
<h2>DGraph</h2>
</div>
<div class="page-header">
<p>
DGraph represents a collection of large-scale dynamic graph datasets, consisting of interactive objects, events and labels that envolves with time. It provides the opportunity to perform multiple tasks (e.g., node classification, link prediction, graph classification) on massive real-world data and has a wide range of applications with a particular focus on the financial sector. In view of the inclusion of realistic and valuable graph data, the benchmark datasets are expected to promote more relevant research and expand practical applications.
</p>
<!--
<p> <span style="color:#b22222"> Please kindly cite our paper if you would like to use this dataset: </span></p>
<p> Yang Yang*, Yuhong Xu*, Chunping Wang, Yizhou Sun, Fei Wu, Yueting Zhuang, and Ming Gu. Understanding Default Behavior in Online Lending. In CIKM'19 (*: equal contribution). [<a href="bibtex/28.html">BIB</a>] [<a href="works/loan_fraud/cikm19_loan.pdf">PDF</a>]</p>
-->
<p> <span style="color:#b22222"> Please kindly cite our paper if you would like to use this dataset: </span></p>
<p>Xuanwen Huang, Yang Yang, Yang Wang, Chunping Wang, Zhisheng Zhang, Jiarong Xu, and Lei Chen.
DGraph: A Large-Scale Financial Dataset for Graph Anomaly Detection.
<i>Preprint</i>.
[<a target="_blank" href="works/dgraph/dgraph_2022.pdf">PDF</a>]
</p>
<p><a href="https://dgraph.xinye.com/introduction" style="margin-left:-3px; margin-right: 3px;">[Data Download]</a></p>
</div>
</div>
<div class="container">
<div class="page-header">
<h2>Housing Price of Real Estates in Shanghai</h2>
</div>
<div class="page-header">
<p>
This dataset is crawled from <a href="https://shanghai.anjuke.com/">AnJuKe</a>, an online platform for real estate sales and renting. This dataset covers over 18K real estates at Shanghai in 2017.
</p>
<p>In the data, each line indicate a real estate, with the following format: </p>
<i>
<p> "name" is the Chinese name of the real estate; </p>
<p> "price" is the average housing price of the real estate; </p>
<p> "latitude" and "longitude" present the location of the real estate, which is obtained from <a href="http://map.baidu.com/">Baidu Maps</a>.
<p></P>
</i>
<p> <span style="color:#b22222"> Please kindly cite our paper if you would like to use this dataset: </span></p>
<p> Yang Yang, Zongtao Liu, Chenhao Tan, Fei Wu, Yueting Zhuang, and Yafeng Li. To Stay or to Leave: Churn Prediction for Urban Migrants in the Initial Period. In WWW'18. [<a href="bibtex/25.html">BIB</a>] [<a href="works/migrant/migrant_churn.pdf">PDF</a>]</p>
<p><a href="works/migrant/data/housing_price.txt" style="margin-left:-3px; margin-right: 3px;">[Data Download]</a></p>
</div>
</div>
<div class="container">
<div class="page-header">
<h2>Information Cascades on a Twitter Like Chinese Social Media</h2>
</div>
<div class="page-header">
<p>
This dataset is sampled from a real Twitter like Chinese social media. The sampling process is as follows: we first select top 100 source posts (ones not retweeting from others) with most retweets during Oct. 1st, 2012 to Oct. 7th, 2012 and put these posts into a set <i>V</i>. We then scan all other posts and add ones that retweet from one of the posts in <i>V</i> to <i>V</i>. The process is repeated until no more posts are newly added. In this way, we obtain a complete casacade process of those 100 source posts, involved with 96,782 posts in total.
</p>
<p> In the data, each line indicates a post, with the following formate: </p>
<i>
<p> post_idx post_time user_id root_id root_user_id parent_id parent_user_id </p>
<p> "post_idx" is the unique index of the post; </p>
<p> "post_time" is the time when the post was published; </p>
<p> "user_id" is the unique index of the user who published the post; </p>
<p> "root_id" is the index of the source post, would be zero if the current line indicates a source post; </p>
<p> "root_user_id" is the user who published the source post, would be zero if the current line indicates a source post; </p>
<p> "parent_id" is the index of another post the current post retweeted from, would be zero if the current line indicates a source post; </p>
<p> "parent_user_id" is the user who published the 'parent post', would be zero if the current line indicates a source post. </p>
<p> Notice: we remove all content information due to privacy issues. </i> </p>
<p></P>
<p> <span style="color:#b22222"> Please kindly cite our paper if you would like to use this dataset: </span></p>
<p> Yang Yang, Jie Tang, Cane Wing-Ki Leung, Yizhou Sun, Qicong Chen, Juanzi Li, and Qiang Yang. RAIN: Social Role-Aware Information Diffusion. In AAAI'15. 2015. [<a href="bibtex/10.html">BIB</a>] [<a href="works/roleaware/roleaware.pdf">PDF</a>]</p>
<p><a href="works/roleaware/data/retweet.txt" style="margin-left:-3px; margin-right: 3px;">[Data Download]</a></p>
</div>
</div>
<!--
<footer>
<p style="margin:10px;">Created by <a href="">Yang Yang</a>, using a design from <a href="http://getbootstrap.com/"> bootstrap </a></p>
</footer>
</div> -->
<!-- /container -->
<!-- Bootstrap core JavaScript
================================================== -->
<!-- Placed at the end of the document so the pages load faster -->
<script src="https://ajax.googleapis.com/ajax/libs/jquery/1.11.0/jquery.min.js"></script>
<script src="dist/js/bootstrap.min.js"></script>
</body>
</html>