Y.C. Tay/Social Network Datasets

Vision: Replacing TPC benchmarks for database systems. The biggest challenge in this vision lies in synthetically scaling empirical social network datasets: UpSizeR -- synthetically scaling an empirical relational database. This challenge requires a model for social network data, and graphs are the obvious candidate, but: sonSchema -- a social network is not a graph! A graph is a static, syntactic model that does not capture the dynamics and semantics of a social network; this is evident from: sonLP -- social network link prediction by principal component regression. Our ambition is to build a sonSchema-based open-source system that replaces MySQL as the default database management system for social network data: sonSQL -- an extensible relational DBMS for social network start-ups.