Kranzberg’s First Law: Technology is neither good nor bad—nor is it neutral. At the risk of spoiling its Zenlike nature, let me propose an interpretation: a technology isn’t inherently good or bad, but it will have an impact, which is why it’s not neutral. Almost every applied technology has a good side and a bad side. When you think of transportation technologies, do you think of how they enable a delightful vacation or get the family back together during the holidays—or do you think of traffic jams and pollution? Are books a source of wisdom and spirituality or a way to distribute pornography and hate? Do you applaud medical technology for curing plagues or deplore transportation technology for spreading them? Does encrypted e-mail keep honest people safe from criminals or criminals safe from the police? Are plastics durable conveniences or everlasting pollutants? Counterfeiting comes with money, obscene phone calls come with the telephone, spam comes with e-mail, and pornography comes with the Internet. Every law creates an outlaw.
~ Edward Tenner, Future Hype, Technology Good or Bad, Why Things Bite Back (1996) 

Session - Opening

Session - Federal 
Assessing the Impacts of Changes in the Information Technology R&D Ecosystem: Retaining Leadership in an Increasingly Global Environment
2010 PCAST report
Cloud computing makes vulnerability a much bigger problem.

Session - Games (Ken Perlin)
Kids like to explore and customize things
Bought kinect for each student
collaborative music making
Ask school students to do prototyping
procedural computational education
games for learning website: http://g4li.org/

Session - Education 
* Video games and learning (Kurt Squire)
Would kids actually play this?
"Aegis for growing": assessment has to be in service of making better programming
Education for any background and integration
psychometric theory
observations -> Model of Cognition -> Interpretive Framework
1. Technical Achievements 2. Records of Decisions 3. Peer Records
Assessment to be social, decide by others (esp. real professionals)
Incentivize assessments
* Foldit (Seth Cooper; Center for Game Science)
John Bransford (learning)
problem solving (give a scenario) >>> Ask NIE
Idea: get more data and prove statistically (requires a lot of students) >>> Refraction (Kongregate)
Optimal pathways from novice to experts
* Tracy Fullerton (USC)
watch other people teach others use the system

Peter Lee
Short Term <-> Long Term  vs. Reactive <-> Open-Ended
* Mission Focused (short term, reactive)
* Sustainable (long term, reactive)
* Blue Sky (long term, open-ended)
* Disruptive (short term, open-ended)

Session - Big Data
Virtual Clustering in COSMOS / running Dryad / SCOPE
triple replication, composed into streams, 
store layer: extent nodes + cosmos store manager (CSM)
structured streams have data inside the stream, useful for cases where the data is read-only; like Pig
CPU bound not IO bound

Balkanized clusters
the trend in discovering galaxies is tied to Moore's law
at a fixed cost, new digital cameras and laptops provide exponential improvements.
call for better incremental and randomized algorithms 
visualization service: the client is sent only the visualization representation.
gpu in database work; gpu changes the smart algorithms used in previous algorithmic work
100 TB is about the bottleneck: too hard to move around > must plan to move it incrementally while generating.
Random IO to SSD
noSQL / DB / column store / SciDB
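
The call for better incremental and randomized algorithms can be made concrete with two textbook examples. This is a minimal sketch, not something shown in the session: Welford's online update keeps a running mean and variance in constant memory (incremental), and reservoir sampling (Algorithm R) keeps a uniform sample of a stream whose length is not known in advance (randomized). The names `RunningStats` and `reservoir_sample` are hypothetical.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


class RunningStats:
    """Welford's online algorithm: one pass, O(1) memory."""

    def __init__(self) -> None:
        self.n = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, x: float) -> None:
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self._m2 += delta * (x - self.mean)

    @property
    def variance(self) -> float:
        """Sample variance (n - 1 denominator); 0.0 until two values are seen."""
        return self._m2 / (self.n - 1) if self.n > 1 else 0.0


def reservoir_sample(stream: Iterable[T], k: int,
                     rng: Optional[random.Random] = None) -> List[T]:
    """Algorithm R: uniform sample of up to k items from a stream."""
    rng = rng or random.Random()
    reservoir: List[T] = []
    for i, item in enumerate(stream):
        if i < k:
            reservoir.append(item)
        else:
            j = rng.randint(0, i)  # inclusive on both ends
            if j < k:
                reservoir[j] = item
    return reservoir
```

Both routines touch each record once, which fits the constraint noted above that data at the 100 TB scale has to be processed while it is being generated or moved.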

Session - Knowledge Commodity Computing 
What is an ontology (McGuinness, RPI)
Sustainability in Ontologies
Making tools to help experts build it (using ML)
now building ontologies getting info from twitter, fb, mobile.
base ontologies that are simple.  -> modularity, ease of use
ease of use, transformation to input data and push to the internal ontology.

Probase (Haixun Wang)
Big data inference for relationship discovery via collocation
consensus, typicality, ambiguity, temporal validity
harvest individual is-a ontology nodes via simple Bayesian analyses
use similarity of instances and attributes to compute synonyms
short text is everywhere: twitter, query logs, anchor text 
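
One plausible reading of "typicality" and "ambiguity" above is as conditional probabilities estimated from counts of harvested is-a pairs. The sketch below is an assumption about the general idea, not Probase's actual implementation; the function names and the toy pairs are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, List, Tuple

Pair = Tuple[str, str]  # (concept, instance), e.g. ("fruit", "apple")


def typicality(pairs: Iterable[Pair]) -> Dict[str, Dict[str, float]]:
    """Estimate P(instance | concept) from observed is-a pairs."""
    counts: Dict[str, Counter] = defaultdict(Counter)
    for concept, instance in pairs:
        counts[concept][instance] += 1
    result: Dict[str, Dict[str, float]] = {}
    for concept, instances in counts.items():
        total = sum(instances.values())
        result[concept] = {inst: n / total for inst, n in instances.items()}
    return result


def concept_posterior(pairs: Iterable[Pair], instance: str) -> Dict[str, float]:
    """Estimate P(concept | instance); a flat distribution signals ambiguity."""
    counts = Counter(c for c, i in pairs if i == instance)
    total = sum(counts.values())
    return {c: n / total for c, n in counts.items()} if total else {}


# Toy observations (made up for illustration).
toy_pairs: List[Pair] = (
    [("fruit", "apple")] * 3 + [("fruit", "durian")] + [("company", "apple")] * 2
)
```

With these toy counts, "apple" is a more typical fruit than "durian", and the term "apple" alone is split between the fruit and company senses.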

Bing (Susan Dumais)
CIKM 10 paper clickthrough
query time for localization. 

Twitris (Amit Sheth)

Academic Search (Xin Zou)
- Crawler, Publisher, BibTex provided > Raw DataBase
- Journal/Conf Info > Venue Integration > Paper Integration > Refined Paper
- Author Integration (Domain Classifier, Org Normalizer, Author Name Cleanup) > Refined Paper-Author
Author Org Name Picker + User Input Data > Final DB
- Store Generator > to K/V pair database 
= Author Integration / Metadata cleanup large challenges
= Ingestion every 48 hours, Weekly (new PDFs), Monthly (Crawling Scan; check old papers now can reference new papers), 3 months milestone
- Academic Search > Public API 
- Alex Szalay - Ingest other databases - ingesting 100 million publications

API Jevin West (U Wash; Citation Analysis)
- Visualization systems
- de Solla Price (Science, 1965)
- Too much data, filtering out is an important part of this
- Focusing on network effects
- http://www.mapequation.org/mapdemo/index.html
- Finding important papers is the goal
- hierarchical maps
- look over time: alluvial diagram
- expert / classic / hot / serendipitous RS system

Next Steps (Adnan)
- Completeness / Cross Domain Analysis / CrossRef (accessibility)
- Beyond Co-Authorship > Mentorship
- What is influence?
- Extensibility: Cloud + Client, Investing in APIs, Encouraging an Ecosystem

Q&A - 
- Impact beyond citations (no one cites Tim Berners-Lee for the Web) / Usage Patterns / Textbook or curriculum
- Normalization for field, since certain places cite more
- Alex Wade - API
- Data Set projects!
- schema.org
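
The Q&A point about normalizing for field can be sketched as dividing each paper's citation count by the mean count in its field. This is a minimal illustration under that assumption, not the method used by Academic Search or any speaker; `Paper` and `field_normalized` are hypothetical names and the numbers are invented.

```python
from collections import defaultdict
from statistics import mean
from typing import Dict, List, NamedTuple


class Paper(NamedTuple):
    title: str
    field: str
    citations: int


def field_normalized(papers: List[Paper]) -> Dict[str, float]:
    """Map each title to citations / mean citations of its field."""
    by_field: Dict[str, List[int]] = defaultdict(list)
    for p in papers:
        by_field[p.field].append(p.citations)
    field_mean = {f: mean(c) for f, c in by_field.items()}
    scores: Dict[str, float] = {}
    for p in papers:
        m = field_mean[p.field]
        # A field with no citations at all gets a score of 0.0.
        scores[p.title] = p.citations / m if m else 0.0
    return scores


# Invented example: biology papers are cited far more than math papers.
papers = [
    Paper("A", "biology", 100),
    Paper("B", "biology", 300),
    Paper("C", "math", 10),
    Paper("D", "math", 30),
]
```

Here papers B and D both score 1.5 even though B has ten times as many raw citations, which is the effect the normalization is meant to have.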


Future of Social - Lili Cheng
- people's groups change over time quite dynamically, so people don't organize them
- the "social network" of your email, embed it within your email.
- Outlook Social Connector released in 2010
- research
- Twigg 
- 500 million updates a day 2010 + 15000 storage requests a second
- montage => (Seattle Traffic, CompanyCrowd) / flipboard-like / huffington post ==> authoring tool for wikipedia-like page
- need to have UIs that inform privacy settings much better in the future
- don't retroactively deal with private data
- Google+: circles need some method and policy for sharing circles

MSRA (Hsiao-Wuen Hon)
- EngKoo
- Bing Translator

Tech Transfer (Jonathan Tien)
- 1. motivation 2. mechanism
- structure: senior program management team + engineering team (dedicated)  <= the "&" in R&D
- tech/demofest as a shopping spree for 1st- or 2nd-tier managers
- buy in from bottom if you want to do top-down tech transfer
- idea > demo source code > components for integration > business and team incubations (EngKoo)
- commitment is the key for tt.  mutual commitment, maintenance
- dict.bing.com.cn, 1M+ daily
* Could be better placed in the context of the literature
* Could be better integrated into the two themes (language learning and cross language communication)
* Cross Project opportunity
* Technical evaluation and with respect to the theme
* Not integrated with respect to the project itself (between Zhao Jun and Min)

* Use Business Development person to do SWOT of market and convince MDA/NRF that funding was correctly utilized
* Patent doesn't make sense from a defensive perspective
Kotaro Minato (Dean, NAIST) <kotaro@is.naist.jp> www.naist.jp
Hideyuki Tokuda (Dean, Keio, Graduate School of Media and Governance) <hxt@sfc.keio.ac.jp> www.ht.sfc.keio.ac.jp
Yeong-Gil Shin <yshin@cse.snu.ac.kr> vplab.snu.ac.kr
James Won-Ki Hong (Div of IT Convergence Engineering) <jwkhong@postech.ac.kr> dpnm.postech.ac.kr/~jwkhong
Chanik Park <cipark@postech.edu> 
Amy Yuexuan Wang (Institute for Interdisciplinary Info Sci) <amywang@tsinghua.edu.cn> iiis.tsinghua.edu.cn
Wen-Guey Tzeng <wgtzeng@cs.nctu.edu.tw>
Jonathan Tien (Innovation Engineering) <jtien@microsoft.com> 
Troy - VP Sales
Luis -

Bkg: John Lin, Mike Phillips (formerly SpeechWorks)
Nuance competitor IVR
ScanSoft / now Nuance

speech rec to mobile

open domain (grammar less?) vs closed domain

Vlingo deals with companies / download apps
license to partners => 

3 layers / partner layer and embedded

watson: performed better and faster, 30% latency (domains/environment) 15% WER 
heavily integrated with watson applications 
uk model as baseline (3wks)
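
For reference, the WER figure quoted above is conventionally the word-level edit distance between the recognizer output and a reference transcript, divided by the number of reference words. The sketch below shows that standard definition; it is not Vlingo's or Watson's evaluation code, and `word_error_rate` is a hypothetical name.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """(substitutions + deletions + insertions) / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

For example, scoring "call jon smith home" against the reference "call john smith at home" counts one substitution and one deletion over five reference words.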

foreign names and/or words in sentences - combined, hierarchical models (fielded?).  Contact, address book
theory - language detection (won't work for word spotting), might 
adaptation to accent - per user or per language model (both), voice profile (3 to 5 utterances): parameterized to tie to the phone's user.
intent engines are for whole communities
intentions to add sex, gender, other customization
NLP: NER how fast, accuracy?

- schema
anthology identifier
multiple venues

- authors
- volumes
- papers
- pdf icon
news reporting bias christiani
linked data
LeToR for keyphrase extraction
adaptor grammar / CRP for URL segmentation / PYP instead of CRP for language data
gpml library
Hang Li SIGIR 08: CRF with edit distance
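
On the note about using a Pitman-Yor process (PYP) instead of a Chinese restaurant process (CRP) for language data: the two differ only by a discount parameter, and the discount produces the heavier power-law tail typical of word frequencies. Below is a minimal sketch of the standard seating scheme, where a discount of 0 recovers the CRP. The function name and parameter names are hypothetical.

```python
import random
from typing import List, Optional


def pitman_yor_seating(n: int, theta: float, discount: float = 0.0,
                       rng: Optional[random.Random] = None) -> List[int]:
    """Seat n customers; return the list of table sizes.

    With K tables and m customers already seated, a new customer joins
    table k with probability (size_k - discount) / (m + theta) and opens
    a new table with probability (theta + discount * K) / (m + theta).
    """
    if not 0.0 <= discount < 1.0 or theta <= -discount:
        raise ValueError("need 0 <= discount < 1 and theta > -discount")
    rng = rng or random.Random()
    tables: List[int] = []
    for _ in range(n):
        if not tables:
            tables.append(1)  # the first customer always opens a table
            continue
        # Unnormalized weights; random.choices normalizes them.
        weights = [size - discount for size in tables]
        weights.append(theta + discount * len(tables))
        choice = rng.choices(range(len(weights)), weights=weights)[0]
        if choice == len(tables):
            tables.append(1)
        else:
            tables[choice] += 1
    return tables
```

Calling `pitman_yor_seating(1000, theta=1.0, discount=0.0)` simulates a CRP, while a discount such as 0.5 typically yields many more small tables for the same number of customers.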