Technical side of Watson, David Boloker

1. CTO, Emerging Technologies, IBM

2. Motivation

2.1. Offload more of the decision maker's tasks to the engine

3. Strategy

3.1. read lots of texts

3.2. analyze subject-verb-object

3.3. build semantic network

3.4. find patterns

3.4.1. officials submit resignations (0.8)
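As a rough illustration of the strategy above, the sketch below counts subject-verb-object triples and reads a pattern score off the relative frequencies. The triples are hand-written placeholders rather than parser output, and `build_network` and `pattern_confidence` are hypothetical names; the talk does not say how Watson computes the 0.8 figure.

```python
from collections import Counter, defaultdict

# Hypothetical, hand-written (subject, verb, object) triples standing in for
# the output of a real parser run over a large text corpus.
TRIPLES = [
    ("officials", "submit", "resignations"),
    ("officials", "submit", "resignations"),
    ("officials", "submit", "resignations"),
    ("officials", "submit", "resignations"),
    ("officials", "submit", "reports"),
    ("inventors", "patent", "inventions"),
]


def build_network(triples):
    """Count how often each object follows a (subject, verb) pair."""
    network = defaultdict(Counter)
    for subj, verb, obj in triples:
        network[(subj, verb)][obj] += 1
    return network


def pattern_confidence(network, subj, verb, obj):
    """Relative frequency of obj among all objects seen for (subj, verb)."""
    counts = network.get((subj, verb))
    if not counts:
        return 0.0
    return counts[obj] / sum(counts.values())


network = build_network(TRIPLES)
score = pattern_confidence(network, "officials", "submit", "resignations")
print(f"officials submit resignations ({score:.1f})")
```

With this toy data, four of the five "officials submit" triples end in "resignations", so the score comes out at 0.8.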

4. Keyword matching isn't enough

4.1. need

4.1.1. temporal reasoning

4.1.2. statistical paraphrasing

4.1.3. geospatial reasoning
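To show why keyword matching falls short, here is a minimal sketch of a temporal-reasoning check: both candidates would match the keyword "president", but only one held office in the year the question asks about. The `TERMS` table and `temporally_consistent` are illustrative assumptions, not part of the system described in the talk.

```python
# Candidate answers with the years they held office.
TERMS = {
    "George Washington": (1789, 1797),
    "Abraham Lincoln": (1861, 1865),
}


def temporally_consistent(candidate, year):
    """True if the candidate was in office during the given year."""
    start, end = TERMS[candidate]
    return start <= year <= end


# Question: "Which president was in office in 1863?"
consistent = [c for c in TERMS if temporally_consistent(c, 1863)]
print(consistent)
```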

5. Architecture

5.1. decompose the question

5.2. run many many searches

5.2.1. our data

5.2.1.1. 2 FTS engines

5.2.1.1.1. Lucene

5.2.1.1.2. Indri

5.3. generate hypotheses

5.4. evidence sourcing

5.4.1. get confidence on hypothesis

5.5. deep evidence scoring

5.6. synthesis

5.7. apply machine learning model to get final confidence

5.7.1. learning from its mistakes

5.7.2. training data is the archive of all historic Jeopardy! games

5.8. output answer & confidence
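The steps above can be sketched end to end on a toy scale. Everything here is an assumption made for illustration: the three-document corpus, the stopword list, the two evidence features, and the logistic weights (which in the talk come from a model trained on past games, not hand-picked values). Plain substring-free word matching stands in for the Lucene and Indri searches.

```python
import math

# Toy corpus standing in for the indexed sources; hypothetical content.
CORPUS = {
    "Paris": "Paris is the capital of France and sits on the Seine",
    "Lyon": "Lyon is a city in France known for its cuisine",
    "Berlin": "Berlin is the capital of Germany",
}

STOPWORDS = {"the", "of", "is", "a", "in", "what", "which", "and", "on"}


def decompose(question):
    """Split the question into content keywords (a stand-in for real parsing)."""
    words = question.lower().strip("?").split()
    return [w for w in words if w not in STOPWORDS]


def search(keywords):
    """Primary search: any document sharing a keyword yields a candidate."""
    return [title for title, text in CORPUS.items()
            if any(k in text.lower().split() for k in keywords)]


def evidence_features(candidate, keywords):
    """Two simple evidence scores: keyword coverage and full-match flag."""
    words = set(CORPUS[candidate].lower().split())
    coverage = sum(k in words for k in keywords) / len(keywords)
    return [coverage, 1.0 if coverage == 1.0 else 0.0]


# Assumed weights; a real system would learn these from training data.
WEIGHTS, BIAS = [3.0, 1.5], -2.0


def confidence(features):
    """Logistic combination of evidence scores into a final confidence."""
    z = BIAS + sum(w * f for w, f in zip(WEIGHTS, features))
    return 1.0 / (1.0 + math.exp(-z))


def answer(question):
    """Return (best candidate, confidence), or (None, 0.0) if nothing matched."""
    keywords = decompose(question)
    ranked = sorted(((confidence(evidence_features(c, keywords)), c)
                     for c in search(keywords)), reverse=True)
    if not ranked:
        return None, 0.0
    return ranked[0][1], ranked[0][0]


best, conf = answer("What is the capital of France?")
print(best, round(conf, 2))
```

All three documents surface as candidates, but only "Paris" covers both keywords, so it ends up with the highest confidence after the logistic step.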

6. Had iterations in architecture, each time improving accuracy

6.1. until they reached 87%

7. Hardware

7.1. 2880 cores

7.1.1. very parallel architecture

7.2. 90 servers

7.3. 16TB RAM

7.4. 20TB disk

8. Challenges

8.1. Real language is full of slang and metaphors

8.1.1. e.g., pun in question

9. History

9.1. PIQUANT

9.2. OpenEuphyra

10. uses UIMA for SOA

10.1. 100 annotators

10.1.1. in

10.1.1.1. Java

10.1.1.2. Prolog

10.2. reveal different kinds of features
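The annotator idea can be pictured as a chain of small, independent functions that each add one kind of feature to a shared analysis record. This is a plain-Python sketch and deliberately not the UIMA API; the three annotators and the record layout are hypothetical.

```python
import re


def new_analysis(text):
    """Shared record that every annotator reads from and writes to."""
    return {"text": text, "features": {}}


def year_annotator(analysis):
    """Temporal feature: four-digit years mentioned in the text."""
    years = re.findall(r"\b(1\d{3}|20\d{2})\b", analysis["text"])
    analysis["features"]["years"] = [int(y) for y in years]


def capitalized_annotator(analysis):
    """Crude named-entity feature: capitalized words after the first word."""
    words = analysis["text"].split()
    analysis["features"]["proper_nouns"] = [
        w.strip(".,?!") for w in words[1:] if w[:1].isupper()
    ]


def length_annotator(analysis):
    """Surface feature: number of whitespace-separated tokens."""
    analysis["features"]["token_count"] = len(analysis["text"].split())


ANNOTATORS = [year_annotator, capitalized_annotator, length_annotator]


def run_pipeline(text):
    """Run every annotator over the same record and return its features."""
    analysis = new_analysis(text)
    for annotate in ANNOTATORS:
        annotate(analysis)
    return analysis["features"]


features = run_pipeline("In 1969 Neil Armstrong walked on the Moon.")
print(features)
```

Because each annotator only touches its own key, new ones can be appended to `ANNOTATORS` without changing the others, which is the property that makes a large annotator set manageable.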

11. High-level architecture

11.1. learn

11.1.1. ingest many sources

11.1.1.1. Wikipedia

11.1.1.2. YAGO2

11.1.1.3. DBpedia

11.1.1.4. WordNet, and many more

11.1.2. store all in RDF store

11.2. Question analysis

11.3. Primary search

11.4. Shallow & deep scoring

11.5. Merging & ranking
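For the "learn" step above, where ingested sources end up in an RDF store, the following sketch shows the underlying idea with a minimal in-memory triple store and wildcard pattern matching. A real RDF store would use IRIs and a query language such as SPARQL; the `TripleStore` class and the three facts are assumptions made for illustration.

```python
class TripleStore:
    """Minimal in-memory store of (subject, predicate, object) triples."""

    def __init__(self):
        self.triples = set()

    def add(self, subject, predicate, obj):
        self.triples.add((subject, predicate, obj))

    def match(self, subject=None, predicate=None, obj=None):
        """Return sorted triples matching the pattern; None is a wildcard."""
        return sorted(
            t for t in self.triples
            if (subject is None or t[0] == subject)
            and (predicate is None or t[1] == predicate)
            and (obj is None or t[2] == obj)
        )


store = TripleStore()
# Hypothetical facts standing in for ingested sources.
store.add("Negev", "locatedIn", "Israel")
store.add("Negev", "type", "Desert")
store.add("Israel", "type", "Country")

countries = [s for s, _, _ in store.match(predicate="type", obj="Country")]
negev_facts = store.match(subject="Negev")
print(countries, negev_facts)
```

Queries of this shape are what let a later scoring step check, for example, whether a candidate answer is actually a country.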

12. Demo

12.1. Audience question

12.1.1. Country that has the largest solar dish?

12.1.1.1. Israel 24%

12.1.1.3. Negev <10%

12.2. Demo machine uses only 7 servers

12.2.1. takes about 20 seconds

13. About