CSCI8380: Advanced Topics in Information Systems
Course content:
The main topic of this course is recent advances in web information systems. We will mainly focus on what the Semantic Web is, and what its advocates believe it will eventually be able to do. You will be introduced to many useful Semantic Web languages and tools. In addition, other critical topics such as web services, text mining, web search, and information fusion will be discussed
Subjects will be introduced by the instructor and by the students in their presentations to the class.
Time & Place:
Mon 12:20 - 1:10 GSRC 306, Tue/Thurs 12:30 - 1:45 Geo 155
Prerequisites:
- Java programming skills and database background
- Strong enthusiasm to work on web information systems and next generation web topics
Course Staff:
Instructor: I. Budak Arpinar
Office: Hardman Hall room 225
Phone: (706) 583-8249
Office Hours: Tuesday/Thursday 2:00 - 3:00 pm
and by appointment.
Topics:
Semantic Web:
- Ontology design, development and quality
- Ontology standards: RDF(S) & OWL(2)
- Storing & querying ontologies (SPARQL)
- Information extraction & document annotation
- Semantic association discovery and ranking (SemDis)
Web Services (WS):
- WS description, discovery and composition
- Semantic WS (METEOR-S)
- Transactional and Quality-driven WS
Text Mining:
- Question Answering
- Document Classification
- Document Summarization
New Information Systems Trends:
- Web-based data integration and interoperability
- Personal data management and user profiling
- Web search and document ranking
- Mobile data management (for iPhone, Android etc.)
Textbook:
Strongly advised - still optional:
- Semantic Web Programming by J. Hebeler, M. Fisher, R. Blace, A. Perez-Lopez, and M. Dean, Wiley, 2009.
Optional:
- A Semantic Web Primer by Grigoris Antoniou, and Frank van Harmelen, 2nd Ed., MIT Press, Cambridge, MA, 2008.
- Semantic Web Technologies: Trends and Research in Ontology-based Systems by John Davies (Editor), Rudi Studer (Co-Editor), Paul Warren (Co-Editor), John Wiley & Sons, July 2006.
- Building an Intelligent Web by R. Akerar, and P. Lingras, Jones and Bartlett Publishers, 2008.
- Towards the Semantic Web by J. Davies, D. Fensel, and F. van Harmelen, Wiley 2005.
- Spinning the Semantic Web by D. Fiensel, J. Hendler, H. Lieberman, and W. Wahlster, The MIT Press, 2003.
- Service-Oriented Computing by M. Singh, and M Huhns, Wiley, 2005.
- Semantic Web for the Working Ontologist by D. Allemang, and J. Hendler, Morgan Kauffman, 2008.
- The Text Mining Handbook by R. Feldman, and J. Sanger, Cambridge Univ. Press, 2007.
- Programming the Semantic Web by T. Segaran, C. Evans, and J. Taylor, O.Reilly Media, 2009.
Grading:
The components of the final grade will be distributed as follows (subject to adjustments):
- Term project (groups of 2) (40%)
- Individual project assignments (%20)
- Research paper reviews/presentations (15%)
- Class participation (%5)
- Exam(s) (20%)
Presentations:
- Introduction to Semantic Web
- Introduction to the Semantic Web (tutorial), Ivan Herman, 2009 Semantic Technology Conference in San Jose, CA, USA.
- The Semantic Web, Tim Berners-Lee, James Hendler and Ora Lassila, May 2001 Scientific American Magazine.
- Linked Data, Tim Berners-Lee, TED2009 ìThe Great Unveilingî in Long Beach, California, USA.
- Hands on Semantic Web, Denny Vrandecic, The 1st ACTIVE Summer School 2009.
- Semantic Web Standards (RDF(S) and OWL)
- Ontology Design and Development
- Ontology Development 101: A Guide to Creating Your First Ontology, Natalya F. Noy and Deborah L. McGuinness (paper, presentation).
- Semantic Web Programming by J. Hebeler: Chapter 2 on Ontology Design, Storage and Alignment.
- Storing and Querying Ontologies (SPARQL)
- SPARQL by Example.
- SPARQLeR: Extended Sparql for Semantic Association Discovery (paper, presentation).
- Jena: A Semantic Web Framework for Java.
- Executing SPARQL Queries over the Web of Linked Data, O. Hartig, C. Bizer, J.-C. Freytag, in ISWC 2009.
- Information Extraction and Document annotation
- Ontology-Driven Automatic Entity Disambiguation in Unstructured Text (paper, presentation).
- SOFIE: A Self-Organizing Framework for Information Extraction, Fabian M. Suchanek, Mauro Sozio, and Gerhard Weikum, WWW 2009, April 20ñ24, 2009, Madrid, Spain.
- A Framework for Schema-Driven Relationship Discovery from Unstructured Text, Ting Wang, Yaoyong Li, Kalina Bontcheva, Hamish Cunningham, and Ji Wang, ESWC 2006, LNCS 4011, pp. 215ñ229, 2006.
- Automatic Extraction of Hierarchical Relations from Text.
- A Framework for Schema-Driven Relationship Discovery from Unstructured Text, ISWC 2006 (paper, presentation).
- Semantic Association Discovery and Ranking (SemDis)
- Semantic Analytics on Social Networks: Experiences in Addressing the Problem of Conflict of Interest Detection (paper, presentation).
- Ranking Complex Relationships on the Semantic Web (paper, presentation).
- NAGA: Searching and Ranking Knowledge.
- Database and information-retrieval methods for knowledge discovery. In CACM 2009.
- Text Mining
- Frontiers of biomedical textmining: current progress.
- GATE: General Architecture for Text Engineering
- Question Answering
- Web Search and Document ranking
- Ranking Documents based on Relevance of Semantic Relationships (paper, presentation).
- Mobile Data Management
- Siri: A Virtual Personal Assistant.
- Collaborative Creation of Semantic POI, Semantic Web Challenge 2009.
- Deciphering Mobile Search Patterns: A Study of Yahoo! Mobile Search Queries, WWW 2008.
- Computers and iPhones and Mobile Phones, oh my!, WWW 2009.
- Structured Data on Semantic Web
- D2R Server: Publishing Relational Databases on the Semantic Web
- Triplify – Light-Weight Linked Data Publication from Relational Databases, WWW2009.
- Web 2.0 and Social Web
- Can Social Bookmarking Improve Web Search?
- From Web 2.0 to Semantic Web: A Semi-Automated Approach (video).
- Live Social Semantics, ISWC 2009 (presentation).
- The Live Social Semantics application: a platform for integrating face-to-face presence with on-line social networking, In: First International Workshop on Communication, Collaboration and Social Networking in Pervasive Computing Environments (PerCol 2010) (video).
- Web Science
- Web Science Emerges, Nigel Shadbolt and Tim Berners-Lee Scientific American, October 2008
- Web Science: An interdisciplinary approach to understanding the World Wide Web, James Hendler, Nigel Shadbolt, Wendy Hall, Tim Berners-Lee, and Danny Weitzner Communications of the ACM (cover story), July 2008
- Foundations and Trends in Web Science: Volume 1, Issue 1 A Framework for Web Science (PDF available for download) By Tim-Berners Lee (MIT), Wendy Hall (University of Southampton), James A. Hendler (Rensselaer Polytechnic University), Kieron O'Hara (University of Southampton), Nigel Shadbolt (University of Southampton) and Daniel J. Weitzner (MIT)
- Creating a Science of the Web, Tim Berners-Lee, Wendy Hall, James Hendler, Nigel Shadbolt, Daniel J. Weitzner SCIENCE VOL 313 11 AUGUST 2006
- Web Science Research Initiative Seminar November 2008
- Web-based Collective Intelligence
- Harnessing Crowds: Mapping the Genome of Collective Intelligence (presentation).
- Thomas W. Malone, Robert Laubacher, Josh Introne, Mark Klein, Hal Abelson, John Sterman, and Gary Olson, The Climate Collaboratorium: Project Overview, CCI Working Paper No. 2009-003
- Harnessing Collective Intelligence to Address Global Climate Change
- Enrichment and Ranking of the YouTube Tag Space and Integration with the Linked Data Cloud, ISWC2009.
- Mapping the World’s Photos, WWW2009.
Small Projects:
Term Projects:
- Ontology Learning (Khalifeh Al Jadda, Niranjan Chapalgaonkar)
- ProCGO: An Ontology for Comparative Genomics on Protein Kinases (Gurinder Gosal, Charula Vijayagopal)
- Semantic Document Ranking (Asmita Rahman, Shasha Liu)
- Context Sensitive Search on Mobile Devices (Frank Wang, Ankur Oberai)
- Relationship Discovery Among People (Johnny Chu, Kitae Myoung)
- Map Based News Events Aggregator (Haseeb Yousaf, Hari Devulapally)
- Ontology Visualization (Chaitanya Guttula, Singaram Sundar)
Student Web Pages:
- Al Jadda, Khalifeh
- Chapalgaonkar, Niranjan
- Chu, Jianan
- Devulapally, Hari
- Gosal, Gurinder
- Guttula, Chaitanya
- Liu, Shasha
- Myoung, Kitae
- Oberai, Ankur
- Rahman, Asmita Abdul
- Sundar, Singaram
- Vijayagopal, Charula
- Wang, Xin
- Yousaf, Haseeb
Late Policy:
Programming work will be submitted electronically, and must be submitted by midnight on the date that it is due. Programming work submitted after the deadline but less than 24 hours late will be accepted but penalized 10%, programming work submitted more than 24 hours but less than 48 hours late will be penalized 20%, and programming work submitted more than 48 hours but less than 72 hours late will be penalized 30%. No programming work will be accepted more than 72 hours late. THIS LATE POLICY WILL BE STRICTLY ENFORCED.
Regrade Policy:
Please send an email message to TA stating that you believe you deserve a project part regrade, and explaining precisely why. Please include your name, and the number of the project part in question.
Programming Work:
Eeach team is expected to submit their own original work. On many occasions it is useful to ask others (the instructor, the TA's, or other students) for hints or debugging help, or to talk generally about programming strategies. Such activity is both acceptable and encouraged, but you must indicate any assistance (human or otherwise - except liserv discussions/TA/instructor help) that you received. In any event, you are responsible for coding, understanding, and being able to explain on your own all project work that you submit. We will pursue aggressively all suspected cases of Honor Code violations, and they will be handled through official University channels. If you have any questions about this policy or about the degree to which we will pursue Honor Code violations, please discuss your concerns with the course staff immediately.
(c) 2023 Ismailcem Budak Arpinar