csH446, Fall 2024

csH446, search engines honors section
James Allan
Fall 2024

This page is for the honors seminar associated with COMPSCI 446. Please see that page for details about the associated course.

In this one-credit honors colloquium for COMPSCI 446, students will explore and discuss topics from the 446 curriculum in greater detail, with an intended focus on contemporary issues related to search engines -- for example, large language models, fairness, and/or explainability. Students will also collaboratively design programming project that builds on the programming project from 446.

All students will produce a final report and may implement the expanded programming project to replace part of that report.

The seminar is under development, but will depend upon the course textbook (free) and also these articles, all of which are available freely online, though some may require that you download them from on campus:

OK. Right now there is only one. Additional readings from open-source and freely available material will be added later.

Meeting times

The honors seminar is tentatively slated to meet Wednesdays from 2:30-3:20pm in a room that is not yet decided.

Grading

Your grade in this class will be based upon completing the readings as demonstrated by class participation in discussions (20%) and your final report (80%). The seminar cannot be taken pass/fail.

Eligibility

This class is open to Commonwealth Honors College junior and senior Computer Science majors only. Students must be concurrently enrolled in COMPSCI 446 or have taken it previously and received a B or better.

If you wish to take this seminar but do not satisfy those requirements (per SPIRE), you can fill out an override form to request consideration.

Collaboration and help

You may discuss the ideas behind assignments with others. You may ask others for help understanding class and search engine concepts. You may study with friends. However...

The work that you do and submit must be your own. It may not be copied from the web, from another student in the class, or from anyone else. If you use any ideas that are not your own and that are not clearly well-known, you must cite the source of the idea, even if it is another person. That is true whether you insert a quotation into your writing or just use the ideas you found. If in doubt, cite. If really in doubt, cite and also ask. Failure to cite in any of these cases is plagiarism and grounds for academic discipline.

LLMs and ChatGPT and friends. What is the implication for using large language models and so on? You have to cite your sources, so you need to cite the LLM/GPT system you used and how the resulting answer is appropriate (as with any source), including the prompt you used to find the information.

Although you may find them helpful to pull together some ideas or get you started, you are strongly discouraged to use LLMs/GPTs to write any of your actual final report. Why? It is already well known that those models hallucinate very reasonable sounding material, so you could end up with garbage. And a bad grade. It is also well known that they were trained on existing work and occasionally just spit it out verbatim, so you could end up "accidentally" plagiarizing. And, you recall, plagiarism is grounds for academic discipline, even if it was LLM/GPT that plagiarized "for" you without your knowledge.