Understanding Code Repos of Implemented Papers

tahir0khalil · January 29, 2024, 4:51pm

Hello all,
I am facing a lot of trouble when I look at repos of implemented papers. When we take a course we mostly work with simple notebooks but when papers are implemented they use a very complex structuring system which I am finding almost impossible to decipher.
My question is how do I get good with this task? Are there any resources which teach how to tackle this issue?
Any help here will be greatly appreciated.
Thanks.

paulinpaloalto · February 8, 2024, 5:00am

It’s a good question, but unfortunately I don’t think there is any easy answer. If it’s any comfort, you are definitely not alone in this. The problem is that academic papers are written with a target audience of other experts in the field. They typically go through a review process, but the reviewers are also domain experts. The reviewers are looking for correctness and completeness of evidence, but are most likely not using accessibility of the explanations to non-experts as one of the review criteria. It’s unfortunate, but you could look at this as being analogous to text books in a field: if you pick up say a math text book for a third year undergraduate course, it assumes you have taken the first and second year courses, so it may well not be understandable to someone who has not taken those prerequisites.

You can get lucky in some cases and the authors are more generous in terms of making more of an effort to make their work accessible to a wider audience. Also in some cases the authors will release their source code on GitHub showing an implementation of their ideas. Of course with source code you also have levels of accessibility: if you’re a software engineer writing code that you know needs to be maintained in a production environment for years, you hold yourself to a higher standard for clarity and maintainability of code than if it’s just a “throwaway” effort to demonstrate some idea. Meaning that just because there is source code doesn’t mean it will be easy to understand.

The best advice I can think of is just not to get discouraged and keep trying. Keep upgrading your knowledge by taking more courses. One positive idea is that the courses here frequently give references to papers. I’m most familiar with DLS and for example in Course 4 about Convolutional Networks, Prof Ng gives references to the papers behind the ideas that he is presenting. I also recall a few cases in which he is specific about encouraging us to look at the paper because it’s more accessible than most. It would be worth checking the references in DLS C4. If you can find some papers there which are more accessible, that might give you a way to get more comfortable in the discipline of reading and understanding papers.

Topic		Replies	Views
How can I implement ML research on code? AI Discussions	1	61	December 16, 2022
Reading Research papers AI Discussions	3	175	May 17, 2021
This is a course for developers -- where is the GitHub repo? GenAI4SD Resources coursera-platform	9	162	October 16, 2024
Lab notebooks available to audit Machine Learning Specialization	3	74	July 25, 2022
Another Deep Learning Resources Focusing on Practical Aspect AI Discussions ai-discussions	1	288	February 23, 2024

Understanding Code Repos of Implemented Papers

Related topics