csv upload feature for academic centers by ankitamk14 · Pull Request #558 · Spoken-tutorial/spoken-website

ankitamk14 · 2025-07-05T05:43:45Z

No description provided.

sunilshetye

added comments. initialise reader and loop only once. build cache if required. do not use bulk query.

sunilshetye · 2025-07-07T10:51:04Z

events/views.py

+            decoded_file = csv_file.read().decode('utf-8')
+            reader = csv.DictReader(io.StringIO(decoded_file))


Suggested change

decoded_file = csv_file.read().decode('utf-8')

reader = csv.DictReader(io.StringIO(decoded_file))

decoded_file = io.TextIOWrapper(csv_file, encoding='utf-8')

reader = csv.DictReader(decoded_file)

This will still read the entire file in memory. Use something like the one suggested.

sunilshetye · 2025-07-07T10:53:58Z

events/views.py

+            row_count = sum(1 for _ in reader)
+            if row_count > MAX_ROWS:
+                messages.error(request, f"CSV has too many rows ({row_count}). Limit is {MAX_ROWS}.")
+                return redirect('events:new_ac')


Remove the row count check here as this is still going to read the entire file.

sunilshetye · 2025-07-07T10:54:58Z

events/views.py

+            messages.error(request, f"CSV columns do not match the expected format. Expected columns: {', '.join(EXPECTED_COLUMNS)}")
+            return redirect('events:new_ac')
+
+        reader = csv.DictReader(io.StringIO(decoded_file))


Don't initialise reader again.

sunilshetye · 2025-07-07T10:57:28Z

events/views.py

+        for idx, row in enumerate(reader, start=2): # Start from 2nd row    
+            _states.add(row.get('state').strip())
+            _universities.add(row.get('university').strip())
+            _institution_types.add(row.get('institution_type').strip())
+            _districts.add(row.get('district').strip())
+            _cities.add(row.get('city').strip())
+
+        # Bulk query
+        states = { s.name.lower(): s for s in State.objects.filter(name__in=_states)}
+        universities = { (u.name.lower(), u.state.name.lower()): u for u in University.objects.filter(name__in=_universities).select_related('state')}
+        districts = { (d.name.lower(), d.state.name.lower()): d for d in District.objects.filter(name__in=_districts).select_related('state')}
+        cities = { (c.name.lower(), c.state.name.lower()): c for c in City.objects.filter(name__in=_cities).select_related('state')}
+        institution_types = { i.name.lower(): i for i in InstituteType.objects.filter(name__in=_institution_types)}


this bulk query is also incorrect. you have to loop only once. you may build the cache in that loop.

sunilshetye · 2025-07-07T10:58:12Z

events/views.py

+                return 1 if val == 'yes' else 0
+            else:
+                return 0
+        reader = csv.DictReader(io.StringIO(decoded_file))


don't initialise the reader again

sunilshetye · 2025-07-07T10:58:38Z

events/views.py

+            else:
+                return 0
+        reader = csv.DictReader(io.StringIO(decoded_file))
+        for idx, row in enumerate(reader, start=2): # Start from 2nd row  


don't specify the start parameter.

ankitamk14 added 2 commits July 5, 2025 11:12

csv upload feature for academic centers

7e15f3c

implemented review feedback

1f7cb3f

sunilshetye requested changes Jul 7, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

csv upload feature for academic centers#558

csv upload feature for academic centers#558
ankitamk14 wants to merge 2 commits intomasterfrom
batch_upload

ankitamk14 commented Jul 5, 2025

Uh oh!

sunilshetye left a comment

Uh oh!

sunilshetye Jul 7, 2025

Uh oh!

sunilshetye Jul 7, 2025

Uh oh!

sunilshetye Jul 7, 2025

Uh oh!

sunilshetye Jul 7, 2025

Uh oh!

sunilshetye Jul 7, 2025

Uh oh!

sunilshetye Jul 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		decoded_file = csv_file.read().decode('utf-8')
		reader = csv.DictReader(io.StringIO(decoded_file))

Conversation

ankitamk14 commented Jul 5, 2025

Uh oh!

sunilshetye left a comment

Choose a reason for hiding this comment

Uh oh!

sunilshetye Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

sunilshetye Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

sunilshetye Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

sunilshetye Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

sunilshetye Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

sunilshetye Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants