Better college attendance? #9
Comments
+1. |
I like the idea. |
Nope nope. We integrate a page where it does a weekly pull on its own. Automated end to end. |
That would be very cool indeed. Let's do this |
Awesome. A new branch then? @libhide how about it? |
Branch from the level-up branch please |
@theSage21 I'm so in! How are we going about this then? |
Let's set up a todo list of sorts. Here's my proposal @libhide @deadbeatfour. Perhaps @utk-dev would want to join in? Other than you guys, I don't know anyone else from college who is on GitHub.
|
I remember doing the data extraction for my analysis of college attendance. I can take that up. Could you guys make up a list of what we want to show? @libhide @deadbeatfour |
If you want data for testing, I would suggest pulling the college data right now. It is in its proper state at the moment, after all the ECAs and medicals, and they will probably remove it soon. That data would be crucial for testing.
|
@anshulabraham done. Available at https://github.com/theSage21/notebooks/tree/master/support. The SSCattendance notebook was used to download this. Actually, I'm thinking of keeping a history of attendance over years. That should lead to some interesting analysis too. |
@anshulabraham |
Of course they did 😝. |
Ha.... Ha.... Ha.... Damn. 😢 See... look how big our tree has grown, bro. |
Anyone working on this feature? I remember @deadbeatfour created a new repo related to this. |
Doubt this needs to be worked on now that the attendance stuff is being done by Koush. We can close this issue, I guess. |
Yeah. I am. I'm doing things in a separate repo first. Integration into the site shouldn't be a problem. The problem with Koush's implementation is VERY slow fetch times, especially for total semester attendance. It pretty much crashes towards the end of the semester. I assume he's calculating totals every time it's requested. |
Oh, of course Koush's approach is shit. sigh. |
The final purpose is to have people use this, right? That means they should be able to quickly find their own attendance. Hashing the name seems OK in the front. Plus, with SQL as the workhorse, aggregating requests per call should be fast. Maybe set it up as a verrrryyyy flat table?
This lets us obtain anyone's attendance with a quick filter->select->aggregate query. Sprinkle some nice JS chart rendering and projections on top of that and voila! Or maybe I'm just getting carried away.. 😄 The real problem would be when to ping Koush's service for updates. A nightly cron job should work fine with PythonAnywhere, unless we're planning to move to some other service. |
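A rough sketch of the flat-table idea above, using Python's built-in `sqlite3`. The column names, the SHA-256 name hash, and the sample rows are all assumptions made for illustration; the table layout originally posted in this comment is not reproduced here.

```python
import hashlib
import sqlite3


def name_hash(name: str) -> str:
    """Hash a normalized student name (SHA-256 is an assumed choice)."""
    return hashlib.sha256(name.strip().lower().encode("utf-8")).hexdigest()


conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE attendance (
        name_hash TEXT NOT NULL,    -- hashed student name
        paper     TEXT NOT NULL,    -- paper (subject) name
        kind      TEXT NOT NULL,    -- 'lecture', 'lab' or 'tutorial'
        month     TEXT NOT NULL,    -- e.g. '2017-08'
        held      INTEGER NOT NULL, -- classes held that month
        attended  INTEGER NOT NULL  -- classes attended that month
    )
    """
)

# Made-up sample rows, one per (student, paper, kind, month).
rows = [
    (name_hash("Student One"), "Algorithms", "lecture", "2017-08", 20, 18),
    (name_hash("Student One"), "Algorithms", "lab", "2017-08", 8, 6),
    (name_hash("Student One"), "Algorithms", "lecture", "2017-09", 22, 20),
    (name_hash("Student Two"), "Algorithms", "lecture", "2017-08", 20, 10),
]
conn.executemany("INSERT INTO attendance VALUES (?, ?, ?, ?, ?, ?)", rows)


def totals_for(name: str):
    """Filter by student, then aggregate held/attended per paper."""
    cur = conn.execute(
        """
        SELECT paper, SUM(held), SUM(attended)
        FROM attendance
        WHERE name_hash = ?
        GROUP BY paper
        ORDER BY paper
        """,
        (name_hash(name),),
    )
    return cur.fetchall()
```

Because the totals come out of a single `GROUP BY`, nothing aggregated has to be stored; an index on `name_hash` would probably be the first thing to add if lookups get slow.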
The idea is to get as fine grained a set of information as possible since when you have aggregated info it's very difficult to get back the fine grained info. |
I was thinking along similar lines. So the structure of the table is what I'm stuck at. 1st and 3rd years have 4 papers per semester. 2nd years have 5. The best we can do is get lecture/labs/tuts attended and held numbers for each paper. 6 columns per paper. And then tutorial groups are separated out by letter like A, B, C and so on. So individual attendance analysis shouldn't be too difficult. I'm thinking about aggregate analysis, if we want to do that. Per paper. Also, arts papers don't have labs, so there would be empty columns there. Stuff like that. So should we just make a table like: Subject 1 name | LD | LA | PH | PA | TH | TA | Subject 2 name | ... Can this be done more elegantly? |
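One possible answer to the "more elegantly" question, sketched in plain Python: reshape each wide record into one row per (student, paper, class kind), so 4 versus 5 papers and missing labs stop mattering. The reading of the codes (LD/LA as lectures held/attended, PH/PA as labs, TH/TA as tutorials) is an assumption, and `to_long` and the sample numbers are hypothetical.

```python
# Assumed mapping from class kind to its (held, attended) column codes.
KINDS = {
    "lecture": ("LD", "LA"),
    "lab": ("PH", "PA"),
    "tutorial": ("TH", "TA"),
}


def to_long(student, papers):
    """Turn one student's wide per-paper records into long rows.

    Each output row is (student, paper, kind, held, attended).
    Kinds a paper does not have (e.g. labs for arts papers) produce
    no row at all instead of an empty column.
    """
    rows = []
    for paper in papers:
        for kind, (held_col, attended_col) in KINDS.items():
            held = paper.get(held_col)
            if held is None:
                continue  # this paper has no classes of this kind
            attended = paper.get(attended_col) or 0
            rows.append((student, paper["name"], kind, held, attended))
    return rows


# Made-up example: one paper with labs, one arts paper without.
papers = [
    {"name": "Algorithms", "LD": 40, "LA": 35, "PH": 12, "PA": 10, "TH": 6, "TA": 6},
    {"name": "History", "LD": 38, "LA": 30, "TH": 5, "TA": 4},
]
long_rows = to_long("s1", papers)
```

Per-paper aggregate analysis then becomes a group-by over the `paper` and `kind` fields rather than arithmetic across a variable number of column blocks.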
Also, if we want to move to Heroku, I think it's a good idea to port the website to the latest libraries? Django 2.0 is out. So I'm thinking in January, we'll rewrite the entire site from the ground up. Basically, make sure a feature works before moving on to the next one? |
That sounds right. When recording datasets we try to make them as flat as possible so that later analysis is not hindered. Slight change to the structure then.
We enforce uniqueness for the To facilitate search we can have the name column in this table point to a different table containing people's names/courses/food habits etc. |
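A small sketch of the lookup-table part of this suggestion, again with `sqlite3`. The table and column names, the sample people, and the `search` helper are hypothetical, and the uniqueness constraint is left out because the columns it applies to are not stated above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves this off by default
conn.executescript(
    """
    CREATE TABLE people (
        id     INTEGER PRIMARY KEY,
        name   TEXT NOT NULL,
        course TEXT NOT NULL
    );
    CREATE TABLE attendance (
        person_id INTEGER NOT NULL REFERENCES people(id),
        paper     TEXT NOT NULL,
        held      INTEGER NOT NULL,
        attended  INTEGER NOT NULL
    );
    """
)

# Made-up sample data.
conn.executemany(
    "INSERT INTO people VALUES (?, ?, ?)",
    [(1, "Student One", "BSc Computer Science"), (2, "Student Two", "BA History")],
)
conn.executemany(
    "INSERT INTO attendance VALUES (?, ?, ?, ?)",
    [(1, "Algorithms", 40, 35), (2, "History", 38, 30)],
)


def search(name_fragment: str):
    """Find attendance rows for people whose name contains the fragment."""
    cur = conn.execute(
        """
        SELECT p.name, p.course, a.paper, a.held, a.attended
        FROM attendance AS a
        JOIN people AS p ON p.id = a.person_id
        WHERE p.name LIKE ?
        ORDER BY p.name, a.paper
        """,
        (f"%{name_fragment}%",),
    )
    return cur.fetchall()
```

Keeping names and courses in `people` means the attendance rows stay narrow, and extra personal attributes can be added later without touching the attendance table.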
The more I use heroku the more I like it over pythonanywhere. PyAnywhere's appeal now to me is the ability to share a terminal over network. For web serving we could pick up heroku as it takes away a lotttt of pain regarding balancing and hosting. |
@theSage21 how about automating the process of deployment? Right now we have to do it manually every time the master branch gets something new. |
With Heroku you can set up deploy hooks. No issue there. On PythonAnywhere we can set up a cron job which pulls from master at midnight every day. |
Maybe we could also set up Travis CI with some tests. Deployment on heroku is just so much more convenient. Also, I was thinking we could set a scheduled job using python itself? Something like celery with RabbitMQ? Thoughts? @theSage21 |
Scheduled job for deployment? I think heroku auto deploy would be a better way for that. Clean and more explicit. |
I meant let's use celery for pulling the data nightly. Heroku auto deploys got me hooked. |
Ah accha. That sounds good. |
I've lost touch with Django to be honest. You guys keeping up with it? I usually use bottle now. |
I used bottle for a project recently and it's just so easy. Haven't written django code in months. But I'll dive back in soon. |
Auto-deploy should not be an issue with either PyAnywhere or Heroku. That said, Heroku is definitely the way to go. Migrate ASAP. |
It would be cool to pull data from the college site and perform attendance analysis on it for public display on the Compsoc site.