pictuga
b66ac2bc5e
Make it possible not to use caching
2014-05-24 19:13:41 +02:00
pictuga
25fdca4bf0
Add do-it-all function
...
For quick lib use
2014-05-24 19:02:22 +02:00
pictuga
26c91070f5
Time-based Cache
...
Solves the :proxy issue for good. More convenient, more flexible
2014-05-24 19:01:21 +02:00
pictuga
5e64696031
Fix '/morss.py/' url fixer
2014-05-22 22:53:36 +02:00
pictuga
364fbc4ba6
Remove apparent limit
...
Cause no longer works, cause of all-bool args introduced earlier
2014-05-22 22:52:49 +02:00
pictuga
b03d865b7b
Get rid of ParseOptions()
...
That thing wasn't nice, and depended too much on the various use case. The new approach is to turn morss into a library and turn the use cases into some pre-implemented lib usages
2014-05-22 22:44:59 +02:00
pictuga
3c48c58127
Remove useless HOLD var
...
Was needed in DEBUG at some point
2014-05-21 12:19:49 +02:00
pictuga
e8e7f170a6
Include super dumb http file server
...
For index.html, other files can be added, but everything has to be hard-coded (mimetype included)
2014-05-18 12:34:23 +02:00
pictuga
c41a1fe226
Support for wikipedia fetured articles feed
...
Should work with most wikipedias
2014-05-18 12:17:14 +02:00
pictuga
d8a3c4e9af
Add support for Google News
2014-05-18 11:58:45 +02:00
pictuga
bbf1ffbb15
Remove 'persistent' and 'dic' arg in Cache
...
'dic' was mostly intended for facebook now-bygone advanced buggy token storage. 'persistent' was needed by fb and 'proxy' mode, but a small workaround was found for the proxy mode (basically making sure the cache object is always at least 5-item long)
2014-05-15 00:54:40 +02:00
pictuga
76e7f1ea00
Try to use more generic 302/303 redirections
...
Still far from being great, but at least I can use it on both morss.it and test.morss.it now
2014-05-14 15:05:14 +02:00
pictuga
031b67a8db
Remove some useless options
...
progress and a accidentaly-disclosed one, cause useless
2014-05-14 15:03:40 +02:00
pictuga
974bad7974
Fix and strip down facebook
...
Remove unstable non-working facebook semi-automatic token renewal (a simple warning on morss.it should be enough). Also commited some forgotten stuff.
2014-05-14 15:01:41 +02:00
pictuga
b7136f2056
Pull iTunes raw feed out of iTunes url
...
This iTunes thinggy somehow qualifies as yet-another-apple-tech-rape: just some old tech behind iron curtains…
2014-05-12 23:15:51 +02:00
pictuga
d8074d6b6d
Redirect google translate links to original link
...
Cause anyway Google Translate isn't scrappable. So it's better to have at least some content.
2014-03-22 20:53:33 +01:00
pictuga
a4cf5e0daa
Google link cleaner now works on all .dot versions
2014-03-22 20:52:25 +01:00
pictuga
c94ef92131
Fix Facebook support
...
Now token is grabbed directly by the server, and sent back by means of a cookie. This does unify token "creation" and renewal.
2014-02-21 14:36:06 +01:00
pictuga
a1f5c3db3a
Have .csv files be downloaded
...
So that users can open it in LibreOffice/OpenOffice/Word without having to save it to disk beforehands
2014-02-05 00:37:12 +01:00
pictuga
6c33bb6e1c
Safer Cache saving
...
Create tmp file and then move it to destination. Avoids corrupt files during write
2014-01-29 20:36:45 +01:00
pictuga
6eaec96af7
Keep "dic" param in Cache.new
2014-01-22 15:56:08 +01:00
pictuga
4e549dc88a
Change lim/max settings only for current "run"
2014-01-19 23:36:41 +01:00
pictuga
0f7bc568e4
Send CGI HTTP headers earlier
...
So that browsers show that sth is going on
2014-01-15 21:02:47 +01:00
pictuga
4d6ef92504
Separate function for output. Add csv
2014-01-13 00:10:57 +01:00
pictuga
7fbe728f93
Feeds: allow json, csv export
...
Uses OrderedDict
2014-01-13 00:08:03 +01:00
pictuga
ec55f5e856
Use smarter order for RSS.dict
2014-01-13 00:07:04 +01:00
pictuga
3d78cfb638
Fix HTTP bug when returning empty page
2014-01-11 18:21:37 +01:00
pictuga
840b0b1ded
Remove yet another silly log message
2014-01-11 18:18:02 +01:00
pictuga
8209f243bb
Fix rss-redirection code
...
And add log, which was lost when splitting functions (which made this fix needed)
2014-01-11 18:15:36 +01:00
pictuga
3b3ac4c8a6
Remove batch of useless imports
2014-01-11 17:31:27 +01:00
pictuga
5feb061bf7
First attempt at decent folder structure
...
Use setup.py, subfolder for code.
2014-01-11 17:11:57 +01:00
pictuga
851dacdfbc
Renamed to .py.
2013-04-04 18:17:12 +02:00
pictuga
6783bbf992
Improved shebang.
2013-04-04 17:56:37 +02:00
pictuga
82084c2c75
Move to OOP.
...
This is a huge commit. The whole code is ported to Object-Oritented Programming. This makes the code cleaner, which became required to deal with all the different cases, for example with encoding detection. Encoding detection now works better, and uses 3 different methods. HTML pages with an xml declaration are now supported. Feed urls with parameters (eg. "index.php?option=par") are also supported. Cache is now smarter, since it no longer grows indefinitely, since only in-use pages are kept in the cache. Caching is now mandatory. urllib (not urllib2) is no longer needed. Solved a possible crash with log function (when passing list of str with non-unicode encoging).
README is also updated.
2013-04-04 17:43:30 +02:00
pictuga
05b5bc7783
Catch extra errors (timeout).
2013-03-29 20:06:31 +01:00
pictuga
6f6c5fbaad
Faster xml cleaning
2013-03-01 14:26:51 +01:00
pictuga
e305f387ab
Hopefully fixed encoding issues
...
with the dirtiest trick out there...
2013-02-27 15:12:32 +01:00
pictuga
ed8a45875c
Default to "//h1/.." since most website use it
...
because it is said to be good for SEO. Debug now requires env variable "DEBUG" to be set to something else than "".
2013-02-25 21:36:02 +01:00
pictuga
d39604c453
Support for cookies added
...
NYT needs them
2013-02-25 20:53:59 +01:00
pictuga
d6179a734f
Clearer debug info
2013-02-25 20:53:22 +01:00
pictuga
eb63ce3f4f
Handle more errors
2013-02-25 18:32:23 +01:00
pictuga
b63f91a151
Added cache, easier debug
2013-02-25 18:01:59 +01:00
pictuga
51fe6ce81b
First commit
2013-02-25 15:50:32 +01:00