7504cfed2a
Add 'theforce' option to redownload xml
...
Useful for debugging
2013-11-03 13:29:35 +01:00
5737adc253
Improve Cache()
...
Add the ability to create new Cache() objects from the current one (in the same directory). Allow the storage of int() (via str()). Don't store the key in the feed (useless, and not keeping it might make this service a little more less privacy-infrighting).
2013-11-03 13:26:56 +01:00
09ba8ca644
Use HOLD in log()
...
So as not to write to the output when using on an HTTP server, during the writing of headers
2013-11-03 13:24:27 +01:00
97acdd9530
Custom "getopt" as Class
...
Easier to use, an can now take value:
* CLI: morss key=value http://...
* HTTP: http://path/to/morss/:key=value/http://...
2013-11-03 13:18:09 +01:00
53c9b07d19
Split Fill() into Fix and Fill
...
Fix improves links and suck things
2013-10-22 20:55:24 +02:00
71f20c8b29
Don't fill facebook for now
2013-10-21 21:31:02 +02:00
8a5202efd5
Improve facebook and feedburner workarounds
2013-10-21 21:30:31 +02:00
1f40f2a099
Add support for JSON APIs in feedify
2013-10-21 21:28:43 +02:00
1802827d31
Fix decoding
...
Was dropped in previous commit by mistake
2013-10-21 21:24:19 +02:00
a1f9fe24c8
Use one single urllib2 handler
...
Makes feedify on html pages more powerful
2013-10-21 21:17:52 +02:00
76b81a31ec
Drop desc by default
...
Arguably the reason is that users tend to believe the program does nothing, cause Firefox (and others?) only shows the description in the feed preview. The "keep" parameter keeps the description in place.
2013-10-02 12:06:14 +02:00
ae22104831
Center "* * *" in clip mode
2013-10-02 12:05:52 +02:00
145f59b546
Fix already-filled feeds
...
By pulling the content from "content" when needed
2013-10-01 19:50:28 +02:00
ae5e947417
Add support for facebook, google links
2013-10-01 19:49:53 +02:00
cf5040020e
Use desktop browser UA for xml
...
Convenient for rss links detection and "feedify"
2013-10-01 19:47:45 +02:00
e3c1cd8619
Accept more types as "text" before readability
2013-10-01 19:47:06 +02:00
78706952fe
Remove "clip" from Fill
...
Put that in Gather. Also removed from feeds.py. "alone" mode was also added (it removes the description).
2013-10-01 19:45:54 +02:00
1b7fe8fbee
Use "options" in Gather instead of "progress"
...
Also made it possible to toggle Fill's toggle through parameters
2013-09-29 15:32:58 +02:00
a5a327388a
Add ability not to fetch an item's article
2013-09-25 13:47:05 +02:00
0657077191
Add support for twitter
...
Grabs "feed" from the html page, clips tweet and article together.
2013-09-25 12:37:14 +02:00
da14242bcf
Add feedify, and use it in morss
2013-09-25 12:36:21 +02:00
9bc4417be3
More flexible xml caching
...
New includes a 'type' var, to remember what we did out of it (normal, nothing, grabbed xml link, etc). xml/html mimetype are now saved in a dict, for easier editing, and consistency.
2013-09-25 12:32:40 +02:00
edff54a016
Add pushContent in feeds.py
...
Useful for twitter (later) for it's "clip" toggle, which keeps the original desc/content above the article. Makes changing the content, while keeping the original stub in place, easier.
2013-09-25 12:18:22 +02:00
208d70d3db
Use separate var in Fill for final url
...
That way the url can be changed altogether for the article-fetching part, without changing the item link itself. Useful for upcoming twitter feeds.
2013-09-25 11:51:48 +02:00
fd1501a0c0
Check relative url earlier
2013-09-25 11:49:45 +02:00
1e621099e0
Log cache hash in Gather
2013-09-25 11:15:11 +02:00
3d6d7e70b6
Remove useless "as" in error catch
2013-09-25 11:14:22 +02:00
e73cbf56c2
Add 'html' option, usefull to see error on server
2013-09-25 11:13:33 +02:00
03014a8cbf
Typo in UA_HTML var name
2013-09-25 11:11:11 +02:00
4a5cbcfd18
Move httplib in common code
...
Needed for error catch
2013-09-25 11:10:16 +02:00
3fd34ff1a6
decodeHTML works without connection object
2013-09-25 11:08:58 +02:00
658f51e5a9
Support feeds handed out as text/html
...
<http://www.pro-linux.de/rss/index1.xml> and <http://tehrantimes.com/index.php?option=com_ninjarsssyndicator&feed_id=1&format=raw> are on an equal footing…
2013-09-16 00:33:24 +02:00
8eb2f7c249
Added another letter to feedsportal table
2013-09-15 19:38:59 +02:00
23246ca6c1
Save the key in cache file
2013-09-15 19:20:51 +02:00
1b7777c331
Find RSS links within html pages' <head>
...
And cache those links
2013-09-15 19:19:50 +02:00
1bd17f1365
Faster relative link resolution
2013-09-15 19:18:39 +02:00
7575291f8f
Log url in Gather
...
Useful for upcoming commits
2013-09-15 18:53:35 +02:00
532852a408
Use cleaner http error catch
...
One error type was inheriting from another one
2013-09-15 18:52:34 +02:00
43bf021f23
Catch more http exceptions
...
Such as InvalidURL. Subclasses of httplib.HTTPException
2013-09-15 17:19:33 +02:00
9252e75923
Ensure var in parseOptions are defined
...
Caused a bug on morss.it's server
2013-09-15 15:56:08 +02:00
75b51fc2c2
Add ability to bypass ETag support
...
Add the ":force" argument over http to bypass ETag support, which is convenient to debug code
2013-09-15 15:54:42 +02:00
d2de6cf23d
Extra doc for DELAY, for xml cache, & now for etag
2013-09-15 15:45:15 +02:00
89187ab6a6
Log generation time
2013-09-15 15:44:25 +02:00
04840d9843
More flexible parameters can be passed
...
Multiple parameters can now be passed. HTTP "API" has been improved, and url now have to be like "http://<path to morss>/:<param1>:<param2>/<url>". The code handling the parameters parsing is now way cleaner. Debug toggle is now a var, which can be changed with parameters. Also http logging is no longer done into a file, which tended to grow way too fast, while lacking an "error 403 protection", but instead the parameter ":debug" can be passed in the url, and the page will be delivered as "text/plain" with the debug written into it. Therefore some logging had to be moved around, so as not to output anything during http headers definition.
2013-09-15 15:38:03 +02:00
c25aec7107
Only perform <meta> redirects on html pages
2013-09-15 15:33:14 +02:00
3ba74649f6
Test if linked pages are text documents
...
Useful for feeds such as HackerNews
2013-09-10 15:25:55 +02:00
5ebd84ee55
Fix broken feeds.py calls for items count
2013-09-08 15:47:15 +02:00
d3c163fb74
Use ETag for user-side caching
...
Pretty hard-code ETag use. ETag is just a timestamp, and the server checks whether it's recent enough.
2013-08-24 23:43:32 +02:00
e2c3375eb6
Log url earlier
...
Now logging it in both use cases
2013-08-24 23:41:40 +02:00
0c6e28205a
Use seconds for every parameter
2013-08-24 23:40:37 +02:00