|
a1dc96cb50
|
feeds: remove mimetype from function call as no longer used
|
2020-04-28 22:07:25 +02:00 |
|
|
818cdaaa9b
|
Make it possible to call sub-libs in non-interactive mode
Run `python -m morss.feeds http://lemonde.fr` and so on
|
2020-04-27 18:00:14 +02:00 |
|
|
2806c64326
|
Make it possible to directly run sub-libs (feeds, crawler, readabilite)
Run `python -im morss.feeds http://website.sample/rss.xml` and so on
|
2020-04-27 17:19:31 +02:00 |
|
|
59ef5af9e2
|
feeds: fix bug when deleting attr in html
|
2020-04-24 22:12:05 +02:00 |
|
|
325a373e3e
|
feeds: add SyntaxError catch
|
2020-04-20 16:15:15 +02:00 |
|
|
4ce3c7cb32
|
Small code clean ups
|
2020-04-19 12:50:05 +02:00 |
|
|
7375adce33
|
sheet.xsl: fix & improve
|
2020-04-15 23:34:28 +02:00 |
|
|
fe82b19c91
|
Merge .xsl & html template
Turns out they somehow serve a similar purpose
|
2020-04-15 22:30:45 +02:00 |
|
|
8e5e8d24a4
|
Timezone fixes
|
2020-04-10 20:33:59 +02:00 |
|
|
9e7b9d95ee
|
feeds: properly use html template
|
2020-04-09 20:00:51 +02:00 |
|
|
987a719c4e
|
feeds: try all parsers regardless of contenttype
Turns out some websites send the wrong contenttype (json for html, html for xml, etc.)
|
2020-04-09 19:17:51 +02:00 |
|
|
3c7f512583
|
feeds: handle several errors
|
2020-04-09 19:09:10 +02:00 |
|
|
b0f80c6d3c
|
morss: fix csv output encoding
|
2020-04-09 19:05:50 +02:00 |
|
|
f3d1f92b39
|
Detect encoding every time
|
2020-04-07 10:38:36 +02:00 |
|
|
a09831415f
|
feeds: fix bug when mimetype matches nothing
|
2020-04-06 18:53:07 +02:00 |
|
|
aad2398e69
|
feeds: turns out lxml.etree doesn't have drop_tag
|
2020-04-05 21:50:38 +02:00 |
|
|
568e7d7dd2
|
feeds: make BS's output bytes for lxml's sake
|
2020-04-05 20:46:04 +02:00 |
|
|
40c69f17d2
|
feeds: parse html with BS
More robust & to make it consistent with :getpage
|
2020-04-05 16:12:41 +02:00 |
|
|
4d785820d9
|
feeds: ignore provided stylesheets and add ours
Provided sheets usually create errors. Ours is (hopefully) more informative for users not familiar with RSS feeds
|
2020-03-20 15:32:44 +01:00 |
|
|
6a01fc439e
|
feeds: better handle "empty" datetime
|
2020-03-20 12:30:42 +01:00 |
|
|
dd2651061f
|
feeds & morss: clean up comments/empty lines
|
2020-03-20 12:25:48 +01:00 |
|
|
912c323c40
|
feeds: make function output more consistent
e.g. setters return nothing, getters return something relevant or None (i.e. no empty strings)
|
2020-03-20 12:23:15 +01:00 |
|
|
5705a0be17
|
feeds: fix delete/rmv code
|
2020-03-20 12:22:07 +01:00 |
|
|
4735ffba45
|
feeds: fix .convert auto-convert
To fix inheritance loophole
|
2020-03-20 12:20:41 +01:00 |
|
|
08e39f5631
|
feeds: give simpler name to helper functions
|
2020-03-20 12:20:15 +01:00 |
|
|
765a43511e
|
feeds: remove unused import
|
2020-03-20 12:19:08 +01:00 |
|
|
5865af64f9
|
Fix indent output for html/xml
|
2020-03-20 12:18:13 +01:00 |
|
|
d12d44a500
|
Remove unused hash-bangs
Leftovers from debugging
|
2020-03-19 15:06:28 +01:00 |
|
|
ee8c57c1fc
|
feeds: avoid convert to self
|
2020-03-19 12:54:04 +01:00 |
|
|
bda51b0fc7
|
feeds & morss: many encoding/tostring fixes
|
2020-03-19 12:53:25 +01:00 |
|
|
c09b457168
|
feeds: fix .dic code
Bug introduced recently…
|
2020-03-19 11:36:20 +01:00 |
|
|
b47e40246c
|
feeds: clean up html code handling
|
2020-03-19 11:35:51 +01:00 |
|
|
9cf933723f
|
feeds: clean up time handling
Includes a shameful fix on @property
|
2020-03-19 11:35:02 +01:00 |
|
|
f48961a7e4
|
feeds: small code cleanups
|
2020-03-19 10:13:22 +01:00 |
|
|
d3fe51cea5
|
feeds: remove duplicated code
|
2020-03-19 09:49:52 +01:00 |
|
|
449bc3c695
|
feeds: fix handling of html code
|
2020-03-19 09:48:53 +01:00 |
|
|
13ea52ef80
|
feeds: add .torss()
|
2020-03-19 09:47:58 +01:00 |
|
|
aa2b56b266
|
feeds: small code cleanups
|
2020-03-19 09:47:17 +01:00 |
|
|
4a70aa9dfa
|
feeds: auto-parse()
|
2020-03-18 16:34:40 +01:00 |
|
|
c2f85da94a
|
feeds: add html support, adapt .tohtml()
|
2020-03-18 16:33:10 +01:00 |
|
|
e3528a8f36
|
feeds: use FeedJSON for .tojson()
Clean up related code
|
2020-03-18 16:31:36 +01:00 |
|
|
2dd9ae202d
|
feeds: add 'mode' to Parsers
|
2020-03-18 16:24:08 +01:00 |
|
|
8e3d32f24c
|
feeds: rename 'ruleset' into 'default_ruleset'
Better reflects its use
|
2020-03-18 16:22:03 +01:00 |
|
|
6ce616106b
|
feeds: disable 'multi' ruleset
RSS ruleset has been cut into 4 rulesets
|
2020-03-18 16:20:42 +01:00 |
|
|
e9d46cb6a9
|
feeds: move mimetypes into .py from .ini
|
2020-03-18 16:08:42 +01:00 |
|
|
e460fdc8f4
|
feeds: remove little-used properties
id, is_permalink
Also a good opportunity to trash the unreliable bool-parsing code
|
2020-03-17 16:46:42 +01:00 |
|
|
1e714ab34b
|
feeds: add ability to convert to another type of feed
|
2020-03-17 14:02:24 +01:00 |
|
|
4ba4d73ce6
|
feeds: add json support
|
2020-03-17 14:02:01 +01:00 |
|
|
6d5aa8c222
|
feeds: clean up .append()
|
2020-03-17 13:59:51 +01:00 |
|
|
f10727f94a
|
feeds: small code cleanups
|
2020-03-17 12:26:34 +01:00 |
|