Christian Weiske [Fri, 11 Nov 2016 19:54:12 +0000 (20:54 +0100)]
status page
Christian Weiske [Thu, 10 Nov 2016 19:52:35 +0000 (20:52 +0100)]
add log class
Christian Weiske [Thu, 10 Nov 2016 14:22:05 +0000 (15:22 +0100)]
pager: move next and prev links to the outside for easier clicking
Christian Weiske [Thu, 10 Nov 2016 14:13:51 +0000 (15:13 +0100)]
add command to shut down a worker
Christian Weiske [Wed, 9 Nov 2016 20:46:05 +0000 (21:46 +0100)]
properly handle noindex pages
Christian Weiske [Mon, 7 Nov 2016 20:41:36 +0000 (21:41 +0100)]
Big patch merging crawling+indexing into one command, new json document structure
Christian Weiske [Sun, 6 Nov 2016 16:16:15 +0000 (17:16 +0100)]
setup: check json before dropping current index
Christian Weiske [Fri, 2 Sep 2016 16:05:00 +0000 (18:05 +0200)]
Make title configurable
Resolves: #11
Christian Weiske [Fri, 2 Sep 2016 16:04:30 +0000 (18:04 +0200)]
Link github
Christian Weiske [Fri, 2 Sep 2016 16:01:58 +0000 (18:01 +0200)]
Support multiple "nick:" terms in search field
Resolves: #17
Christian Weiske [Fri, 2 Sep 2016 15:54:15 +0000 (17:54 +0200)]
performance debug timer
Christian Weiske [Fri, 2 Sep 2016 13:20:17 +0000 (15:20 +0200)]
Fix chat log links
Resolves: #16
Christian Weiske [Fri, 2 Sep 2016 09:01:28 +0000 (11:01 +0200)]
massively improve crawl speed by ditching "exists" queries
Christian Weiske [Thu, 1 Sep 2016 18:36:23 +0000 (20:36 +0200)]
micro optimization for "exists" ES queries
Christian Weiske [Thu, 1 Sep 2016 06:11:44 +0000 (08:11 +0200)]
Make search result hit template configurable, add chat template
Resolves: #9
Christian Weiske [Thu, 1 Sep 2016 05:47:49 +0000 (07:47 +0200)]
Always show text, make text extract size configurable.
Resolves: #8
Christian Weiske [Thu, 1 Sep 2016 05:38:08 +0000 (07:38 +0200)]
remove anchor from source URLs
Christian Weiske [Tue, 30 Aug 2016 19:37:50 +0000 (21:37 +0200)]
tell why crawler stops
Christian Weiske [Tue, 30 Aug 2016 11:35:05 +0000 (13:35 +0200)]
Add crawlBlacklist configuration option
Resolves: #7
Christian Weiske [Tue, 30 Aug 2016 11:10:03 +0000 (13:10 +0200)]
Allow worker instances of multiple projects in parallel
Change "queuePrefix" configuration in each project
Resolves: #5
Christian Weiske [Tue, 30 Aug 2016 11:05:14 +0000 (13:05 +0200)]
Fix notice
Christian Weiske [Tue, 30 Aug 2016 11:03:26 +0000 (13:03 +0200)]
Make phinde-worker configurable; allow queue selection
Resolves #6
Christian Weiske [Tue, 30 Aug 2016 06:13:33 +0000 (08:13 +0200)]
Option to disable linked URL indexing
Resolves: #2
Christian Weiske [Tue, 30 Aug 2016 06:05:00 +0000 (08:05 +0200)]
Add support for modification date queries: "before:", "after:" and "date:"
Resolves: #4
Christian Weiske [Tue, 30 Aug 2016 05:36:34 +0000 (07:36 +0200)]
Support "nick:cweiske" search syntax as alias for "author.name"
Resolves: #3
Christian Weiske [Mon, 29 Aug 2016 20:59:16 +0000 (22:59 +0200)]
Respect <meta name="robots" content="noindex"/>
Fixes: #1
Christian Weiske [Mon, 29 Aug 2016 18:30:45 +0000 (20:30 +0200)]
Send If-Modified-Since header on crawling and indexing
Christian Weiske [Thu, 26 May 2016 13:20:23 +0000 (15:20 +0200)]
add LICENSE file
Christian Weiske [Thu, 31 Mar 2016 18:46:01 +0000 (20:46 +0200)]
wip pubsubhubbub
Christian Weiske [Fri, 12 Feb 2016 16:04:42 +0000 (17:04 +0100)]
opensearch paging
Christian Weiske [Fri, 12 Feb 2016 06:43:25 +0000 (07:43 +0100)]
trim query string
Christian Weiske [Thu, 11 Feb 2016 21:43:34 +0000 (22:43 +0100)]
opensearch support
Christian Weiske [Thu, 11 Feb 2016 19:02:30 +0000 (20:02 +0100)]
support base href
Christian Weiske [Thu, 11 Feb 2016 16:37:12 +0000 (17:37 +0100)]
sanitize title better
Christian Weiske [Thu, 11 Feb 2016 16:00:58 +0000 (17:00 +0100)]
use correct meta robots attribute
Christian Weiske [Thu, 11 Feb 2016 07:43:01 +0000 (08:43 +0100)]
debug option for crawler
Christian Weiske [Wed, 10 Feb 2016 21:02:11 +0000 (22:02 +0100)]
add date sorting
Christian Weiske [Wed, 10 Feb 2016 20:15:35 +0000 (21:15 +0100)]
remove debug statement
Christian Weiske [Wed, 10 Feb 2016 16:26:15 +0000 (17:26 +0100)]
crawler supports "nofollow" now
Christian Weiske [Wed, 10 Feb 2016 16:09:56 +0000 (17:09 +0100)]
send accept header during crawl
Christian Weiske [Wed, 10 Feb 2016 14:14:34 +0000 (15:14 +0100)]
some styling, noindex for search result pages
Christian Weiske [Wed, 10 Feb 2016 13:56:20 +0000 (14:56 +0100)]
rework crawler; add atom link extraction
Christian Weiske [Sat, 6 Feb 2016 19:27:58 +0000 (20:27 +0100)]
about section readme
Christian Weiske [Fri, 5 Feb 2016 05:48:45 +0000 (06:48 +0100)]
add site GET parameter
Christian Weiske [Thu, 4 Feb 2016 22:59:52 +0000 (23:59 +0100)]
default config
Christian Weiske [Thu, 4 Feb 2016 22:58:00 +0000 (23:58 +0100)]
do not exit on null query
Christian Weiske [Thu, 4 Feb 2016 22:55:41 +0000 (23:55 +0100)]
check for content attributes
Christian Weiske [Thu, 4 Feb 2016 22:46:45 +0000 (23:46 +0100)]
remove multiple tags
Christian Weiske [Thu, 4 Feb 2016 16:23:14 +0000 (17:23 +0100)]
do not show filter headline if there are none
Christian Weiske [Thu, 4 Feb 2016 16:20:23 +0000 (17:20 +0100)]
show query time
Christian Weiske [Thu, 4 Feb 2016 16:12:14 +0000 (17:12 +0100)]
change default query operator to AND
Christian Weiske [Thu, 4 Feb 2016 16:10:49 +0000 (17:10 +0100)]
Show site search reset link
Christian Weiske [Thu, 4 Feb 2016 15:58:33 +0000 (16:58 +0100)]
escape html in search results
Christian Weiske [Wed, 3 Feb 2016 21:37:15 +0000 (22:37 +0100)]
fix indexing, boost config
Christian Weiske [Wed, 3 Feb 2016 21:18:52 +0000 (22:18 +0100)]
no simplexml anymore, content extraction improvements
Christian Weiske [Wed, 3 Feb 2016 20:25:34 +0000 (21:25 +0100)]
follow redirect, do not verify ssl certificates, use final after-redirect url
Christian Weiske [Wed, 3 Feb 2016 20:12:17 +0000 (21:12 +0100)]
add site search, highlighting
Christian Weiske [Wed, 3 Feb 2016 19:03:35 +0000 (20:03 +0100)]
show elasticsearch query time
Christian Weiske [Wed, 3 Feb 2016 16:23:06 +0000 (17:23 +0100)]
filtering works
Christian Weiske [Wed, 3 Feb 2016 05:21:30 +0000 (06:21 +0100)]
first frontend
Christian Weiske [Mon, 1 Feb 2016 19:18:59 +0000 (20:18 +0100)]
first kinda working version