Commit Graph

131 Commits

Author SHA1 Message Date
Aaron Boodman 3adab7a874 picasa/find-photos (#2714)
* Introduce picasa/find-photos

* npm test
2016-10-19 09:07:49 -07:00
cmasone-attic 23f94bdf8c demo-server: Remove cachingReadThroughChunkStoreFactory (#2689)
demo-server was using a read-through cache to allow it to serve more
concurrent requests more quickly by reducing disk I/O. As seen in issue
across Databases, leading to incorrect sync results in some instances.

Since we're not worried about demo-server load right now, simply delete
the cache.

Fixes #2688
2016-10-10 14:14:54 -07:00
Aaron Boodman fe922da8ed photo-index: tag commit with run date (#2687)
Would be nice to also tag with pinned path, but that's more work
2016-10-08 01:03:36 -07:00
Ben Kalman 4a562d81ef Add tag and face counts to the photo index (#2660) 2016-10-03 11:52:42 -07:00
Eric Halpern 574bdd8483 Make config.Resolve methods public and make photo-index .nomsconfig aware (#2661) 2016-10-03 10:03:11 -07:00
zcstarr 40b28f94e5 Refactor Chunks and ChildValues API to work iteratively (#2599)
* Refactors Chunks and ChildValues API to be iterative change also
exposes WalkValues which replaces SomeP and AllP
2016-09-30 16:53:00 -07:00
Aaron Boodman 596a91d341 Add face indexing to photo-index (#2654) 2016-09-29 13:27:04 -07:00
Dan Willhite 885dd5a21c Add tx-regex, tx-replace, and tx-convert args. (#2651)
Also, make help text more complete.
2016-09-29 13:03:53 -07:00
Ben Kalman 9f5725bd27 Run url-fetch perf tests against a fresh database each time
The subsequent runs of url-fetch on jenkins are way faster, and this
appears to be because commiting is much faster on subsequent runs. The
perf tests now use a new database each time.
2016-09-29 12:49:37 -07:00
Eric Halpern d9715dba0e Support db aliases and default db for noms cli
This patch implements evolving support for configuring aliases and defaults for the noms cli (started with #2131)

For an introduction, please take a look at the sample code here: https://github.com/attic-labs/noms/blob/master/samples/cli/nomsconfig/README.md

Improvements include: 

 - All go samples now work with .nomsconfig
 - Absolute paths in ldb specs are now properly handled 
 - Add -v|--verbose flag to commands to debug expansion
 - Make default just another alias and change [default] section to [db.default]
 - Introduce the `.` shorthand to refer to a previously mentioned dataset/object
2016-09-27 22:21:32 -07:00
Ben Kalman 81673c2591 Add perf test for url-fetch 2016-09-27 16:52:54 -07:00
Aaron Boodman 577c99ff38 Factor out datas.ReadAbsolutePaths() (#2623) 2016-09-27 14:21:54 -07:00
Aaron Boodman e52775f838 Refactor exit mockery into go/util/exit (#2622) 2016-09-27 13:51:27 -07:00
Ben Kalman 097863ea6f Use parallel NewBlob in the csv perf tests (#2625)
This just involves changing types.NewBlob(io.MultiReader(files...)) to
types.NewBlob(files...). On my laptop it improves
Test01ImportSfCrimeBlobFromTestdata from 21s to 16s - though much of
this is dominated by commit, which wouldn't be affected by this change.
2016-09-27 12:22:25 -07:00
Ben Kalman 35d88dd3c6 Implement Blob.Concat and make NewBlob parallel
Blob.Concat is a simple use of the sequence concat code that List.Concat uses.
NewBlob uses Blob.Concat to construct a Blob in parallel.

Perf tests for parallel NewBlob write N temporary files then constructs a Blob
from them, so there is some I/O, but it appears to be mostly CPU bound.  NewBlob
doesn't get much more than 50% faster with any P >= 2.
2016-09-27 11:08:31 -07:00
Aaron Boodman 362a5630d9 Add photo-index: a simple photo indexer. For now only indexes by tag. (#2610)
Add photo-index: a simple photo indexer. For now only indexes by tag.

Will add indexing by face/geo in subsequent patches.
2016-09-27 10:50:37 -07:00
Dan Willhite 5de36728e8 Make nomdex_update use GraphBuilder (#2619) 2016-09-26 17:00:12 -07:00
cmasone-attic 2e462b11a5 Make Database a mutable API that vends immutable Datasets (#2617)
Noms SDK users frequently shoot themselves in the foot because they're
holding onto an "old" Database object. That is, they have a Database
tucked away in some internal state, they call Commit() on it, and
don't replace the object in their internal state with the new Database
returned from Commit.

This PR changes the Database and Dataset Go API to be in line with the
proposal in Issue #2589. JS follows in a separate patch.
2016-09-26 12:18:14 -07:00
Dan Willhite 7bb7a068d6 Fix hyperlink in nomdex Readme file (#2618) 2016-09-26 11:00:33 -07:00
Dan Willhite 403bfa6560 Create Readme.md (#2616) 2016-09-26 10:42:15 -07:00
Dan Willhite e351f718e4 Use smaller dataset to testing csv-import/multi-map (#2609)
Also reuse data already imported as blob by another test.
2016-09-23 10:59:30 -07:00
Dan Willhite e5541f9343 Make csv importer use GraphBuilder (#2600) 2016-09-22 15:19:37 -07:00
Dan Willhite 3b17956907 Add perf test for multi-key maps. (#2605) 2016-09-22 13:34:37 -07:00
Erik Arvidsson 5edf89cf3d Replace d.Chk.True with d.PanicIfFalse (#2563)
And same for d.Chk.False
2016-09-14 13:11:28 -07:00
Mike Gray 1996e0a3d8 Add Noms commit command (#2474)
* Add "noms commit" command
* Updated csv-import, json-import, xml-import and url-fetch to (optionally) not commit results
* Added helpers for creating commit meta-data struct through command line or function calls
2016-09-09 12:42:27 -04:00
cmasone-attic 1c69c6b891 Update merge.ThreeWay() to allow very basic custom conflict resolution (#2505)
This patch modifies merge.ThreeWay() to take a callback that allows
for custom conflict resolution. The noms-merge command-line tool uses
this to inject a callback that accepts input from the console
dictating whether to accept the value from the 'left' or 'right' merge
candidates.

Toward #2445
2016-09-07 13:21:22 -07:00
zcstarr ef817db179 Adds error for invalid skip records argument and exits csv importer (#2522) 2016-09-06 17:37:05 -07:00
zcstarr b1c0aeb9c5 Adds checks for bad column-type and header csv-import flag values (#2525) 2016-09-06 17:12:21 -07:00
zcstarr ef11062cab Fixes bug where delimiter applied to header arguments (#2523) 2016-09-06 16:50:05 -07:00
Mike Gray 47565f39d1 Improve code based on tool analysis feeback (#2521)
Fixes are based on Go report card output:
- `gofmt -s` eliminates some duplication in struct/slice initialization
- `golint` found some issues like: `warning: should drop = nil from declaration of var XXX; it is the zero value`
- `golint` found some issues like: `warning: receiver name XXX should be consistent with previous receiver name YYY for ZZZ`
- `golint` says not to use underscores for function/variable names
- `golint` found several issues like: `warning: if block ends with a return statement, so drop this else and outdent its block`

No functional changes are included - just source code quality improvements.
2016-09-06 16:35:25 -04:00
zcstarr 3cdebb7e77 Add run safe method that reads stderr and stdout regardless of panic (#2475)
Run method will now always return stdout,stderr, and a recoveredErr
on Exit or Panic. MustRun will Panic with recoveredErr.
2016-09-06 11:30:57 -07:00
cmasone-attic e5fcfd6ebf Make poke hang on to parent's Commit metadata (#2504)
Before this, poke would drop any commit metadata from the dataset being modified. Now, it just pulls it forward.
2016-09-01 17:21:31 -07:00
cmasone-attic 9f080a2fa7 Fix test assertion in noms-merge::TestLose() (#2498)
This was caused by me changing an error string and failing
to update the test.
2016-09-01 11:34:04 -07:00
cmasone-attic 771eb092da Add failure tests for noms-merge (#2484)
Tests command line validation for noms-merge

Toward #2445
2016-09-01 11:31:29 -07:00
Aaron Boodman 42fd80be2e Add a super quick indeterminite progress meter to noms-merge (#2488) 2016-09-01 10:24:30 -07:00
Daniel Krech 01bdeab025 Add progress reporting to json_importer (#2494)
Fixes #2494
2016-09-01 00:04:26 -07:00
Ben Kalman 9c694f024b Add a perf test for CSV map import (#2461)
Currently we only have a perf test for CSV list import, which uses the
sf-crime dataset. This test uses the 43MB sf-registered-businesses
dataset instead, since sf-crime is too slow. Which is ironic, since we
normally parse sf-crime into a map.

I've also tightened up some of the other perf tests.
- Fixed a bug where Database was shared between runs.
- Make the pure CSV parsing test use a smaller dataset, it doesn't need
  to use something as large as ny-vehicle-registrations.
2016-08-31 17:05:00 -07:00
cmasone-attic 49ea5ec3c0 Introduce noms-merge, a standalone noms tool for merging datasets (#2470)
This is a first pass at exposing the new merge package to users.  The
tool is very basic, and currently only works on datasets in the same
database. It requires the 'parent' (i.e. a common ancestor of the two
datasets being merged) to be provided by a commandline option; a
follow-on patch will make the code discover this ancestor automatically.

Toward #2445
2016-08-31 14:20:21 -07:00
zcstarr aeb5c42bcc Add special encoding to csv imported struct fields (#2441)
CSV importing is changed to strip invalid characters from csv fields 
and camel case spaces. i.e. ca-mel case is translated to camelCase.
2016-08-30 14:59:10 -07:00
Aaron Boodman 24c99ae3b5 Introduce poke: a simple tool for modifying Noms data (#2449)
Introduce poke: a simple tool for modifying Noms data
2016-08-30 00:24:38 -07:00
Ben Kalman 66b98ea2a1 Add perf test for just parsing a CSV (ny-vehicle-registrations) (#2439) 2016-08-25 17:13:06 -07:00
Mike Gray 2f66e67763 fixing misspellings, fixing IneffAssign reported issues (#2436)
also removing encode-perf-rig since codec-perf-rig is more current and real
2016-08-25 13:32:34 -04:00
Mike Gray 6a661361ad csv-import/csv-export support for compound keys/nested maps (#2433) 2016-08-25 09:34:21 -04:00
Dan Willhite d7b7345218 Add informative message at base path of demo-server. (#2432) 2016-08-24 13:41:01 -07:00
Dan Willhite de32de82a2 Add != operator to nomdex query language. (#2430)
Fixes #2415
2016-08-24 10:05:42 -07:00
Dan Willhite 924492eed1 Describe find command in help output (#2429)
Fixes #2404
2016-08-23 23:05:46 -07:00
Dan Willhite 1090ba4090 Add integration test for nomdex (#2425)
Fixes #2414
2016-08-23 17:04:51 -07:00
Mike Gray 4e54c44d56 no functional changes, improving code quality (#2410)
fix misspellings; fix code that was not gofmt'd - plus take advantage of gofmt -s too; couple of unreachable golint reported fixes; reference go report card results and tests
2016-08-23 13:51:38 -04:00
Dan Willhite 7b3adfe657 Add support to nomdex to query against multiple indexes. (#2396)
Fixes #2405
Fixes #2405
Fixes #2114 (Implement nomdex)
2016-08-23 10:26:11 -07:00
Ben Kalman 892f98050e Make perf suite testdata path configurable by -perf.testdata flag (#2406)
Needed by Jenkins shared workspaces.
2016-08-22 16:40:27 -07:00