Commit Graph

377 Commits

Author SHA1 Message Date
cmasone-attic 1b5ac05793 csv-invert command line tool (#3689)
Takes the output of a CSV file imported as a List of Struct and
"inverts" it so that it's now a Struct of Lists.

Example:

List<Struct Row {
  Base?: String,
  DOLocationID?: String,
}>

becomes

Struct Columnar {
  base: List<String>,
  dolocationid: List<String>,
}
2017-09-12 15:05:31 -07:00
Aaron Boodman a18bd984d1 Update importer.go 2017-09-09 19:59:09 -07:00
wardn b213146428 fix csv-importer dest-type description (#3682) 2017-09-09 19:58:06 -07:00
Aaron Boodman 484587a2b4 Fix broken build (#3680) 2017-09-08 02:54:45 -07:00
Aaron Boodman f0568592b8 Add import to p2p sample (#3679) 2017-09-08 02:10:01 -07:00
Aaron Boodman a25ea915a7 add a simpler p2p ipfs-chat (#3678) 2017-09-07 18:32:47 -07:00
Ben Kalman 03b7221c36 Use stretchr/testify not attic-labs/testify (#3677)
stretchr has fixed a bug with the -count flag. I could merge these
changes into attic-labs, but it's easier to just use strechr.

We forked stretchr a long time ago so that we didn't link in the HTTP
testing libraries into the noms binaries (because we were using d.Chk in
production code). The HTTP issue doesn't seem to happen anymore, even
though we're still using d.Chk.
2017-09-07 15:01:03 -07:00
Eric Halpern 5d1723674b Add --lowercase option to map column names to lowercase struct names (#3675)
* Add --lowercase option to map column names to lowercase struct names

By default, each column name maps to a struct field preserving the original case.
If --lowercase is specified the resulting struct fields will always be lowercase.
2017-09-07 11:21:31 -07:00
Aaron Boodman 759cf4fe58 gofmt (#3671) 2017-09-05 18:47:56 -07:00
Dan Willhite 071ba838d2 Modifications to ipfs-chat can ipfs chunkstore (#3666) 2017-09-05 18:35:50 -07:00
Ian Davis 63a72d3bfb Use ETag/If-None-Match in url-fetch (#3664) 2017-09-04 01:02:21 -07:00
Dan Willhite aa65868741 Changes to accommodate new version of ipfs 2017-08-31 10:48:27 -07:00
Dan Willhite b1cb8a0fff Add rate limit to ipfs chunkstore, increase thread limit 2017-08-30 17:09:54 -07:00
Dan Willhite 04e837bad9 Increase rlimit and auto-initialize IPFS repos 2017-08-30 11:52:50 -07:00
Rafael Weinstein c3f98d1631 Remove in mem graphs (#3635)
This patch removes the ability to keep alive uncommitted prolly-tree sequences.
2017-08-29 13:12:10 -07:00
Rafael Weinstein 61f3d87dcf Introduce Sloppy (#3631)
Introduce Sloppy - an estimating compression function for snappy - which allows for the rolling hash to better produce a given target chunk size after compression.
2017-08-28 13:23:00 -07:00
Dan Willhite 60f676616b Various improvements
* Fix scrolling issue
* Make importer merge scripts in with pre-existing texts before committing
2017-08-25 17:25:38 -07:00
Dan Willhite c24e10646f Change msg key to be date-based rather than consecutive integer 2017-08-25 17:25:38 -07:00
Dan Willhite 996741a669 In pubsub, skip replication work on same hash notification 2017-08-25 17:25:38 -07:00
Dan Willhite 500a33eb51 Fix disappearing highlighting and scrolling issue 2017-08-25 17:25:38 -07:00
Dan Willhite cc42196818 Clean up usernames from import and limit to top 30 2017-08-25 17:25:38 -07:00
Dan Willhite 95aac7ca1e Merge modifications made by Aaron. (#3638)
* Add kingpin library for argument handling.
* Update hard-coded Version number in chunkstore.
* Store noms repo information in ipfs home directory.
2017-08-24 17:50:15 -07:00
Dan Willhite 23e0bb388b Introduce ipfs-chat (#3637)
* Add github.com/mattn/go-runewidth to vendor directory

* Add golang.org/x/net/html library to vendor directory

* Add github.com/nsf/termbox-go library to vendor directory

* Introduce ipfs-chat

* Add github.com/jroimartin/gocui library to vendor directory
2017-08-24 11:02:59 -07:00
cmasone-attic 461ff64579 NBS: Fix large HTTP {get,has}Refs/ requests (#3629)
When we added GetMany and HasMany, we didn't realize that requests
could then be larger than the allowable HTTP form size. This patch
makes the body of getRefs and hasRefs be serialized as binary instead,
which addresses this issue and actually makes the request body more
compact.

Fixes #3589
2017-08-22 14:24:13 -07:00
Jesse Ditson 5db3cf1679 kingpin docs, noms blob [put | get] (#3621)
* use kingpin for help and new commands, set up dummy command for noms blob

* document existing commands using kingpin

* remove noms-get and noms-set in favor of new noms blob command

* normalize bool flags in tests, remove redundant cases that kingpin now handles

* add kingpin to vendor files

* make profile flags global

* move --verbose and --quiet to global flags
2017-08-16 16:35:22 -07:00
Jesse Ditson bb6f68e1e6 update README for OSXFuse (#3611) 2017-08-03 19:33:25 -07:00
Rafael Weinstein f290a711c2 BlobEditor (#3599) 2017-07-25 13:40:49 -07:00
cmasone-attic f1c0b80bf6 Allow server to re-try commits if it can (#3596)
Right now, the only kinds of Commits that the server will retry are those
to different datasets. That is, if another client concurrently landed a change 
to some other dataset, and that is the only thing that causes your Commit 
attempt to fail, the server will retry.
 
Fixes #3582
2017-07-20 14:24:05 -07:00
Rafael Weinstein 44941ee44c Refactor Blob reading (#3593) 2017-07-19 15:01:44 -07:00
Rafael Weinstein fe0fc3ad86 Introduce SetEditor, Nestable Editors (#3557) 2017-06-21 15:22:04 -07:00
Rafael Weinstein 763a87aa60 Reland advance to fix (#3555) 2017-06-20 12:45:51 -07:00
Rafael Weinstein 1796d2f865 Streaming p tree updates + Map Editor (#3545) 2017-06-15 15:49:04 -07:00
Rafael Weinstein d675e4d8f4 Chunking V2 (#3521) 2017-06-13 10:48:03 -07:00
cmasone-attic 46cf38eaae Simplify Pull() (#3490)
In an NBS world, bulk 'has' checks are waaaay cheaper than they used
to be. In light of this, we can toss out the complex logic we were
using in Pull() -- which basically existed for no reason other than to
avoid doing 'has' checks. Now, the code basically just descends down a
tree of chunks breadth-first, using HasMany() at each level to figure
out which chunks are not yet in the sink all at once, and GetMany() to
pull them from the source in bulk.

Fixes #3182, Towards #3384
2017-05-22 15:50:12 -07:00
Dan Willhite f196e50e03 Remove vestigial 'verbose' argument to NewGraphBuilder (#3482) 2017-05-17 15:57:49 -07:00
cmasone-attic 1d52617eb5 NBS table names now just hash of suffix block (#3421)
Used to be that an NBS table was named by hashing the hashes
of every chunk present in the table, in hash order. That means
that to generate the name of a table you'd need to iterate
the prefix map and load every associated suffix. That would
be expensive when e.g. compacting multiple tables. This is
waaay cheaper and only slightly more likely to wind up with a
name collision.

Toward #3411
2017-04-24 14:45:54 -07:00
cmasone-attic 27556b6148 Rev Noms version (#3418)
Should've done this last week when I tweaked the HTTP
protocol to return the current root hash when a POST
to the root/ endpoint fails.
2017-04-24 12:22:55 -07:00
cmasone-attic 16ef8884a7 Make MemoryStore come correct (#3406)
It's important that MemoryStore (and, by extension TestStore)
correctly implement the new ChunkStore semantics before we go
shifting around the Flush semantics like we want to do in #3404

In order to make this a reality, I introduced a "persistence"
layer for MemoryStore called MemoryStorage, which can vend
MemoryStoreView objects that represent a snapshot of the
persistent storage and implement the ChunkStore contract.

Fixes #3400

Removed Rebase() in HandleRootGet, and added ChunkStore
tests to validate the new Put behavior more fully
2017-04-21 14:13:52 -07:00
Rafael Weinstein 5d6032a9aa Add types.MakeStructTemplate (#3412)
Add types.MakeStructTemplate
2017-04-20 14:41:57 -07:00
cmasone-attic cb930dee81 Merge BatchStore into ChunkStore (#3403)
BatchStore is dead, long live ChunkStore! Merging these two required
some modification of the old ChunkStore contract to make it more
BatchStore-like in places, most specifically around Root(), Put() and
PutMany().

The first big change is that Root() now returns a cached value for the
root hash of the Store. This is how NBS worked already, so the more
interesting change here is the addition of Rebase(), which loads the
latest persistent root. Any chunks that appeared in backing storage
since the ChunkStore was opened (or last rebased) also become
visible.

UpdateRoot() has been replaced with Commit(), because UpdateRoot() was
ALREADY doing the work of persisting novel chunks as well as moving
the persisted root hash of the ChunkStore in both NBS and
httpBatchStore. This name, and the new contract (essentially Flush() +
UpdateRoot()), is a more accurate representation of what's going on.

As for Put(), the former contract for claimed to block until the chunk
was durable. That's no longer the case. Indeed, NBS was already not
fulfilling this contract. The new contract reflects this, asserting
that novel chunks aren't persisted until a Flush() or Commit() --
which has replaced UpdateRoot(). Novel chunks are immediately visible
to Get and Has calls, however.

In addition to this larger change, there are also some tweaks to
ValueStore and Database. ValueStore.Flush() no longer takes a hash,
and instead just persists any and all Chunks it has buffered since the
last time anyone called Flush(). Database.Close() used to have some
side effects where it persisted Chunks belonging to any Values the
caller had written -- that is no longer so. Values written to a
Database only become persistent upon a Commit-like operation (Commit,
CommitValue, FastForward, SetHead, or Delete).

/******** New ChunkStore interface ********/

type ChunkStore interface {
     ChunkSource
     RootTracker
}

// RootTracker allows querying and management of the root of an entire tree of
// references. The "root" is the single mutable variable in a ChunkStore. It
// can store any hash, but it is typically used by higher layers (such as
// Database) to store a hash to a value that represents the current state and
// entire history of a database.
type RootTracker interface {
     // Rebase brings this RootTracker into sync with the persistent storage's
     // current root.
     Rebase()

     // Root returns the currently cached root value.
     Root() hash.Hash

     // Commit atomically attempts to persist all novel Chunks and update the
     // persisted root hash from last to current. If last doesn't match the
     // root in persistent storage, returns false.
     // TODO: is last now redundant? Maybe this should just try to update from
     // the cached root to current?
     // TODO: Does having a separate RootTracker make sense anymore? BUG 3402
     Commit(current, last hash.Hash) bool
}

// ChunkSource is a place chunks live.
type ChunkSource interface {
     // Get the Chunk for the value of the hash in the store. If the hash is
     // absent from the store nil is returned.
     Get(h hash.Hash) Chunk

     // GetMany gets the Chunks with |hashes| from the store. On return,
     // |foundChunks| will have been fully sent all chunks which have been
     // found. Any non-present chunks will silently be ignored.
     GetMany(hashes hash.HashSet, foundChunks chan *Chunk)

     // Returns true iff the value at the address |h| is contained in the
     // source
     Has(h hash.Hash) bool

     // Returns a new HashSet containing any members of |hashes| that are
     // present in the source.
     HasMany(hashes hash.HashSet) (present hash.HashSet)

     // Put caches c in the ChunkSink. Upon return, c must be visible to
     // subsequent Get and Has calls, but must not be persistent until a call
     // to Flush(). Put may be called concurrently with other calls to Put(),
     // PutMany(), Get(), GetMany(), Has() and HasMany().
     Put(c Chunk)

     // PutMany caches chunks in the ChunkSink. Upon return, all members of
     // chunks must be visible to subsequent Get and Has calls, but must not be
     // persistent until a call to Flush(). PutMany may be called concurrently
     // with other calls to Put(), PutMany(), Get(), GetMany(), Has() and
     // HasMany().
     PutMany(chunks []Chunk)

     // Returns the NomsVersion with which this ChunkSource is compatible.
     Version() string

     // On return, any previously Put chunks must be durable. It is not safe to
     // call Flush() concurrently with Put() or PutMany().
     Flush()

     io.Closer
}

Fixes #2945
2017-04-19 13:31:58 -07:00
Aaron Boodman 9eda8eeb89 Fix broken test (#3387) 2017-04-14 10:23:16 -07:00
Aaron Boodman 06cebbe346 Introduce @target annotation for paths (#3352)
Introduce @target annotation for paths

Fixes #2172
2017-04-13 14:56:31 -07:00
Erik Arvidsson fd997f7bfa Add IsValueSubtypeOf and IsCommit (#3375)
This adds IsValueSubtypeOf which skips computing the type of the value.

Use IsValueSubtypeOf to implement IsCommit which checks if a value is a
commit.

Replace usages of IsSubtype(t, TypeOf(v)) with IsValueSubtypeOf(v, t).

Fixes #3326
Fixes #3348
2017-04-13 10:49:17 -07:00
Erik Arvidsson 7c4e2385ab Normalize our number encoding (#3370)
Our Number encoding consists of two parts. Firsts we convert the float
into f * 2**exp, then we uvarint encode f and exp. However, we didn't
normalize f so in theory we could end up with multiple representations
of the same number.

This changes the representation to make the f the smallest possible
integer that fulfills the formula above.

For example we used to encode 256 as (0x100, 0) but with this we instead
encode it as (0x01, 8).

Fixes #2307
2017-04-10 12:20:52 -07:00
Rafael Weinstein 0b10350af3 cleanup (#3372) 2017-04-10 11:45:26 -07:00
Rafael Weinstein fbfdd317fc Encode all noms quantities as varint (#3368) 2017-04-08 22:48:03 -07:00
Rafael Weinstein d8b5d03520 Zero-tolerance for unnamed struct cycles (#3365) 2017-04-08 10:39:03 -07:00
cmasone-attic fe2c476469 Fix NBS optimistic locking (#3353)
Introduce a "lock" hash into NBS manifests to address the bad
interaction between Flush() and optimistic locking. Our original
design didn't include Flush(), which changes the set of tables without
updating the root. Thus... an optimistic locking strategy predicated
on checking the currently-persisted root hash is not robust to
interleaved Flush() calls from multiple clients.

Fixes #3349
2017-04-07 16:55:39 -07:00
Erik Arvidsson 3cda20e251 Add types.HasStructCycle and cleanup MakeStructType (#3354) 2017-04-07 16:27:17 -07:00
Erik Arvidsson fd815b10ad Compute type based on value (#3338)
This moves the type off from the value and instead we compute it as we ask for.

This also changes how we detect cycles. If a named struct contains a struct with the
same name we now create a cycle between them. This also means that cycle types
now take a string and not a number.

For encoding we no longer write the type with the value (unless it is a types.Ref).

This is a format change so this takes us to 7.6

Fixes #3328
Fixes #3325
Fixes #3324
Fixes #3323
2017-04-06 17:43:49 -07:00