I am trying to load documents using MLCP import with the -output_uri_replace option, for example:
-output_uri_replace
".*/,'',---,':',___,'/'"
Everything works, except that I need to keep square brackets in my URIs, and MLCP always encodes them as %5B and %5D.
I have tried different patterns to force literal brackets (and no pattern at all), without success. For example:
-output_uri_replace
".*/,'',---,':',___,'/',\[,'U\+005B',\],'\]'"
Has anyone had the same experience, or found a solution? :)
URI.encode(): github.com/marklogic/marklogic-contentpump/blob/master/…, which will escape square brackets, since the URI spec doesn't allow them in the path portion. I think @grtjn is right, the only way would be to have a transform that writes to the un-escaped URI on ingest. - Daniel Quinlan
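A minimal sketch of the transform approach suggested above, written as a MarkLogic server-side JavaScript module. It assumes that the URI reaching the transform already contains %5B and %5D, and that changing content.uri in the transform changes the URI the document is written to. I have not verified either assumption against the MLCP source. The helper name restoreBrackets and the module path are hypothetical.

```javascript
// Hypothetical MLCP server-side transform, e.g. installed as /transforms/restore-brackets.sjs.
// Assumption: MLCP has already percent-encoded the brackets, so content.uri
// arrives here containing %5B and %5D.

// Pure helper: turn %5B / %5D (upper or lower case) back into literal brackets.
function restoreBrackets(uri) {
  return String(uri).replace(/%5B/gi, '[').replace(/%5D/gi, ']');
}

// MLCP invokes the exported function once per document.
// content.uri is the target URI; content.value is the document itself.
function transform(content, context) {
  content.uri = restoreBrackets(content.uri);
  return content;
}

exports.transform = transform;
```

The module would then be referenced on the import command line with -transform_module. Only the two bracket escapes are decoded, so any other percent-encoded characters in the URI are left as MLCP produced them.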