Yaml files can be surprisingly lenient in what can be parsed as a yaml file. A reasonable way of validation a yaml file is to ensure the top level is a map or array (although it is valid yaml to have scalars at the top level, but often this is not what you want). This can be done by:
To merge all given yaml files into one, use the reduce operator with the * (multiply) operator. Note the use of ea or eval-all to load all files into memory so that they can be merged.
Merge - showing the source file and line
To see the original source file and line number of your merged result, you can pre-process the files and add that information in as line comments, then perform the merge.
Creating a new file / working with blank documents
To create a new yaml file simply:
Comparing yaml files
The best way to run a diff is to use yq to normalise the yaml files and then just use diff. Here is a simple example of using pretty print -P to normalise the styling and running diff:
This way you can use the full power of diff and normalise the yaml files as you like.
You may also want to remove all comments using ... comments=""
Reading multiple streams (STDINs)
Like diff and other bash commands, you can use <(exp) to pipe in multiple streams of data into yq. instance:
Updating deeply selected paths
or why is yq only returning the updated yaml
The most important thing to remember to do is to have brackets around the LHS expression - otherwise what yq will do is first filter by the selection, and then, separately, update the filtered result and return that subset.
Combining multiple files into one
In order to combine multiple yaml files into a single file (with --- separators) you can just:
Multiple updates to the same path
You can use the with operator to set a nested context:
The first argument expression sets the root context, and the second expression runs against that root context.
Logic without if/elif/else
yq has not yet added if expressions - however you should be able to use with and select to achieve the same outcome. Lets use an example:
Now, if you were using good ol' jq - you may have a script with ifs like so:
Using yq - you can get the same result by:
Note that the logic isn't quite the same, as there is no concept of 'else'. So you may need to put additional logic in the expressions, as this has for the 'else' logic.
yq adds a !!merge tag automatically
The merge functionality from yaml v1.1 (e.g. <<:has actually been removed in the 1.2 spec. Thankfully, yq underlying yaml parser still supports that tag - and it's extra nice in that it explicitly puts the !!merge tag on key of the map entry. This tag tells other yaml parsers that this entry is a merge entry, as opposed to a regular string key that happens to have a value of <<:. This is backwards compatible with the 1.1 spec of yaml, it's simply an explicit way of specifying the type (for instance, you can use a !!str tag to enforce a particular value to be a string.
Although this does affect the readability of the yaml to humans, it still works and processes fine with various yaml processors.
# load array into a bash array
# output each entry as a single line json
readarray identityMappings < <(yq -o=j -I=0 '.identities[]' test.yml )
for identityMapping in "${identityMappings[@]}"; do
# identity mapping is a single json snippet representing a single entry
roleArn=$(echo "$identityMapping" | yq '.arn' -)
echo "roleArn: $roleArn"
done