Snakemake Using expand with dictionary

I am writing this rule:

rule process_files:
 input: 
 dataout=expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref]) 
 output:
 "dataset/sample.ref.state.case.endresult.tsv"
 shell:
 do something ...

Were expand will get value from dictionary my_dictionary based on the ref value. I used wildcards like this my_dictionary[wildcards.ref]. But it ends up with this error name 'wildcards' is not defined

my_dictionary something like:
A:[1,2,3], B:[s1,s2..].....

I could use

def myfun(wildcards):
 return expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_dictionary[wildcards.ref])

and use myfun as input , but this does not answer why I can not use expand in place directly

Any suggestion how to fix it?

edited Nov 14 '18 at 15:22

asked Nov 13 '18 at 22:26

Medhat

934820

add a comment |

I am writing this rule:

rule process_files:
 input: 
 dataout=expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref]) 
 output:
 "dataset/sample.ref.state.case.endresult.tsv"
 shell:
 do something ...

my_dictionary something like:
A:[1,2,3], B:[s1,s2..].....

I could use

def myfun(wildcards):
 return expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_dictionary[wildcards.ref])

and use myfun as input , but this does not answer why I can not use expand in place directly

Any suggestion how to fix it?

edited Nov 14 '18 at 15:22

asked Nov 13 '18 at 22:26

Medhat

934820

add a comment |

I am writing this rule:

rule process_files:
 input: 
 dataout=expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref]) 
 output:
 "dataset/sample.ref.state.case.endresult.tsv"
 shell:
 do something ...

my_dictionary something like:
A:[1,2,3], B:[s1,s2..].....

I could use

def myfun(wildcards):
 return expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_dictionary[wildcards.ref])

and use myfun as input , but this does not answer why I can not use expand in place directly

Any suggestion how to fix it?

edited Nov 14 '18 at 15:22

asked Nov 13 '18 at 22:26

Medhat

934820

I am writing this rule:

rule process_files:
 input: 
 dataout=expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref]) 
 output:
 "dataset/sample.ref.state.case.endresult.tsv"
 shell:
 do something ...

my_dictionary something like:
A:[1,2,3], B:[s1,s2..].....

I could use

def myfun(wildcards):
 return expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_dictionary[wildcards.ref])

and use myfun as input , but this does not answer why I can not use expand in place directly

Any suggestion how to fix it?

expand snakemake

edited Nov 14 '18 at 15:22

asked Nov 13 '18 at 22:26

Medhat

934820

edited Nov 14 '18 at 15:22

asked Nov 13 '18 at 22:26

Medhat

934820

edited Nov 14 '18 at 15:22

asked Nov 13 '18 at 22:26

Medhat

934820

asked Nov 13 '18 at 22:26

Medhat

934820

asked Nov 13 '18 at 22:26

Medhat

934820

add a comment |

2 Answers
2

active

oldest

votes

Your question seems similar to snakemake wildcards or expand command and the bottom line is that wildcards is not defined in the input. So your solution of using an input function (or a lambda function) seems correct.

(As to why wildcards is not defined in input, I don't know...)

answered Nov 14 '18 at 8:25

dariober

1,0211221

Thanks, The function worked but it did not resolve the variable between double curly braces so it will ask for input for dataset/sample.ref.state.caseand raise an error.

– Medhat
Nov 14 '18 at 16:37

add a comment |

As @dariober mentioned there is the wildcards objects but this is only accesible in the run/shell portion but can be accessed using an input function in input.

Here is an example implementation that will expand the input based on the wildcards.ref:

rule all:
 input: expand("dataset/sample.ref.state.case.endresult.tsv", dataset=["D1", "D2"], sample=["S1", "S2"], ref=["R1", "R2"], state=["STATE1", "STATE2"], case=["C1", "C2"])


my_list = "R1": [1, 2, 3], "R2": ["s1", "s2"]

rule process_files:
 input:
 lambda wildcards: expand(
 "dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref])
 output:
 "dataset/sample.ref.state.case.endresult.tsv"
 shell:
 "echo 'input' > output"

If you implement it as the lambda function example above, it should resolve the issue you mention:

The function worked but it did not resolve the variable between double curly braces so it will ask for input for dataset/sample.ref.state.caseand raise an error.

answered Nov 14 '18 at 17:47

JohnnyBD

11115

Actually my function is the same as your lambda function and raises this error. def myfun(wildcards): return expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref]) . to overcome the issue I need to resolve each var for example ref . would be `wildcards.ref`` and so on.

– Medhat
Nov 14 '18 at 19:38

There should not really be need to do that. You are saying you pass to expand, in the case of dataset, dataset = wildcards.dataset? Seems redundant. I am using snakemake 5.3.0 in the example and it works using your myfun or lambda.

– JohnnyBD
Nov 14 '18 at 20:43

The issue is after using expand; the variable passed to sample is sample so it would be sample=sample not the actual value of sample, which makes problem in processing for next step because now there is nothing called dataset/sample... in the input file

– Medhat
Nov 14 '18 at 21:23

I am sorry but I cannot seem to reproduce this issue you are mentioning. Could you maybe edit your question and provide example of what an input would look like for one input wildcard combination? Either I am misunderstanding what are you trying to do or our implementations are different? You want to have a single value for all the wildcards except name? Essentially group a set of name inputs together? In that case you should have sample in the result of expand as that wildcard will be deduced from rule all and output.

– JohnnyBD
Nov 14 '18 at 22:26

add a comment |

Your Answer

StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);

);

draft saved

draft discarded

StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53290456%2fsnakemake-using-expand-with-dictionary%23new-answer', 'question_page');

);

Post as a guest

Name

Required, but never shown

2 Answers
2

active

oldest

votes

2 Answers
2

active

oldest

votes

(As to why wildcards is not defined in input, I don't know...)

answered Nov 14 '18 at 8:25

dariober

1,0211221

Thanks, The function worked but it did not resolve the variable between double curly braces so it will ask for input for dataset/sample.ref.state.caseand raise an error.

– Medhat
Nov 14 '18 at 16:37

add a comment |

(As to why wildcards is not defined in input, I don't know...)

answered Nov 14 '18 at 8:25

dariober

1,0211221

Thanks, The function worked but it did not resolve the variable between double curly braces so it will ask for input for dataset/sample.ref.state.caseand raise an error.

– Medhat
Nov 14 '18 at 16:37

add a comment |

(As to why wildcards is not defined in input, I don't know...)

answered Nov 14 '18 at 8:25

dariober

1,0211221

(As to why wildcards is not defined in input, I don't know...)

answered Nov 14 '18 at 8:25

dariober

1,0211221

answered Nov 14 '18 at 8:25

dariober

1,0211221

answered Nov 14 '18 at 8:25

dariober

1,0211221

answered Nov 14 '18 at 8:25

dariober

1,0211221

Thanks, The function worked but it did not resolve the variable between double curly braces so it will ask for input for dataset/sample.ref.state.caseand raise an error.

– Medhat
Nov 14 '18 at 16:37

add a comment |

Thanks, The function worked but it did not resolve the variable between double curly braces so it will ask for input for dataset/sample.ref.state.caseand raise an error.

– Medhat
Nov 14 '18 at 16:37

Thanks, The function worked but it did not resolve the variable between double curly braces so it will ask for input for dataset/sample.ref.state.caseand raise an error.

– Medhat
Nov 14 '18 at 16:37

add a comment |

As @dariober mentioned there is the wildcards objects but this is only accesible in the run/shell portion but can be accessed using an input function in input.

Here is an example implementation that will expand the input based on the wildcards.ref:

rule all:
 input: expand("dataset/sample.ref.state.case.endresult.tsv", dataset=["D1", "D2"], sample=["S1", "S2"], ref=["R1", "R2"], state=["STATE1", "STATE2"], case=["C1", "C2"])


my_list = "R1": [1, 2, 3], "R2": ["s1", "s2"]

rule process_files:
 input:
 lambda wildcards: expand(
 "dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref])
 output:
 "dataset/sample.ref.state.case.endresult.tsv"
 shell:
 "echo 'input' > output"

If you implement it as the lambda function example above, it should resolve the issue you mention:

The function worked but it did not resolve the variable between double curly braces so it will ask for input for dataset/sample.ref.state.caseand raise an error.

answered Nov 14 '18 at 17:47

JohnnyBD

11115

Actually my function is the same as your lambda function and raises this error. def myfun(wildcards): return expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref]) . to overcome the issue I need to resolve each var for example ref . would be `wildcards.ref`` and so on.

– Medhat
Nov 14 '18 at 19:38

There should not really be need to do that. You are saying you pass to expand, in the case of dataset, dataset = wildcards.dataset? Seems redundant. I am using snakemake 5.3.0 in the example and it works using your myfun or lambda.

– JohnnyBD
Nov 14 '18 at 20:43

The issue is after using expand; the variable passed to sample is sample so it would be sample=sample not the actual value of sample, which makes problem in processing for next step because now there is nothing called dataset/sample... in the input file

– Medhat
Nov 14 '18 at 21:23

I am sorry but I cannot seem to reproduce this issue you are mentioning. Could you maybe edit your question and provide example of what an input would look like for one input wildcard combination? Either I am misunderstanding what are you trying to do or our implementations are different? You want to have a single value for all the wildcards except name? Essentially group a set of name inputs together? In that case you should have sample in the result of expand as that wildcard will be deduced from rule all and output.

– JohnnyBD
Nov 14 '18 at 22:26

add a comment |

As @dariober mentioned there is the wildcards objects but this is only accesible in the run/shell portion but can be accessed using an input function in input.

Here is an example implementation that will expand the input based on the wildcards.ref:

rule all:
 input: expand("dataset/sample.ref.state.case.endresult.tsv", dataset=["D1", "D2"], sample=["S1", "S2"], ref=["R1", "R2"], state=["STATE1", "STATE2"], case=["C1", "C2"])


my_list = "R1": [1, 2, 3], "R2": ["s1", "s2"]

rule process_files:
 input:
 lambda wildcards: expand(
 "dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref])
 output:
 "dataset/sample.ref.state.case.endresult.tsv"
 shell:
 "echo 'input' > output"

If you implement it as the lambda function example above, it should resolve the issue you mention:

The function worked but it did not resolve the variable between double curly braces so it will ask for input for dataset/sample.ref.state.caseand raise an error.

answered Nov 14 '18 at 17:47

JohnnyBD

11115

Actually my function is the same as your lambda function and raises this error. def myfun(wildcards): return expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref]) . to overcome the issue I need to resolve each var for example ref . would be `wildcards.ref`` and so on.

– Medhat
Nov 14 '18 at 19:38

There should not really be need to do that. You are saying you pass to expand, in the case of dataset, dataset = wildcards.dataset? Seems redundant. I am using snakemake 5.3.0 in the example and it works using your myfun or lambda.

– JohnnyBD
Nov 14 '18 at 20:43

The issue is after using expand; the variable passed to sample is sample so it would be sample=sample not the actual value of sample, which makes problem in processing for next step because now there is nothing called dataset/sample... in the input file

– Medhat
Nov 14 '18 at 21:23

I am sorry but I cannot seem to reproduce this issue you are mentioning. Could you maybe edit your question and provide example of what an input would look like for one input wildcard combination? Either I am misunderstanding what are you trying to do or our implementations are different? You want to have a single value for all the wildcards except name? Essentially group a set of name inputs together? In that case you should have sample in the result of expand as that wildcard will be deduced from rule all and output.

– JohnnyBD
Nov 14 '18 at 22:26

add a comment |

As @dariober mentioned there is the wildcards objects but this is only accesible in the run/shell portion but can be accessed using an input function in input.

Here is an example implementation that will expand the input based on the wildcards.ref:

rule all:
 input: expand("dataset/sample.ref.state.case.endresult.tsv", dataset=["D1", "D2"], sample=["S1", "S2"], ref=["R1", "R2"], state=["STATE1", "STATE2"], case=["C1", "C2"])


my_list = "R1": [1, 2, 3], "R2": ["s1", "s2"]

rule process_files:
 input:
 lambda wildcards: expand(
 "dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref])
 output:
 "dataset/sample.ref.state.case.endresult.tsv"
 shell:
 "echo 'input' > output"

If you implement it as the lambda function example above, it should resolve the issue you mention:

The function worked but it did not resolve the variable between double curly braces so it will ask for input for dataset/sample.ref.state.caseand raise an error.

answered Nov 14 '18 at 17:47

JohnnyBD

11115

As @dariober mentioned there is the wildcards objects but this is only accesible in the run/shell portion but can be accessed using an input function in input.

Here is an example implementation that will expand the input based on the wildcards.ref:

rule all:
 input: expand("dataset/sample.ref.state.case.endresult.tsv", dataset=["D1", "D2"], sample=["S1", "S2"], ref=["R1", "R2"], state=["STATE1", "STATE2"], case=["C1", "C2"])


my_list = "R1": [1, 2, 3], "R2": ["s1", "s2"]

rule process_files:
 input:
 lambda wildcards: expand(
 "dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref])
 output:
 "dataset/sample.ref.state.case.endresult.tsv"
 shell:
 "echo 'input' > output"

If you implement it as the lambda function example above, it should resolve the issue you mention:

The function worked but it did not resolve the variable between double curly braces so it will ask for input for dataset/sample.ref.state.caseand raise an error.

answered Nov 14 '18 at 17:47

JohnnyBD

11115

answered Nov 14 '18 at 17:47

JohnnyBD

11115

answered Nov 14 '18 at 17:47

JohnnyBD

11115

answered Nov 14 '18 at 17:47

JohnnyBD

11115

Actually my function is the same as your lambda function and raises this error. def myfun(wildcards): return expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref]) . to overcome the issue I need to resolve each var for example ref . would be `wildcards.ref`` and so on.

– Medhat
Nov 14 '18 at 19:38

There should not really be need to do that. You are saying you pass to expand, in the case of dataset, dataset = wildcards.dataset? Seems redundant. I am using snakemake 5.3.0 in the example and it works using your myfun or lambda.

– JohnnyBD
Nov 14 '18 at 20:43

The issue is after using expand; the variable passed to sample is sample so it would be sample=sample not the actual value of sample, which makes problem in processing for next step because now there is nothing called dataset/sample... in the input file

– Medhat
Nov 14 '18 at 21:23

I am sorry but I cannot seem to reproduce this issue you are mentioning. Could you maybe edit your question and provide example of what an input would look like for one input wildcard combination? Either I am misunderstanding what are you trying to do or our implementations are different? You want to have a single value for all the wildcards except name? Essentially group a set of name inputs together? In that case you should have sample in the result of expand as that wildcard will be deduced from rule all and output.

– JohnnyBD
Nov 14 '18 at 22:26

add a comment |

Actually my function is the same as your lambda function and raises this error. def myfun(wildcards): return expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref]) . to overcome the issue I need to resolve each var for example ref . would be `wildcards.ref`` and so on.

– Medhat
Nov 14 '18 at 19:38

There should not really be need to do that. You are saying you pass to expand, in the case of dataset, dataset = wildcards.dataset? Seems redundant. I am using snakemake 5.3.0 in the example and it works using your myfun or lambda.

– JohnnyBD
Nov 14 '18 at 20:43

The issue is after using expand; the variable passed to sample is sample so it would be sample=sample not the actual value of sample, which makes problem in processing for next step because now there is nothing called dataset/sample... in the input file

– Medhat
Nov 14 '18 at 21:23

I am sorry but I cannot seem to reproduce this issue you are mentioning. Could you maybe edit your question and provide example of what an input would look like for one input wildcard combination? Either I am misunderstanding what are you trying to do or our implementations are different? You want to have a single value for all the wildcards except name? Essentially group a set of name inputs together? In that case you should have sample in the result of expand as that wildcard will be deduced from rule all and output.

– JohnnyBD
Nov 14 '18 at 22:26

Actually my function is the same as your lambda function and raises this error. def myfun(wildcards): return expand("dataset/sample.ref.state.case.myresult.name.tsv", name=my_list[wildcards.ref]) . to overcome the issue I need to resolve each var for example ref . would be `wildcards.ref`` and so on.

– Medhat
Nov 14 '18 at 19:38

There should not really be need to do that. You are saying you pass to expand, in the case of dataset, dataset = wildcards.dataset? Seems redundant. I am using snakemake 5.3.0 in the example and it works using your myfun or lambda.

– JohnnyBD
Nov 14 '18 at 20:43

The issue is after using expand; the variable passed to sample is sample so it would be sample=sample not the actual value of sample, which makes problem in processing for next step because now there is nothing called dataset/sample... in the input file

– Medhat
Nov 14 '18 at 21:23

I am sorry but I cannot seem to reproduce this issue you are mentioning. Could you maybe edit your question and provide example of what an input would look like for one input wildcard combination? Either I am misunderstanding what are you trying to do or our implementations are different? You want to have a single value for all the wildcards except name? Essentially group a set of name inputs together? In that case you should have sample in the result of expand as that wildcard will be deduced from rule all and output.

– JohnnyBD
Nov 14 '18 at 22:26

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Stack Overflow!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

qN P,CrDJ5BGvDE2bHzH,9kK bOx2M4T

搜尋此網誌

Odtnhj