{"id":23,"date":"2013-07-04T14:09:00","date_gmt":"2013-07-04T02:09:00","guid":{"rendered":"http:\/\/deborahfitchett.com\/blog\/?p=23"},"modified":"2013-07-04T14:09:00","modified_gmt":"2013-07-04T02:09:00","slug":"nesi-publishing-data-open-licenses-nzes","status":"publish","type":"post","link":"https:\/\/deborahfitchett.com\/blog\/2013\/07\/nesi-publishing-data-open-licenses-nzes\/","title":{"rendered":"NeSI; publishing data; open licenses #nzes"},"content":{"rendered":"<p><b>    Connecting Genetics Researchers to NeSI<\/b><br \/><i>    James Boocock &#038; David Eyers, University of Otago<br \/>Phil Wilcox, Tony Merriman &#038; Mik Black, Virtual Institute of Statistical Genetics (VISG) &#038; University of Otago<\/i><br \/>Theme of conference &#8220;eResearch as an enabler&#8221; &#8211; show researchers that eresearch can benefit them and enabling them.<br \/>There&#8217;s been a genomic data explosion &#8211; genomic, microarray, sequencing data. Genetics researchers need to use computers more and more. Computational cost increasing, need to use shared resources. &#8220;Compute first, ask questions later&#8221;.<\/p>\n<p><a href=\"http:\/\/galaxyproject.org\/\">Galaxy<\/a> aims to be web-based platform for computational biomedical research &#8211; accessible, reproducible, transparent. Has a bunch of interfaces. Recommends shared file system and splitting jobs into smaller tasks to take advantage of HPC.<\/p>\n<p>Goal to create an interface between NeSI and Galaxy. Galaxy job > a job splitter > subtasks performed at NeSI then &#8216;zipped up&#8217; and returned to Galaxy. Not just file spliting by lines, but by genetic distance. Gives different sized files.<\/p>\n<p>Used git\/github to track changes, and Sphynx for python documentation. Investigating Shibboleth for authentication. Some bugs they&#8217;re working on. Further looking at efficiency measures for parallelization, building machine-learning approach do doing this.<\/p>\n<p><b>    Myths vs Realities: the truth about open data<\/b><br \/><i>    Deborah Fitchett &#038; Erin-Talia Skinner, Lincoln University<\/i><br \/>Our slides and notes available at the <a href=\"http:\/\/hdl.handle.net\/10182\/5513\">Lincoln University Research Archive<\/a><\/p>\n<p><b>    Some rights reserved: Copyright Licensing on our Scholarly record<\/b><br \/><i>    Richard Hosking &#038; Mark Gahegan, The University of Auckland <\/i><\/p>\n<p>Copyright law has effect on reuse of data. Copyright = bundle of exclusive rights you get for creating work, to prevent others using it. Licensing is legal tool to transfer rights. Variety of licensing approaches, not created equal.<\/p>\n<p>Linked data, combining sources with different licenses, makes licensing unclear &#8211; interoperability challenges.<\/p>\n<p> * Lack of license &#8211; obvious problem<br \/> * Copyleft clauses (sharealike) &#8211; makes interoperability hard<br \/> * Proliferation of semi-custom terms &#8211; difficulties of interpretation<br \/> * Non-open public licenses (eg noncommercial) &#8211; more difficulties of interpretation<\/p>\n<p>Technical, semantic, and legal challenges.<br \/>Research aims to capture semantics of licenses in a machine-readable format to align with, and interpret in context of, research practice. Need to go beyond natural language legal text.  License metadata: RDF is a useful tool &#8211; allows sharing and reasoning over implications. Lets us work out whether you can combine sources.<\/p>\n<p>Mapping terminology in licenses to research jargon.<br \/>Eg &#8220;reproduce&#8221;  &#8220;making an exact Copy&#8221;<br \/>&#8220;collaborators&#8221;  &#8220;other Parties&#8221;<\/p>\n<p>This won&#8217;t help if there&#8217;s no license, or legally vague, or for novel use cases where we&#8217;re waiting for precedent (eg text mining over large corpuses)<\/p>\n<p>Compatibility chart of Creative Commons licenses &#8211; some very restricted. &#8220;Pathological combinations of licenses&#8221;. Computing this can help measure combinability of data, degree of openness. Help understanding of propagation of rights and obligations. <\/p>\n<p>Discussion of licensing choices should go beyond personal\/institutional policies.<\/p>\n<p>Comment: PhD student writing thesis and reusing figures from publications. For anything published by IEEE legally had to ask for permission to reuse figures he&#8217;d created himself. Not just about datasets but anything you put out.<\/p>\n<p>Comment: &#8220;Best way to hide data is to publish a PhD thesis&#8221;.<\/p>\n<p>Q: Have you started implementing?<br \/>A: Yes but still early on coding as RDF structure and asking simple questions. Want to dig deeper.<\/p>\n<p>Q: Get in trouble with practicing law &#8211; always told by institution to send questions to IP lawyers etc. Has anyone got mad at you yet?<br \/>A: I do want to talk to a lawyer at some point. Can get complex fast especially pulling in cross-jurisdiction.<br \/>Comment: This will save time (=$$$) when talking to lawyer.<br \/>A: There&#8217;s a lot of situations where you don&#8217;t need a lawyer &#8211; that&#8217;s more for fringe cases.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Connecting Genetics Researchers to NeSI James Boocock &#038; David Eyers, University of OtagoPhil Wilcox, Tony Merriman &#038; Mik Black, Virtual Institute of Statistical Genetics (VISG) &#038; University of OtagoTheme of conference &#8220;eResearch as an enabler&#8221; &#8211; show researchers that eresearch can benefit them and enabling them.There&#8217;s been a genomic data explosion &#8211; genomic, microarray, sequencing [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[],"tags":[8,10,9,6],"_links":{"self":[{"href":"https:\/\/deborahfitchett.com\/blog\/wp-json\/wp\/v2\/posts\/23"}],"collection":[{"href":"https:\/\/deborahfitchett.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/deborahfitchett.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/deborahfitchett.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/deborahfitchett.com\/blog\/wp-json\/wp\/v2\/comments?post=23"}],"version-history":[{"count":0,"href":"https:\/\/deborahfitchett.com\/blog\/wp-json\/wp\/v2\/posts\/23\/revisions"}],"wp:attachment":[{"href":"https:\/\/deborahfitchett.com\/blog\/wp-json\/wp\/v2\/media?parent=23"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/deborahfitchett.com\/blog\/wp-json\/wp\/v2\/categories?post=23"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/deborahfitchett.com\/blog\/wp-json\/wp\/v2\/tags?post=23"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}