Brainingstorming Notes
Rob's group
| | Include some free text about this service / workflow |
| | Certified workflows |
| | big registry across webservices world wide |
| | tag workflows |
| | search over tags |
| Paolo | News, changes, new workflows, new services (advertising / finding) |
| | Portal interface should be customizable, user accounts, roles |
| Raul | Asynchronous services |
| June | Must include some prestige to encourage participation - featured workflows / projects |
| Katy | show other people your workflows, sharing, finding workflows related to your subject |
| | Ontologies of service description |
| Paolo | Examples of usages of webservices / workflows |
| Simon | Access to high performance data analysis, canned routines |
| Raul | should be able to invoke in parallel (asynchronous) |
| Wang | determine whether services are reliable or unavailable - doing it before you run the workflows or continually check services |
| | find publications related to workflows |
| Simon | Meta ????, library of analysis |
| June | Customisable UI, project logo's, colour schemes |
| Katy | Find data sets for use in your workflow |
| Katy | Find examples of workflow patterns |
| | Certified workflows |
It was noted by the group: Workflows differ from services in that they can be changed, but the distinction is blurred
| Raul | Persitence in results (Provenance,) make intermediate results available |
It was pointed out by Alan: Not all workflows and services are determinable
| Alan | Fellow researches in domain |
| Simon | Interactivity, computational steering |
| Raul | e-portal and taverna - must be able to support different kinds of workflow, e.g. BPEL |
| Stuart | Bring people together with over lapping expertise (fellow researchers in domain) |
| Antoon | Service harbour(?), services that are easy to use with example of journal papers showing them alive and kicking |
| | Monitor quality of service, workflows, uptime |
| Raul | Workflow repository, download, execution |
| Raul | Minimum requirements for e-portal: create, upload, store and execute |
| | Bring together people with similar expertise |
| | Get example data for testing my workflow |
| Stuart | Find people that fill gaps in my or my groups expertise |
| | Compare results with other people |
| | Preserve my interlectual property |
| Anika | Edit workflows once uploaded |
| Paolo | Reference site for workflow thats included in a publication |
| | Grouping of people, e.g. my lab, that might have different rights to see my stuff |
| Anika | Showing of intermediate results |
| Anika | Easy saving of results |
Overall consensus is that portal should be able to execute a workflow
| | Ranking services / workflows |
| | Clustering services supporting the same function |
| | Support for end users, faqs, documentation |
| Katy | Calendar, links to forthcoming events |
| Katy | Control over what users can see - grouping |
| Katy | Gold standard for workflows and a way of measuring |
| June | Must allow commenting |
| | New projects, new services |
| June | RSS feeds and rating mechanism - prestige |
| Raul | Call by reference (meaning passing of data as a reference token rather than the data contents) |
| June | Self tagging rather than controlled vocabulary |
| June | make a results pool, repository for data (what people have done with data) |
| | Stamped and authorised, package data workflows and provenance |
| | Ontologies of domains, who develops, who uses |
| Anika | problem: Taverna engine consumes a lot of memory |
| Anika | ping webservices before enacting |
| | Asynchonous web services |
| | Show author of webservices |
| | edit, upload workflows |
| | Funding calls and funding projects |
| | Finding data sets for use within my workflow |
| Katy | Act as a supplementary workflow share for publications - standard way to share workflows and data |
Group vote for Instant Messaging resulted in 0 votes
Workflows are not built collaboratively
Alex' group
Consider when to use portal vs. application. Run locally when needed
(like visualisation), run on grid/server when possible.
Error messages -> solutions. Give help, not stacktraces.
Rating, trust network. "I'm a five-starrer on myExperiment" should be a
good thing to mention in a job interview. Would mean changing current
science culture! "Where's your papers?". Rating on "How many are
collaborating with you?" or vice versa. Open peer
reviewing, bypass journals. Review the reviewers. Reward good citation
practice.
Economy system, like Slashdot, earn "points" by contributing.
Google everything. This doesn't mean that we should depend on Google,
but that there should be an easy text search that searches everything in
the portal. Avoid massive categories and ontologies, just use tags and
search.
Annotate and link anything in the portal, including links.
The me-Science. Scientists are selfish, they don't naturally want to
collaborate. Two scientists at the same institutions might never talk
if not for some 3rd party bringing them together.
Negative result space, this is what I couldn't get to work.
Domain spaces, I'm an expert in biology/astronomy/X, just show me
relevant stuff.
Tags, tag clustering, text mining. Use auto-complete to help people
choose "right" tags, could include (but not require) ontology terms. New
and popular tags suggests changes to ontology.
myGene, club for people interested in a particular gene. There are
scientists that are so focused that when a new service comes up, they
immediately check if say their favourite gene is there. If not, they'll
leave and never come back.
Running Taverna in web browser (AJAX), build workflows online.
Guidelines and tutorials, demo movies like Ruby-on-Rails. Best
practices. Gold standard workflows.
Workflow logs, lab logs (stop calling it provenance)
There's always issues with security in todays technology, "No, you can't
do that, you need a certificate X and permission Y. What about just do
it non-secure and say that it is non-secure? Non-secure space,
non-secure grid, etc, should just work.
Easy grid submission, "Add your local grid".
Should run in "every" browser, like IE 5. No Word documents, no PDF,
open standards.
No manuals should be needed. "Simple, clean interface".
Software development through mash-ups, "empty" workflows that can later
be realized with real services.
Offline usage. What happens when my laptop can't reach myExperiment?
Automatic citation generation, the "Generate Nature-paper" button would find
all the references for workflows and data you have used.
Publishing a paper with corresponding published workflow and data on
myExperiment, data/wf-sharing could be a requirement set out by funders.
Fear: "People will laugh of me." Choose carefully what to make public,
hide the crap, "edit" provenance before publishing (remove bad runs).
After sharing with "friend", remove such access as the friend turned out
to not be your friend anymore.
Rumours and gossip space.
Find scientists in your field or interest. For instance,
PhD? students
from different universities doing the same thing could connect up.
Finding project partners. Finding funding. The real boom of the internet
was people connecting to each other. Job search, CV/Job space. Virtual
professor, both for consulting and.. eh.. funding.
Knowing the concequences in advance, avoid submitting a workflow that
will hammer down all services.
Data management and visualisation.
Different myExperiment installations in different labs should be
cross-linkable. People will probably not submit all their data to some
3rd party site, but if they can install and run it locally they would
start straight away. In fields like bioinformatics and biology, the
workflows are not that important and easily sharable, while the data are
precious and best kept in a safe.
Ship data to curator. Curator sympathy, they also need help!
Pimp my workflow! Help me improve this workflow.
What about signal/noise ratio? Computer scientists are well known to
produce massive amounts of text. Need policing, content vs. comments vs.
rating. Some comments should be better than none.
How to get people to buy in? Need incentives. "I publish my workflow.
Why? I think it's nice to see it reused". Citing.
Science is about re-use. Why aren't we sharing?