Hi
user
Admin Login:
Username:
Password:
Name:
FLOSSing in the lab – What Plant and Food Research does with FLOSS
--client
lca
--show
lca_2015
--room OGGB_260098 9442 --force
Next: 12 Connecting Containers: Building a PaaS with Docker and Kubernetes
show more...
Marks
Author(s):
Zane Gilmore
Location
OGGB 260-098
Date
jan Thu 15
Days Raw Files
Start
15:40
First Raw Start
15:01
Duration
0:45:00
Offset
0:38:24
End
16:25
Last Raw End
16:26
Chapters
00:00
Total cuts_time
45 min.
http://lca2015.linux.org.au/schedule/30122/view_talk
raw-playlist
raw-mp4-playlist
encoded-files-playlist
host
public
tweet
mp4
svg
png
assets
release.pdf
FLOSSing_in_the_lab_What_Plant_and_Food_Research_does_with_FLOSS.json
logs
Admin:
episode
episode list
cut list
raw files day
marks day
marks day
image_files
State:
---------
borked
edit
encode
push to queue
post
richard
review 1
email
review 2
make public
tweet
to-miror
conf
done
Locked:
clear this to unlock
Locked by:
user/process that locked.
Start:
initially scheduled time from master, adjusted to match reality
Duration:
length in hh:mm:ss
Name:
Video Title (shows in video search results)
Emails:
email(s) of the presenter(s)
Released:
Unknown
Yes
No
has someone authorised pubication
Normalise:
Channelcopy:
m=mono, 01=copy left to right, 10=right to left, 00=ignore.
Thumbnail:
filename.png
Description:
Roy Storey, Ben Warren, Eric Burgueno, Zane Gilmore, and Matthew Laurenson Plant & Food Research (PFR; www.plantandfood.co.nz) is a New Zealand government-owned research institute that directs significant effort to breeding and delivering new cultivars to industry. Breeding is a numbers game. Each year PFR produces hundreds of thousands of new plant genotypes, the bulk of which are deliberately discarded before significant resource has been invested in growing them. Some plant characteristics cannot be observed until plants have reached reproductive maturity, which can take several years. Analysing a seedling’s genetic sequence data with genomics tools provides an opportunity to identify early on whether it carries useful genes, such as those associated with disease resistance or fruit colour. The compressed genetic sequence of a single plant can occupy hundreds of gigabytes. Generating such data is becoming cheaper every year - scientists respond by using sequencing technology more often, which increases reliance on robust data processing and management methods. Tracking sample processing requires careful identification and management of individual plants, their associated genotypes, any samples collected from those plants, and the locations of both plants and samples. PFR has developed several software assemblages to address this challenge. "Kea", a sample handling system developed using the Python-based Django framework and an underlying postgreSQL database, helps users to manage laboratory analysis of plant samples. Kea tracks hundreds of thousands of samples through laboratory processing. It uses Elastic Search for scalable indexing to provide rapid response times for data filtering and reporting. "powerPlant" provides scientists with the tools carry out their own bioinformatics (the science of managing and analyzing genetic data) using open source analysis tools such as Galaxy, Ensembl, and R-Studio. powerPlant employs the open source job scheduler OpenLava to allocate processing tasks from front end web servers to back end processing nodes and databases. Open data directives like the New Zealand Government Open Access and Licensing framework (NZGOAL), Reproducible Research, and the desire to collaborate with other organisations led PFR to establish scinet.org.nz to share data sets. Each data set has a MediaWiki instance for commentary and content management, coupled with a data browsing application such as GBrowse, for manipulating and displaying annotations on genomes. PFR now has a single 300 TB file system capable of storing large datasets, but we currently lack smart ways of processing them. We use a range of Linux-based servers with RAM ranging from 84 GB to 4 TB with 8 to 160 hyper-threaded CPU cores. Typical analyses can fully utilize even the largest of these servers. Much of current Information Technology industry activity is about "dividing the hardware" or even the OS itself into tinier chunks. In contrast, we need to identify how to divide a single process and distribute it across multiple servers to maximize resource use. Standards like Open MPI are only partially adopted and numerous bioinformatics applications don't utilize processors across multiple servers. In the future we plan to explore using systems like iRods to automate file and metadata handling, and perhaps enable asynchronous integrations with national processing facilities like NeSI.
markdown
Comment:
production notes
2015-01-15/15_01_36.dv
Apply:
15:01:36 - 15:41:11 ( 00:39:35 )
S:
15:01:36 -
E:
15:41:11
D:
00:39:35
show more...
vlc ~/Videos/veyepar/lca/lca_2015/dv/OGGB_260098/2015-01-15/15_01_36.dv :start-time=00.0 --audio-desync=0
Raw File
Cut List
15:01:36
seconds: 0.0
Wall: 15:01:36
Duration
00:39:35
15:41:11
seconds: 0.0
Wall: 15:01:36
Comments:
mp4
mp4.m3u
dv.m3u
Split:
Sequence:
:
delete
2015-01-15/15_41_11.dv
Apply:
15:41:11 - 16:26:55 ( 00:45:44 )
S:
15:41:11 -
E:
16:26:55
D:
00:45:44
show more...
vlc ~/Videos/veyepar/lca/lca_2015/dv/OGGB_260098/2015-01-15/15_41_11.dv :start-time=00.0 --audio-desync=0
Raw File
Cut List
15:41:11
seconds: 0.0
Wall: 15:41:11
Duration
00:45:44
16:26:55
seconds: 0.0
Wall: 15:41:11
Comments:
mp4
mp4.m3u
dv.m3u
Split:
Sequence:
:
delete
Rf filename:
root is .../show/dv/location/, example: 2013-03-13/13:13:30.dv
Sequence:
get this:
check and save to add this
2015-01-15/15_01_36.dv
2015-01-15/15_41_11.dv
Veyepar
Video Eyeball Processor and Review