刚才在知乎什么看到了一篇分享pacbio的数据特征,顺便看到了Minnesota大学的关于生物信息的教程的ppt合集,所以就想打包下载。
https://www.msi.umn.edu/tutorial-materials
这个网页里面有64篇pdf格式的ppt,还有几个压缩包,本来是准备写爬虫来爬去的,但是后来想了想有点麻烦,而且还不一定会看,反正也是玩玩
就用linux的命令行简单实现了这个爬虫功能。
curl https://www.msi.umn.edu/tutorial-materials >tmp.txt
perl -alne '{/(https.*?pdf)/;print $1 if $1}' tmp.txt >pdf.address
perl -alne '{/(https.*?txt)/;print $1 if $1}' tmp.txt
perl -alne '{/(https.*?zip)/;print $1 if $1}' tmp.txt >zip.address
wget -i pdf.address
wget -i pdf.zip
这样就可以啦!
教程ppt列表如下,大家有兴趣的可以自行下载浏览。
2009-04-22-mrm-presentation_0.pdf Matlab_viz_image_UMR.pdf
Analyzing ChIP at the command line.pdf MaxQuant_Introduction_112409.pdf
Analyzing ChIP using Galaxy.pdf Maxquant-step-by-step_rs091124.pdf
Badalamenti_PacBio_tutorial_12-10-2014.pdf MSI Applications Catalog Oct 21 MB slides.pdf
basics_chip_seq.pdf MSIIntro2013Jun18.pdf
Best_Practices_GATK_Variant_Detection_v1_0.pdf MSIIntroBMEN5311.pdf
blast2go.pdf MSI_Workshop_for_Introduction_to_Structure_based_Drug_Design.pdf
ClinProTools_0.pdf MTLB_GPUs.pdf
CUDA_Programming.pdf OpenMP.tutorial_1.pdf
cuda_tutorial_performance.pdf Open_Source_Proteomics_1.pdf
FLUENT_2009April21_final.pdf OptimizingWithGA.pdf
FLUENT_tutorial_2008aug14fin.pdf Orbi_Data_Analysis_092811.pdf
galaxy_101_V4_ljm_0.pdf Partek Training Handout_miRNA and mRNA Data Analysis.pdf
GPU_tools.pdf PerformanceTuning_itasca_11_27_12_0.pdf
gpututorial-msi.pdf PETSc_Tutorial.pdf
Hands_On_Tutorial_Using_ProTIP.pdf Phi_Intro.pdf
Introduction to MSI Systems.pdf Protein_Grouping_FDR_Analysis_and_Database_Pratik_March2012_Draft.pdf
Introduction_to_PEAKS_0.pdf Proteomics_MSI_072309_Print.pdf
Introduction_to_SBDD.pdf pymol_v5.pdf
IntroMPI2011july19c.pdf QC_illumina_galaxy_V1_ljm.pdf
IntroMPI2012_July25-part1.pdf Quality Control of Illumina Data at the Command Line.pdf
IntroMSI2014.pdf remotevisualization.pdf
IntroNWChem.pdf RISS_Hsapiens_variant_Detection_v3.0-small.pdf
IntroOpenMP_2011jun28b.pdf RNA_seq_Lecture2_2014_v2.pdf
Intro_to_GAMESS.pdf RNA-Seq mod1v6.pdf
IntroToGaussian09.pdf R_Spring2012_ver2.pdf
introtomolpro.pdf SchrodingerTutorial2011.pdf
Intro_to_MSI_Physicists.pdf Sybyl.pdf
intro-to-perl.pdf Tutorial-Hsap-v15.pdf
Matlab_11_29_UMR.pdf Tutorial-Stuber-v12-1.pdf
Matlab_PCT.pdf unix2013.6.18.pdf
MATLAB_Tuning.pdf WRKSP_2_19.pdf
Total wall clock time: 40m 22s
Downloaded: 64 files, 249M in 40m 2s (106 KB/s)
我都已经下载好了,打包压缩到群里面啦!