Science topics: Computer Science and EngineeringBioinformatics
Science topic
Bioinformatics - Science topic
Explore the latest questions and answers in Bioinformatics, and find Bioinformatics experts.
Questions related to Bioinformatics
I am wondering what is your tool of choice for making a powerful statistical analyses and beautiful publication-ready plots in case of genomic intervals overlays, usually starting with a two or more .bed files. In last few days I was playing with BedSect (https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2020.00003/full), but would like to try also some alternatives :)
Thank you in advance, for your tips and tricks!
Constant touchstones following the protocol introduces a new dimension of bioethics to the bioinformatics about big ideas, modes of inquiry, intellectual traits and habits of mins but it is relativity of shared economy that transcends attention span to specific evidence of integration.
Hello,
In the literature, there are some MS/MS results that include hypothetical proteins, which can be shorter than 40 amino acids. I can also find these when I search for an organism in the protein section of NCBI. My question is, would it be absurd if I synthetically synthesize these peptides called hypothetical proteins and test them as drug candidates in certain disease models? Or are studies like the one I mentioned feasible and being conducted? If so, what procedure should I follow? For example, when I find a hypothetical protein, should I first perform a blast and then synthesize and use it if it meets certain conditions?
Is there any chance you could share some references with me that have been done in this manner?
I hope I have been able to convey what I want to ask.
Thank you for your answers.
Example link: https://www.ncbi.nlm.nih.gov/protein?term=txid562%5Borganism%3Aexp%5D+AND+((%2210%22%5BSLEN%5D+%3A+%2220%22%5BSLEN%5D)&cmd=DetailsSearch
Fatal error:
Atom HD1 in residue HIS 822 was not found in rtp entry HSE with 17 atoms while sorting atoms.
For a hydrogen, this can be a different protonation state, or it
might have had a different number in the PDB file and was rebuilt
(it might for instance have been H3, and we only expected H1 & H2).
Note that hydrogens might have been added to the entry for the N-terminus.
Remove this hydrogen or choose a different protonation state to solve it.
Option -ignh will ignore all hydrogens in the input.
I also Followed the suggestion to using -ignh in the code but it give me this error:
Fatal error:
Atom OXT in residue VAL 961 was not found in rtp entry VAL with 16 atoms
while sorting atoms.
Would anyone please help me? I can't concentrate on my studies unless I solve them.
Can anyone assist with running the Linkage Analysis Tool 'Easylinkage' or any alternative tool for conducting linkage analysis and calculating LOD scores?
Hi, I'm new to bioinformatics. I have a weighted UniFrac metric generated from some bacterial samples and I want to test the effect of two factors A and B (both binary) on the composition using adonis2 in R. When I only include factor A, the result is not significant. But if I include the interaction of A and B, the interaction of A*B is not significant, but the individual effect of A and B are both significant. I'm wondering why the effect of A becomes significant, cause from the PCoA plot, the four groups (i.e., A1*B1, A1*B2, A2*B1, A2*B2) somehow overlapped.
Hi all,
I have just started to learn about bioinformatics and I need help with it.
I have enriched some microbes from wastewater anaerobic sludge and sent them for 16S rRNA sequencing.
Based on the QC result I got after running trimmomatic, I am still not able to get a good quality sequence. The following is the code I ran for trimmomatic. Can you all help me with this?
trimmomatic PE -threads 2 -phred33 \
Raw160823_1.fastq.gz Raw160823_2.fastq.gz \
Raw160823_1P.fastq.gz Raw160823_1F.fastq.gz Raw160823_2P.fastq.gz Raw160823_2F.fastq.gz \
HEADCROP:10 SLIDINGWINDOW:4:30 MINLEN:50
Thank you very much!
Regards,
Kai
2024 IEEE 7th International Conference on Computer Information Science and Application Technology (CISAT 2024) will be held on July 12-14, 2024 in Hangzhou, China.
---Call For Papers---
The topics of interest for submission include, but are not limited to:
◕ Computational Science and Algorithms
· Algorithms
· Automated Software Engineering
· Bioinformatics and Scientific Computing
......
◕ Intelligent Computing and Artificial Intelligence
· Basic Theory and Application of Artificial Intelligence
· Big Data Analysis and Processing
· Biometric Identification
......
◕ Software Process and Data Mining
· Software Engineering Practice
· Web Engineering
· Multimedia and Visual Software Engineering
......
◕ Intelligent Transportation
· Intelligent Transportation Systems
· Vehicular Networks
· Edge Computing
· Spatiotemporal Data
All papers, both invited and contributed, the accepted papers, will be published and submitted for inclusion into IEEE Xplore subject to meeting IEEE Xplore's scope and quality requirements, and also submitted to EI Compendex and Scopus for indexing. All conference proceedings paper can not be less than 4 pages.
Important Dates:
Full Paper Submission Date: April 14, 2024
Submission Date: May 12, 2024
Registration Deadline: June 14, 2024
Conference Dates: July 12-14, 2024
For More Details please visit:
Invitation code: AISCONF
*Using the invitation code on submission system/registration can get priority review and feedback
Please suggest bioinformatics journals which do not need wet lab experiments.
I have several pairs of parameters (obtained from females and males) and want to find the difference in correlation between the two sexes for each parameter, but also want to give a weight so that the parameter showing the highest correlation with survival in either females or males have a greater weight. This way, I hope to find factors that shows a combination of strong correlation differences between females and males (with regard to survival) - and most positively correlated with survival for either sex (which I will resolve further).
To do this, if I have a correlation of parameter 1 for males as A and for females as B: I plan to do (A-B) multiplied by A or B (the highest correlation) - to acknowledge the weight of highest positive correlation with survival. For the next parameter, the correlation is C for males and D for females, I will do (C-D) X C or D (whichever is highest) - with the final aim to rank the parameters most differing between females and males, as well as, most correlating with survival of either sex. Do you think it is a reasonable idea?
I would be very very grateful for your advice, suggestions and tips.
I want to find the UTR sequence of mRNA sequence of bacteria protein. Can anyone suggest a insilico process for that
I am new to Desmond simulations and I want to know how can I find the estimated time left for a simulation to be completed? my 2nd query is how to perform B-Factor analysis after performing simulation on Desmond? Any help in this regard will be highly appreciated.
Thanks
I learnt to add 1 sequence at once but couldn't find an option to upload a bunch at once, still a tyro at bioinformatics, any leads?
Here are some examples of software that can be used for each step of RNA-seq data analysis:
- Quality Control: FastQC, PRINSEQ, Sickle
- Read Trimming: Trimmomatic, Cutadapt, AdapterRemoval
- Alignment: STAR, HISAT2, TopHat
- Quality Control of Alignment: Qualimap, RSeQC, Picard
- Assembly: Trinity, Oases, Trans-ABySS
- Quantification: RSEM, Kallisto, eXpress
- Differential Expression Analysis: DESeq2, EdgeR, limma
- Functional Annotation: Blast2GO, KEGG, Reactome
- Pathway Analysis: KEGG Pathway, Reactome, Enrichr
- Network Analysis: Cytoscape, STRING, ClueGO
- Visualization: IGV, GenomeBrowse, JBrowse
- Interpretation: GSEA, DAVID, IPA
Dear Scientists and Researchers,
I'm thrilled to highlight a significant update from PeptiCloud: new no-code data analysis capabilities specifically designed for researchers. Now, at www.pepticloud.com, you can leverage these powerful tools to enhance your research without the need for coding expertise.
Key Features:
PeptiCloud's latest update lets you:
- Create Plots: Easily visualize your data for insightful analysis.
- Conduct Numerical Analysis: Analyze datasets with precision, no coding required.
- Utilize Advanced Models: Access regression models (linear, polynomial, logistic, lasso, ridge) and machine learning algorithms (KNN and SVM) through a straightforward interface.
The Impact:
This innovation aims to remove the technological hurdles of data analysis, enabling researchers to concentrate on their scientific discoveries. By minimizing the need for programming skills, PeptiCloud is paving the way for more accessible and efficient bioinformatics research.
Join the Conversation:
- How do you envision no-code data analysis transforming your research?
- Are there any other no-code features you would like to see on PeptiCloud?
- If you've used no-code platforms before, how have they impacted your research productivity?
PeptiCloud is dedicated to empowering the bioinformatics community. Your insights and feedback are invaluable to us as we strive to enhance our platform. Visit us at www.pepticloud.com to explore these new features, and don't hesitate to reach out at [email protected] with your thoughts, suggestions, or questions.
Together, let's embark on a journey towards more accessible and impactful research.
Warm regards,
Chris Lee
Bioinformatics Advocate & PeptiCloud Founder
How are the AMRFinderPlus and CARD different from each other for predication of AMR genes from bacterial genomic sequences?
How much overlap do AMRFinderPlus and CARD database have?
My question relates to the implicit assumption that topologically associating domains(TAD) have to be contiguous along the genome. This seems odd to me given that the DNA molecule exists in a 3D space while this contiguity criteria relates only to the 1D genome coordinate, which might not be appropriate to delimit interactions in a 3D space.
Consequently I am wondering if I'm missing anything obvious to impose such a criteria to characterise TADs.
Thanking you in advance.
I am currently learning about PyMol to utilize in my project. I used PyMol to visualize potential H-bond interactions in specific amino acid residues. However, I have discovered that Arg465 and Ser461 show a distinct interaction, as shown.
Please help identify this interaction.
hello,
i am getting idle1.2.2 error in autodock1.5.6. to open my pdb 1j5e file, file is not even visualized on my screen. so i am getting error in first step of docking.
please give response
thank you.
2024 3rd International Conference on Biomedical and Intelligent Systems (IC-BIS 2024) will be held from April 26 to 28, 2024, in Nanchang, China.
It is a comprehensive conference which focuses on Biomedical Engineering and Artificial Intelligent Systems. The main objective of IC-BIS 2024 is to address and deliberate on the latest technical status and recent trends in the research and applications of Biomedical Engineering and Bioinformatics. IC-BIS 2024 provides an opportunity for the scientists, engineers, industrialists, scholars and other professionals from all over the world to interact and exchange their new ideas and research outcomes in related fields and develop possible chances for future collaboration. The conference also aims at motivating the next generation of researchers to promote their interests in Biomedical Engineering and Artificial Intelligent Systems.
Important Dates:
Registration Deadline: March 26, 2024
Final Paper Submission Date: April 22, 2024
Conference Dates: April 26-28, 2024
---Call For Papers---
The topics of interest for submission include, but are not limited to:
- Biomedical Signal Processing and Medical Information
· Biomedical signal processing
· Medical big data and machine learning
· Application of artificial intelligent for biomedical signal processing
......
- Bioinformatics & Intelligent Computing
· Algorithms and Software Tools
· Algorithms, models, software, and tools in Bioinformatics
· Biostatistics and Stochastic Models
......
- Gene regulation, expression, identification and network
·High-performance computational systems biology and parallel implementations
· Image Analysis
· Inference from high-throughput experimental data
......
For More Details please visit:
I am interested in analyzing the correlation between the expression of a set of genes and transposable elements (TEs) in cancer. However, despite there are multiple online databases for gene expression in cancer, including TCGA, they do not include repetitive elements. Despite I've found some papers analyzing transposable elements and quantifying their expression in different cancer using TCGA data, supplemental tables only provide the fold change end p-values for differentially expressed TEs. Also, to identify and quantify TEs, the raw sequencing data, which have controlled access, would be necessary. Therefore, I was wondering if there is some database or published resource where I could find information regarding TE expression per sample in TCGA database. Does someone know something like that? Alternatively, if someone have analyzed this type of data and have some worksheet with pre-processed data that could be shared, I would be deeply grateful.
Dear ResearchGate network,
Recently I received an invitation to act as Associate Section Editor for Bentham Science for Current Bioinformatics for a period of 2 years, depending on de performance. Due I don’t have sufficient experience as editor, So, they requested me to given a name of a senior researcher with a h-index of 15 at least and knowledge in Bioinformatics to act together with me as a Section Editor (coeditor). As role of this coeditor is to propose at least one issue per year of a relevant theme for a special edition In Bioinformatics. Anyone here have knowledge from anyone who fits to this profile and could indicatr he/she for me, please?
I thank you in advance.
Pedro Paulo Gattai Gomes, PhD
What to do if ChimeraX software doesn't recognise the .chimerax file downloaded from SwissDock after docking?
Besides, the zip file of prediction done was empty.
Thank you.
Explore the synergistic impact of machine learning on improving the precision of predicting protein structures in bioinformatics. Seeking insights into the specific methodologies and advancements that contribute to enhanced accuracy.
In the rapidly evolving landscape of the Internet of Things (IoT), the integration of blockchain, machine learning, and natural language processing (NLP) holds promise for strengthening cybersecurity measures. This question explores the potential synergies among these technologies in detecting anomalies, ensuring data integrity, and fortifying the security of interconnected devices.
I've tried trimAl and Gblocks, but I'm unable to access the programs through the provided links. Thank you.
:)
This question blends various emerging technologies to spark discussion. It asks if sophisticated image recognition AI, trained on leaked bioinformatics data (e.g., genetic profiles), could identify vulnerabilities in medical devices connected to the Internet of Things (IoT). These vulnerabilities could then be exploited through "quantum-resistant backdoors" – hidden flaws that remain secure even against potential future advances in quantum computing. This scenario raises concerns for cybersecurity, ethical hacking practices, and the responsible development of both AI and medical technology.
Dear all, it is with great pleasure that I make public my latest exploration of openAI APIs. On this prototype, I have tested a medical chatbot.
Hope you enjoy the reading!
#bioinformatics #healthinformatics #medicine #chatbots #largelanguagemodels #openai #computervision #deeplearning #medicalimaging
You can leave a public review on
Recently, I installed Modeller 10.4 software into my windows 10, 10GB RAM, 64x bit laptop to predict a 3D structure of a membrane protein (a.a length 574).
In this case , i used advanced modeller option to prediction. Because we can use multiple templates for structure prediction. But from the start I got errors when running the python script.
1)May I know what is the maximum number of templates,which can be used for advanced modeling.
What are good resources for an undergraduate student to start getting familiar with bioinformatics and, if possible, get some practical experience? Any favorite websites, blogs, videos, etc?
Thanks!
Hello, I've recently started exploring molecular docking applications, and I'm still in the early stages. I'd like to ask Can I choose a ligand by giving the amino acid sequence and then do docking? Which applications would you suggest?
Thank you
Hello, I've recently started exploring molecular docking applications, and I'm still in the early stages.I'd like to ask which proteins should be considered when examining the antimicrobial effects of certain molecules.
Is there a list of these proteins(that I should use as a docking protein), or are there general rules for proteins that should definitely be examined?
Also, can I perform docking not with a molecule but directly with an organism? If so, what should I look for to predict antimicrobial effects?
Could you please guide me on this?
Thank you.
Hello. We understand that a volcano plot is a graphical representation of differential values (proteins or genes), and it requires two parameters: fold change and p-value. However, for IP-MS (immunoprecipitation-mass spectrometry) data, there are many proteins identified in the IP (immunoprecipitation group) with their intensity, but these proteins are not detected in the IgG (control group)(the data is blank). This means that we cannot calculate the p-value and fold change for these "present(IP) --- absent(IgG)" proteins, and therefore, we cannot plot them on a volcano plot. However, in many articles, we see that these proteins are successfully plotted on a volcano plot. How did they accomplish this? Are there any data fitting methods available to assist in drawing? need imputation? but is it reflect the real interaction degree?
Hi, I am working on protein-protein interaction studies, specifically on antibody-antigen interaction. I would like to observe the changes in interaction if there's mutation occurs in the protein. Could anyone suggest a tool that can be used to induce substitution mutation to a targeted amino acid of a 3D protein and tools to validate that the mutation is not a nonsense mutation that produces truncated protein?
I want to find the UTR sequence of mRNA sequence of bacteria protein. Can anyone suggest a insilico process for that
As of now, there is no public database available for this kind of sample to take as a control.
I'm on the lookout for remote bioinformatics and computational biology opportunities where I can actively contribute to research projects. Compensation is not a priority for me; my main focus is to gain hands-on experience in these fields.
#biopython
#computational_biology
#bioinformatics
#biology
#R
Hi,
I am beginner in "Bioinformatics" and want to learn " how to analyse bacterial and fungal genomic data?". Would you suggest me some materials and sources so that I can devleop myself?
Note: My interest is now on " Bacterial and fungal genome and proteome analysis by using bioinformatics"
Hello,
I am trying to construct phylogenetic tree of HIV-1. I downloaded sequences from few neighbor countries from Los Alamos HIV database. After aligning and trimming the length of sequences is usually 722 nucleotides. I can't trim less, because there are a lot of gaps within alignment file. When I construct Maximum Liklehood tree in FastTree or PhyML, the branches look very short. What could be a possible reason for it?
If 722 nucleotides length sequences can be used for constructing reliable phylogenetic tree?
Thank you!
Hi,
I am beginner in "Bioinformatics" and want to learn " how to analyse bacterial and fungal genomic data?". Would you suggest me some materials and sources so that I can devleop myself?
Note: My interest is now on " Bacterial and fungal genome and proteome analysis by using bioinformatics"
Hello fellow researchers,
I wanted to start a discussion on the exciting topic of the future of bioinformatics and its evolution. Bioinformatics has come a long way in recent years, but there are undoubtedly new frontiers to explore and challenges to overcome. What are your thoughts on the current trends, emerging technologies, and the potential impact of bioinformatics in the years to come? I'm eager to hear your insights and predictions on the future of this rapidly evolving field.
I am reaching out to #researchers in the field of #Biochemistry, #Biophysics and #Bioinformatics, for collaborative partnership in scientific research. The researcher should be academic staff at the tertiary institutions in following listed countries:
#Afghanistan
#Angola
#Bangladesh
#Belarus
#Belize
#Benin
#Bhutan
#Burkina Faso
#Burma
#Burundi
#CaboVerde
#Cambodia
#Cameroon
#CentralAfricanRepublic
#Chad
#Comoros
#Congo
#CookIslands
#Cuba
#Democratic People's Republic of Korea
#Democratic Republic of the Congo
#Djibouti
#Dominica
#EquatorialGuinea
#Eritrea
#Eswatini
#Ethiopia
#Gambia
#Ghana
#Grenada
#Guinea
#Guinea-Bissau
#Guyana
#Haiti
#Iran
#IvoryCoast
#Kenya
#Kiribati
#Kyrgyzstan
#Lao People's Democratic Republic
#Lebanon
#Lesotho
#Liberia
#Madagascar
#Malawi
#Maldives
#Mali
#Marshall Islands
#Mauritania
#Micronesia (Federated States of)
#Mozambique
#Myanmar
#Nauru
#Nepal
#Nicaragua
#Niger
#Niue
#Palau
#PapuaNewGuinea
#Moldova (Republic of)
#Rwanda
#SaintHelena
#SaintLucia
#SaintVincent and the #Grenadines
#Samoa
#SaoTome and #Principe
#Senegal
#Sierra Leone
#SolomonIslands
#Somalia
#SouthSudan
#Sudan
#Suriname
#Syrian Arab Republic
#Tajikistan
#Timor-Leste
#Togo
#Tokelau
#Tonga
#Tuvalu
#Uganda
#Ukraine
#Tanzania (United Republic of)
#Vanuatu
#Yemen
#Zambia
#Zimbabwe
Interested researcher should kindly email to [email protected] with the subject: Research Collaboration from "your country".
Thanks.
Toluwase H. Fatoki
Visionary @ Heze-Sapience International, Nigeria.
Lecturer @ Department of Biochemistry, Federal University Oye-Ekiti, Nigeria.
Our lab have a bioinformatics project about developing a functional enrichment software. We have several ideas but we realize we need real feedback from wet lab researchers as well to make sure our functional enrichment web application will be reliable and useful for all of you.
Therefore, if you are a wet lab scientist who have experience using functional enrichment software (such as Metascape, DAVID, etc), what kind of questions do you want to address in the functional enrichment result? Are there any information that they are still unable to give to you?
I already know the pathway but want to know the upstream lncRNAs that regulates that pathway using the datasets and bioinformatics.
In their website they mentioned it's IF is 5.8. But in the JIF2022 report, I did not find. Is it because of its inclusion in the Emerging Source Citation Index? and because of not included in the "Science Citation Index Expanded" Please help.
From where can I get valid IF. One more thing, this journal is not included in BioxBio, have checked.
Could someone explain to me why the p-value in the right column of the forest plot is different than the p-value in the test for effect in the subgroup?
I thought that these two p.values should be the same.
Hello, I've recently been studying Ancestral Sequence Reconstruction (ASR), attempting to infer ancestral sequences of viruses. I understand that this inference is constrained by factors like sample size and models, and represents a plausible sequence that may have existed. However, I'm curious about whether directly comparing these inferred ancestral sequences holds biological significance. Can they reflect the differences among the extant sequences from various lineages that were used to infer them?
Dear All,
Ph.D. full-time position in Bangalore with fellowship:
Eligibility: M.Sc. Chemistry/Biochemistry/Biotechnology/Microbiology/Bioinformatics with first class of 60%.
GATE or UGC-NET or UGC-CSIR or SLET or JRF should be qualified.
RS 25,000 per month for full three years will be given.
For further details, contact me on: +919182864256. Call or what's app me for further details.
I am trying to analyse mutation data for endometrial cancer obtained from different studies within several databases (COSMIC, cBioportal, Intogen). I have collated the data and grouped the mutations by gene. The focus of the analysis are non-synonymous coding mutations - because these mutations are most likely to cause a change in the normal protein function.
The aim of the study is to understand the mutational landscape of Endometrial cancer. The main objectives of the study are to find the commonly mutated genes in endometrial cancer, to find significantly damaging gene mutations in endometrial cancer and to create an updated list of genes comparable to commercial gene panels.
I have created this table with the collated data:
- Gene name
- Number of samples with coding mutations
- Frequency ( number of samples with coding mutations / total number of samples with coding mutation)
- CDS length
- Total number of unique coding mutations
- Number of unique coding: synonymous mutations
- Number of unique coding: non-synonymous mutations
- Mutation burden (number of unique coding: non-synonymoys mutations / CDS length)
- Composite score [(frequency of samples * 0.7) + (mutation burden * 0.3)]
The idea here is to use mutation burden to imply damaging effects of the genes' mutations in endometrial cancer. We then created a composite score to use as a comparable figure between the genes.
At the moment, our list of genes is at 16,000+. We are currently trying to think of a way to narrow down the list of genes to only focus on those significantly mutated compared to the other genes by way of statistics. Any advice is greatly appreciated.
We had sent some phytoplankton samples for sequencing. And we had just received the generated sequences, and the next step was to do BLAST to identify what the phytoplankton that we sent is. Basically DNA Barcoding.
To give some context, when we send our samples for sequencing to the sequencing facility, they send us back two files, one for the forward sequence and another for the reverse sequence, based on the primers (forward and reverse) we gave.
So, the initial step involves us checking the quality of the sequences, specifically looking for any signs of low quality, ambiguity, or overlapping signals in the chromatograph.
Now, I'm a bit uncertain about the next steps.
The following step would be sequence trimming. To do this, I need to identify the start of each sequence by locating the primer sequence. This means finding the forward primer sequence in the generated forward sequence and doing the same for the reverse primer in the reverse sequence.
Afterward, I perform reverse complementation on the reverse sequence.
Following that, I conduct a pairwise alignment between the generated forward and reverse sequences and subsequently generate the consensus sequence.
My questions are, as I am a bit stumped with this (I apologize in advance, I'm a bit new with bioinformatics), (1) what if neither of the generated sequences have the primer sequences? Would that mean the sequences generated were of bad/low quality? and (2) Is this approach correct, or have I missed a crucial step?
Thank you!
I have extensively searched google scholar but I am struggling to find any groups who have previously used Rosetta to conduct ab-initio structure modelling of single-pass or membrane anchored proteins and I'm specifically not talking about homology modelling just ab-initio.
Please let me know if you have read any papers or know anyone who has done this,
thanks.
2nd year PhD student at University of Liverpool.
Dear all,
I'm working on the finer details of my experimental design, and have some questions regarding bridging channels for TMT based experiments.
I have two conditions to test, across nine biological replicates, in order to run as one 18-plex TMT-pro experiment.
I am aware of the use of one or more bridging channels being used with pooled samples to combine multiple TMT mixtures, however a colleague has mentioned that a bridging channel should also be considered for normalisation if only one set is used.
Does anyone have any experience using a bridging channel for normalisation in a single mixture? Is it worth sacrificing one or more biological replicates for?
I will be using MSstatsTMT for normalisation and summarisation.
Sam
Hello there,
I'm searching for reliable bioinformatics/immunoinformatics tools for predicting the immunogenicity of B-Cell Epitopes. Your expertise is invaluable! Could you kindly recommend any devices that have proven effective in this area? Your insights will significantly contribute to advancing our understanding of immunogenicity prediction.
Thank you in advance for your suggestions!
Molecular dynamics simulation , bioinformatics , molecular docking
Are you familiar with Research4Life? It's a program that provides free or low-cost access to scientific research in low-income countries. Research4Life has two eligibility lists: Group A and Group B. Group A includes countries with the lowest gross domestic product, lowest human development index, and other factors that indicate lower-income countries. As an immunoinformatics, Bioinformatics and Molecular Modelling researcher, I'm calling on researchers from Research4Life's Group A countries to join me in collaborative research efforts. By working together and utilizing the program's valuable resources, we can advance our research and make a difference in the world. Best of all, with this collaboration, it will be completely free. #Research4Life #immunoinformatics #bioinformatics #molecularmodelling #collaboration
Hello everyone; I am new to R programming. I want to calculate the firmicutes to Bacteroides ratio from my OTU table. I couldn't find the command and don't know how to do it. Please guide me on this.
I put an example of my OTU table.
Hello,
I measured the distance between two centers of mass during a MD run using gmx distance. Even though the -oall file shows me that the distance changed over time the histogram file -oh puts 100% of probability on the last bin.
As this makes no sense does anyone have an idea on what happened?
Both files are attached
Thank you very much in advance and have a nice day!
I have been trying to dock a certain protein with nd ion i downloaded from rcsb but after i add it to pyrx and try to convert it to ligand i get the following error. I tried converting the sdf file to pdb using pymol, chimeraX, avogadro, open babel but even then when i open the file it gives me this error: ligand: :UNK0:Nd and ligand: :UNK0:Nd have the same coordinates. Could someone please help?
Update: I want to dock an unbound protein with the neodymium metal ion which i downloaded from rcsb in sdf format and later tried to convert it to pdb using the aforementioned softwares for autodock to accept it but i can't get it to be accepted by autodock as a proper ligand. Apparently I am unable to get any of the rare earth elements to be accepted properly as ligands.
I know many websites have simple tools like transcription and translation available, but are there any analysis tools that researchers need that either do not exist or are not publicly available? It could be anything from algorithms to visuals. Thanks!
Hello All,
I am very new to bioinformatics and biological data , please bare with my question.
I have differential expression data of three, Parental cellines(drug sensitive ) and 10 isoforms (made resistant to the drug) by these three parental cells.
Is the data enough to generate a coexpression network.?
I Have tried constructing it using GWENA , and was also successful but I am not confident about it because of two reasons one number of samples and second can isoforms be treated as samples or not.
I would really appreciate any suggestions and anr reading resource that can be helpful in this regard.
Thankyou
In recent years, number of vaccine have been approved to fight against Covid-19, list of approved is available at FDA site. We are looking for sequence of these vaccine (RNA sequence in case of mRNA vaccines and amino acid sequence in case of protein based vaccines. I will highly appreciate help of community in searching sequence of vaccines.
Greetings,
I have recently isolated a new E.coli phage and during the assessment of its host range, I discovered that this particular phage was effective against Pseudomonas aureginosa and staphylococcus aureus in wet lab experiments. However, upon examining the complete genome of the phage on NCBI, I noticed that it did not exhibit any similarities with known P. aureuginosa and S. aureus phages. Additionally, when I performed a blastp analysis on all the phage proteins in NCBI, I could not identify any homology with the aforementioned P. aureuginosa and S. aureus phages. Normally, I would expect to observe some degree of homology, especially in proteins responsible for recognition, such as tail proteins or lytic proteins.
My question is how I can determine the wide host range of the phage based on its genome. It appears that bioinformatic tools should provide information regarding the extent of the phage's host range. I would greatly appreciate your comments and recommendations on this matter.
Thank you.
Has any of you ever done research in the field of bioinformatics?
I want to annotate each gene in the Homo sapiens taxon with its respective GO terms and its hierarchical parent terms in the GO database. How can I systematically do that? While I am aware that the obo file contains information such as "is a," "part of," and "regulates," it lacks a comprehensive hierarchy from child GO terms to all their parent terms. Is there an existing method available to achieve this systematic annotation, or do I need to develop a custom script to extract this information from the obo file?
I have been experimenting with machine learning in JavaScript, please, let me know also your experience! 😎🤗😍
In attachment a preprint!
Dear ResearchGate Community,
I am currently engaged in single-cell analysis for my research project and would greatly appreciate your insights and experiences regarding the use of Seurat and ScanPy.
I have been exploring both Seurat and ScanPy as tools for analyzing single-cell RNA sequencing (scRNA-seq) data. However, I would like to gather more information about these packages directly from researchers who have bioinformatic hands-on experience with them.
Specifically, I would be grateful if you could share your thoughts on the following:
1. Which package (Seurat or ScanPy) have you used for scRNA-seq analysis, and what were your primary reasons for choosing it? Is it depending on familiarity with programming languages (R for Seurat and Python for Scanpy)?
2. What are the notable features, strengths, or advantages of the packages you have worked with?
3. Were there any challenges or limitations you encountered while using the packages, and how did you address them?
4. Have you encountered any specific use cases or applications where one platform outperformed the other?
5. Are there any particular resources, tutorials, or best practices you found helpful when working with Seurat or ScanPy?
Your firsthand experiences and insights would be immensely valuable in helping me make an informed decision about which package to choose and understanding potential considerations for my single-cell analysis workflows.
Thank you in advance for taking the time to share your expertise. I look forward to hearing from you and benefiting from your valuable insights.
Best regards,
Emil Lagumdzic
Institute of Immunology
Department of Pathobiology
University of Veterinary Medicine Vienna
Is the hierarchical structure observed in the Gene Ontology (GO) OBO-basic file limited to the 'is a' relationship, or do the relationships 'has part' and 'regulates' also exhibit a similar hierarchical nature and can be propagated to the root?
I am looking for data from mammals ideally, but I will take anything to be honest. I am getting to grips with bioinformatics and need a practice data set with which I can go through the steps of filtering and trimming and mapping to a reference genome etc..
If anyone also has any advice on tools used subsequently for analysis such as MethylKit that would be awesome.
Thank you
I prefer to join 2 drug molecules (cocktail) using bioinformatics approach. Are there any tools available for it? Any software available where one can submit the individual structure of the drug molecules and receive the merged drug molecules?
I have a protein sequence with two cysteine residues and I would like to predict if those cysteins will form disulfide bonds.
I am looking for user-friendly tools to do this, either online tools or some other kind of easy to use software, since I am not well-versed in bioinformatics.
Please provide useful insights and general experiments required for designing lab manual.
Hi there,
I'm comparing the arrangement of a gene complex across different species to try and find clues about its evolutionary history. In some cases genes appear to have jumped around and switched positions, but I do not know if this is the result of recombination, or due to the orientation in which the chromosome has been assembled?
I'm taking data from the NCBI genome browser using ref seq chromosome level assemblies in each case. Does anyone know if there a standard direction that homologous chromosomes have to be uploaded in?
I imagine this is perfectly possible to do if you consider the positions of conserved genes at each end of the chromosome, but I would rather not have to do this myself if I know that it has already been accounted for...
Thanks,
Jake
If I have a sequence (genome.fasta). And I want to check the gene located in 400nt -500nt.
What bash script (I have WSL in my windows) I should use or are there any conda packages ?
Thank you in advanced
Is there any server or tools (bioconda, java, etc.) to exclusively annotate membrane protein only (similar to dbCAN for polysaccharides) from a bacterial genome?
Thank you in advanced!
Hi - I'm currently working with two RNA-Seq studies; one has RNA extracted from whole blood, the other PBMCs. Eventually we want to combine these data and perform some cell-specific deconvolution to look at DEGs.
Are there any recommended methods for batch correcting these data from different sources?
Mari
I am interested in predicting the protein structure of my protein of interest. Using NCBI BLAST, I found an experimental structure that corresponds to a domain of my protein, showing 24% query coverage and 100% similarity. My question is whether I can confidently use this experimental structure as a template for homology modeling, or if I should explore alternative techniques such as threading, ab initio modeling, or any other suitable approach. I would also appreciate recommendations for relevant servers or software that can assist in this case.
Thank you for your insights and suggestions.
I'm looking for an online course of Bioinformatics with a delivered certificate?
Greetings!
I have an issue that drives me crazy this evening...
I have a list of gene vectors, downregulated in different transgenic plants and I want to make a Venn diagram to visualize it and to show the intersections between plants.
But! The results from any package I used (in R) gaves me something like this (the uploaded picture 1)...
What's bothering me:
1. The numbers on "clear" (not intersected) parts of a diagram are lower, than the gensets I have. And I tried to use factor instead of character vectors, to remove possible duplications, to remove symbols (like space) that could cause software misunderstanding - all gaves me nothing... same result.
2. The intersection of vectors is not true - on the picture you can see that the intersection of 2 datasets (of 365 and 154 genes) - is 1133 genes!! How could that be?
The manual usage of intersect function on the same dataset gaves pretty correct results.
Maybe I am misunderstanding about Venn diagrams? Because in a web I found many examples of such strange mistakes - on the second picture from Datanovia you can see that the intersection of the red elliplse (of 58) and yellow (of 144) is 66!
It seemes logical to me that the intersection of 2 vectors cannot be greater than the length of a smaller vector. What am I doing wrong or misunderstanding?
Hello everyone,
I am not good at R so I am trying to find solutions for my problems through the internet. I have been stuck on a problem. I couldn't find a way to compare the means of groups separated by facet function. Maybe I should not have put x axis as it is now but I wanna make sure. Here is the shorter version of my code for you to have a look at:
my_comparisons <- list( c("Hybrid","Single"))
ggplot(data = rpkms_new2, aes(x = strand, y = log2(RPKM), fill=strand, label = strand))+
geom_violin(scale = "count", alpha=0.5)+
facet_grid(~Trans, switch = "x", scales = "free_x", space = "free_x") +
theme(plot.title = element_text(hjust=0.5))+
theme(panel.spacing = unit(0, "lines"),
strip.background = element_blank(),
strip.placement = "outside") +
stat_compare_means(ref.group = "None", aes(label = ..p.signif..), method = "wilcox")+
stat_compare_means(comparisons = my_comparisons, aes(label = ..p.signif..), method = "wilcox")+
geom_text(data = mean_ranks, aes(x = strand, y = -Inf, label = round(rank, 0)), size = 3, vjust = -1)
How should I modify my code to be able to compare all the subgroups(single and hybrid) with the "None" group ?
My data looks like below:
STRAND TRANS VALUES:
sense hybrid 2
sense hybrid 2
sense single 3
sense single 7
antisense hybrid 10
antisense hybrid 12
antisense single 1
antisense single 2
none none 1
none none 4
I am currently an Indonesian high school student passionate about bioinformatics and its potential to drive impactful innovations in the fields of biology and medicine. I am eager to participate in the Regeneron International Science and Engineering Fair and showcase a research project that can make a significant contribution to the scientific community.
Considering the vast possibilities within the realm of bioinformatics, I would greatly appreciate any suggestions, ideas, or insights for a research project that aligns with the following criteria:
- Impactful Innovation: I am looking for a research topic that has the potential to make a significant impact in the biology or medical world. It could involve the development of new algorithms, computational tools, or methodologies that address critical challenges in these domains.
- Bioinformatics Focus: The research should predominantly involve bioinformatics techniques, such as data analysis, data mining, machine learning, genomics, proteomics, or other computational approaches. It should leverage the power of data and computational tools to gain insights into biological processes or contribute to medical advancements.
- Feasibility for a High School Student: As a high school student, I have certain limitations in terms of resources, time, and expertise. Therefore, I am seeking research ideas that are feasible for a high school-level project. While the topic should be challenging enough to meet the standards of the Regeneron ISEF, it should also be manageable within the scope of a high school research project.
Thank you in advance for your valuable suggestions and insights.
Hello everybody, I'm a master degree student. I'm working with 16S data on some environmental samples. After all the cleaning, denoising ecc... now I have an object that stores my sequences, their taxonomic classification, and a table of counts of ASV per sample linked to their taxonomic classification.
The question is, what should I do with the counts for assessing Diversity metrics? Should I transform them prior to the calculation of indexes, or i should transform them according to the index/distance i want to assess? Where can I find some resources linked to these problems and related other for study that out?
I know that these questions may be very simple ones, but I'm lost.
As far as I know there is no consensus on the statistical operation of transforming the data, but i cannot leave raw because of the compositionality of the datum.
Please help
I'm interested in studying specific missense mutations in a human gene. My goal is to determine whether the mutated region of the protein is conserved across various species. Could you please guide me on how I can use in silico tools to find homologous protein sequences and identify their conserved regions?
Thank you very much
Hi, I am a beginner in bioinformatics and I would like to identify CRISPRs in my MAGs fasta files. Can someone recommend an up-to-date good tool that can be easily installed through the Conda environment, please? Thank You in advance
Dear Researchers,
If anyone is interested in reviewing manuscript on multiepitope vaccine design. Please provide your following details:
Note: Reviewers from India, Pakistan, Egypt & Saudi Arabia are not eligible for this manuscript.
First Name:
Last Name:
Degree:
Position:
Institution:
Department:
Institutional E-mail id:
Can an MD simulation be performed by adding other salts by varying their concentration inside the box?
"The result shows absence of intragenomic variation among 16S rDNA gene and presence of variable regions among the 16S rDNA sequences (intergenomic variation), noticing for example high variability around 800, 900, and 1000 bp and a large conserved region between 1150 and 1350 bp. This information allowed us to discard the restriction enzymes FnuII, AsuI, FokI, Eco57I that recognized some restriction sites contained within variable regions, since they are more susceptible of acquiring future nucleotidic variations and with this, the potential generation of different band patterns." [1]
I add that the article mentioned that these discarded enzymes were targeting conserved sites in the study species.
[1]Mandakovic D, Glasner B, Maldonado J, Aravena P, González M, Cambiazo V, Pulgar R. Genomic-Based Restriction Enzyme Selection for Specific Detection of Piscirickettsia salmonis by 16S rDNA PCR-RFLP. Front Microbiol. 2016 May 9;7:643. doi: 10.3389/fmicb.2016.00643. PMID: 27242682; PMCID: PMC4860512.
Is my reading right that the article implies that there is such potential? If yes, what are the possible mechanisms?
More important, what's the time frame of this "future nucleotidic variation", is it an evolutionary time frame that could take thousands of years?
Edit: i think my question can be thought of as: How common are new 16s rRNA gene variants in bacterial species?
Dear Friends and connection
I believe in the power of community. So, I post this,
I am excited to explore the possibility of collaborating with someone who works on network pharmacology. As, network pharmacology is an interdisciplinary field that combines principles of network analysis, bioinformatics, and pharmacology to investigate drug-target interactions and predict the therapeutic effects of drugs.
I have some projects related to bioinformatics and I believe that our collaboration can result in significant progress in this exciting field.
I am looking forward to hearing from you and exploring our collaboration for network pharmacology.
Regards
Shopnil Akash
WhatsApp: +8801935567417
Email: [email protected]
I've recently been using the NCI's Cancer Genome Atlas to find datasets and perform basic clinical correlation analyses. I think it's a fantastic tool, even for people with a limited bioinformatics background, so it made me curious if there are similar resources for people who study non-cancer diseases.
I was wondering if people are aware of any other databases/repositories/webtools that serve a similar purpose for non-cancer diseases. If anyone has recommendations/suggestions, please comment/link them down below.
Thanks in advance for your input!
"Is there any in-silico methods for studying the effect of up-regulation and down-regulation of the same genes?"
If yes, please suggest me the name/article.....Thank you
What bioinformatics tools are available to help analyze and interpret large-scale molecular data generated from crop research?
We all know that nanobody development is time and money consuming, it nearly needs a grant. I'm wondering if there is any bioinformatics tool or a method to predict nanobody sequence against certain antigen using this antigen sequence as an input ? Something like you put in the antigen sequence and that tool could predict how the nanobody against this antigen could be, in term of sequence, structure, etc?
Hi, I would like to ask if anybody has positive experiences with single primer PCR ? Can you recommend me any proven protocol of this type of PCR ? Thank you for all recommendations. Bohuš
I am running an MD simulation on a protein-protein complex.
After seeing a similar question on research gate, I checked the amino acids rtp file in my force fields folder, and as expected from this error, the HD1 atom was not present in the HSE entry. The atom HD2 is however present in that entry. So I figured replacing the HD1 atoms in my PDB file with HD2 should solve the error.
And it did. For the time being.
To reaffirm, I made changes in Histidine's hydrogen atoms in the PDB file. When I went ahead with the energy minimization step, I got an error that said there's an Infinite Force on an atom. It turns out that the atom was "HD2" of some Histidine in the PDB file.
I saw online that the reason behind this error was due to atom overlap. Hence, just for seeing if that was the case for me, I changed the coordinates of that atom a little bit (this was just for checking, I can't do this for the actual work). When I ran the EM step again, I got the same error, but for a HD2 of a different Histidine molecule. So yes, overlapping of the atoms is the reason for this particular error. I cannot solve it by changing coordinates of all the HD2 atoms of the Histidines. So it all boils down to the main fatal error that I mentioned.
How do I approach this?
1. Changing the atom name (as in HD1 -> HD2 is not working due to the subsequent error)
2. I do not know if I should add the atom HD1 in the HSE entry in the rtp file (I tried this and got several warnings).
3. I cannot (or should I?) use -ignh because mine is not an NMR structure. I have modelled my proteins on Modeller and refined them online.
Any suggestions/solutions will help me a lot. Thank you in advance!
I've been trying to know more about bioinformatics pipelines for whole genome shotgun sequencing data to use for the samples of animal fecal microbes diversity and identify pathogenic microorganisms (both of DNA and RNA).
I have tried to separate a direct coculture of MSCs (mesenchymal stromal cells) and macrophages to do bulk RNA seq on macrophages, as I want to find out how MSCs change the genetic expression on macrophages. I have tried different methods to separate the coculture as much possible, but I can only manage to retrieve a cell population with 95% macrophages, and 5% MSCs still present.
Therefore, I want to know if anyone has experience with analyzing data when the population is not completely pure with one cell type and how do I handle such data?
Is it wise to proceed with bulk RNA seq when 5% of my cells are still MSCs, well aware that the expressed genes observed could come from the 5% MSCs?
Risk of bias assessment (sometimes called "quality assessment" or "critical appraisal") helps to establish transparency of evidence synthesis results and findings. and it is mandatory to have it in your systematic review!
if you know any tools or used ones, can you please share it/them with me?
or if you have extra information regarding the risk of basis assessments, can you share it with me?