Identification of new members of the hydrophobin family using primary structure analysis
Background: Hydrophobins are fungal proteins that self-assemble into amphipathic membranes at hydrophilic/hydrophobic interfaces. The assemblages formed by Class I hydrophobins are extremely stable and have the remarkable ability to change the polarity of a surface. One of their most important industrial applications is their use as paint. Without detailed knowledge of the 3D structure and self-assembly principles of hydrophobins, it is difficult to make significant progress in hydrophobin research.
Results: To provide useful information to hydrophobin researchers, we analyzed the primary structure of hydrophobins. We present an in-depth primary sequence analysis that combines a batch BLAST search, programmatic sequence filtering, and motif finding with MEME. We first used batch BLAST to find similar sequences in the NCBI nr database, and then used MEME to identify motifs in them. Using the newly found motifs, in PSSM format, together with the well-known C-CC-C-C-CC-C pattern, we searched the entire nr database with MAST, which returned many sequences from various species. Filtering these hits by pattern, domain, and length left 9 qualified candidates. Finally, domain search and phylogenetic analysis were conducted to confirm the result.
Conclusion: All 9 newly identified potential hydrophobins possess the common cysteine pattern and the hydrophobin domain. In the multiple sequence alignment, some of them group very closely with known hydrophobins, which indicates a close phylogenetic relationship and makes it highly plausible that they are indeed hydrophobin proteins.
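The pattern and length filtering step described in the Results can be illustrated with a minimal sketch. It is not the authors' program: the function names, the reading of each "-" in C-CC-C-C-CC-C as one or more non-cysteine residues, and the length bounds are all assumptions made for illustration, and the domain check is not covered.

```python
import re

# Eight conserved cysteines arranged as C-CC-C-C-CC-C.
# Assumption: each "-" stands for one or more non-cysteine residues;
# the abstract does not state the allowed gap lengths.
CYS_PATTERN = re.compile(r"C[^C]+CC[^C]+C[^C]+C[^C]+CC[^C]+C")


def has_cys_pattern(seq: str) -> bool:
    """Return True if the sequence contains the C-CC-C-C-CC-C arrangement."""
    return CYS_PATTERN.search(seq.upper()) is not None


def filter_candidates(records, min_len=50, max_len=250):
    """Keep sequences that pass both the length and the pattern filter.

    records: dict mapping a sequence name to its amino acid sequence.
    min_len / max_len: hypothetical length bounds, not values from the paper.
    """
    return {
        name: seq
        for name, seq in records.items()
        if min_len <= len(seq) <= max_len and has_cys_pattern(seq)
    }
```

In a pipeline of this kind, `records` would hold the sequences returned by the MAST search, and the survivors would then go on to the domain search.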