CuGenDBv2

CmoCh06G015460 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh06G015460
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function (DUF789)
Genome location: Cmo_Chr06:10890750..10894181
RNA-Seq Expression: CmoCh06G015460
Synteny: CmoCh06G015460
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008507 - Protein of unknown function DUF789
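
As a rough illustration of how the genome location field can be read programmatically, the sketch below splits the string into a sequence ID, start, and end, and derives the span length. The function name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1. The page itself does not say this.
    """
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)",
                         location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match["start"])
    end = int(match["end"])
    return match["seqid"], start, end, end - start + 1


seqid, start, end, length = parse_location("Cmo_Chr06:10890750..10894181")
print(seqid, start, end, length)  # Cmo_Chr06 10890750 10894181 3432
```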


Homology
GenBank top hits (e value, % identity, alignment)
KAG6597557.1 hypothetical protein SDJN03_10737, partial [Cucurbita argyrosperma subsp. sororia]; e value: 8.1e-235; % identity: 98.13
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST
        MSVSGGVSIARIRGENRFYHPPAMRRRL     QQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST

Query:  NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRGADSDA
        NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL RRRGADSDA
Subjt:  NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRGADSDA

Query:  ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC
        ESSKETTSDGSSNCGM KKTSTALQDEWIQDSSVTGSRRA QMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC
Subjt:  ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW

Query:  QDADNWLRSLNVNHPDYRFFASHTSSGR
        QDADNWLRSLNVNHPDYRFFASHTSSGR
Subjt:  QDADNWLRSLNVNHPDYRFFASHTSSGR

KAG7029002.1 hypothetical protein SDJN02_10185, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 5.6e-236; % identity: 98.39
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL-----QQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRG
        MSVSGGVSIARIRGENRFYHPPAMRRRL     QQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRG
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL-----QQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRG

Query:  VADSTNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRG
        VADSTNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL RRRG
Subjt:  VADSTNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRG

Query:  ADSDAESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDK-ITILASRFPEL
        ADSDAESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDK ITILASRFPEL
Subjt:  ADSDAESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDK-ITILASRFPEL

Query:  KTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECP
        KTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECP
Subjt:  KTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECP

Query:  KANTLWQDADNWLRSLNVNHPDYRFFASHTSSGR
        KANTLWQDADNWLRSLNVNHPDYRFFASHTSSGR
Subjt:  KANTLWQDADNWLRSLNVNHPDYRFFASHTSSGR

XP_011651067.2 uncharacterized protein LOC101208769 isoform X2 [Cucumis sativus]; e value: 1.4e-207; % identity: 87.38
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEK-SEVDECRSWSTRSDCSVSDRGVADS
        MSVSGGVSIARIRGENRFYHPPAMRRRL QQQQQQQQQQQQQQQQQQQQQ KQ+ALD K+V AA T+ ID+LEK SE DECRSWSTRSDCSVSDRG+ADS
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEK-SEVDECRSWSTRSDCSVSDRGVADS

Query:  TNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRRGADSDA
        TNLDRFLE+TTP+VPA C  KTSL+GWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLY+DPSKSSALRRRGADSDA
Subjt:  TNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRRGADSDA

Query:  ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC
        ESSKET+SDGSSN G  KKT TALQ+EWIQD +V GS+RALQMNVPS+ESSSDESDSCYR GQLVFEY+E DPPFCREPLTDKIT+LASRF ELKTYRSC
Subjt:  ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAFQGI TDGLQFHWPRVREV+TA+ PLKLQLP FGLASYKFK  FWNSTG EEC KA++LW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW

Query:  QDADNWLRSLNVNHPDYRFFASHTSSGR
        QDAD+WLR LNVNHPDYRFFASH S  R
Subjt:  QDADNWLRSLNVNHPDYRFFASHTSSGR

XP_022946432.1 uncharacterized protein LOC111450487 isoform X1 [Cucurbita moschata]; e value: 3.2e-239; % identity: 99.77
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST

Query:  NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRGADSDA
        NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL RRRGADSDA
Subjt:  NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRGADSDA

Query:  ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC
        ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC
Subjt:  ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW

Query:  QDADNWLRSLNVNHPDYRFFASHTSSGR
        QDADNWLRSLNVNHPDYRFFASHTSSGR
Subjt:  QDADNWLRSLNVNHPDYRFFASHTSSGR

XP_022946439.1 uncharacterized protein LOC111450487 isoform X2 [Cucurbita moschata]; e value: 1.3e-240; % identity: 100
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST

Query:  NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRRGADSDAE
        NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRRGADSDAE
Subjt:  NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRRGADSDAE

Query:  SSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCD
        SSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCD
Subjt:  SSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCD

Query:  LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLWQ
        LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLWQ
Subjt:  LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLWQ

Query:  DADNWLRSLNVNHPDYRFFASHTSSGR
        DADNWLRSLNVNHPDYRFFASHTSSGR
Subjt:  DADNWLRSLNVNHPDYRFFASHTSSGR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L5V4 Uncharacterized protein; e value: 3.0e-203; % identity: 84.85
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEK-SEVDECRSWSTRSDCSVSDRGVADS
        MSVSGGVSIARIRGENRFYHPPAMRRRL           QQQQQQQQQQQ KQ+ALD K+V AA T+ ID+LEK SE DECRSWSTRSDCSVSDRG+ADS
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEK-SEVDECRSWSTRSDCSVSDRGVADS

Query:  TNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRGADSD
        TNLDRFLE+TTP+VPA C  KTSL+GWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLY+DPSKSSAL RRRGADSD
Subjt:  TNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRGADSD

Query:  AESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRS
        AESSKET+SDGSSN G  KKT TALQ+EWIQD +V GS+RALQMNVPS+ESSSDESDSCYR GQLVFEY+E DPPFCREPLTDKIT+LASRF ELKTYRS
Subjt:  AESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRS

Query:  CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTL
        CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTAFQGI TDGLQFHWPRVREV+TA+ PLKLQLP FGLASYKFK  FWNSTG EEC KA++L
Subjt:  CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTL

Query:  WQDADNWLRSLNVNHPDYRFFASHTSSGR
        WQDAD+WLR LNVNHPDYRFFASH S  R
Subjt:  WQDADNWLRSLNVNHPDYRFFASHTSSGR

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X1; e value: 5.2e-203; % identity: 85.08
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEK-SEVDECRSWSTRSDCSVSDRGVADS
        MSVSGGVSIARIRGENRFYHPPAMRRRL      QQQQQQQQQQQQQQQQ KQ+ALD K+V AA T+ ID+LEK SE DECRSWSTRSDCSVSDRG+ DS
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEK-SEVDECRSWSTRSDCSVSDRGVADS

Query:  TNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRGADSD
        TNLDRFLE+TTP+VPA C  KTSL+GWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLY+DPSKS AL RRRGADSD
Subjt:  TNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRGADSD

Query:  AESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRS
        AESSKET+SDGSSN G  KKT TALQ+EWIQD +  GS+RALQMNVPS+ESSSDESDSCYR GQLVFEY+E DPPFCREPLTDKIT+LASRFPELKTYRS
Subjt:  AESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRS

Query:  CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTL
        CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA QG  TDGLQFHWPRVREV+TA+ PLKLQLP FGLASYKFK  FWNSTG EEC KA++L
Subjt:  CDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTL

Query:  WQDADNWLRSLNVNHPDYRFFASHTSSGR
        WQDAD+WLR LNVNHPDYRFFASH S  R
Subjt:  WQDADNWLRSLNVNHPDYRFFASHTSSGR

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X2; e value: 2.1e-204; % identity: 85.28
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEK-SEVDECRSWSTRSDCSVSDRGVADS
        MSVSGGVSIARIRGENRFYHPPAMRRRL      QQQQQQQQQQQQQQQQ KQ+ALD K+V AA T+ ID+LEK SE DECRSWSTRSDCSVSDRG+ DS
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEK-SEVDECRSWSTRSDCSVSDRGVADS

Query:  TNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRRGADSDA
        TNLDRFLE+TTP+VPA C  KTSL+GWRNREVSEA PYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLY+DPSKS ALRRRGADSDA
Subjt:  TNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRRGADSDA

Query:  ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC
        ESSKET+SDGSSN G  KKT TALQ+EWIQD +  GS+RALQMNVPS+ESSSDESDSCYR GQLVFEY+E DPPFCREPLTDKIT+LASRFPELKTYRSC
Subjt:  ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFH+LSTA QG  TDGLQFHWPRVREV+TA+ PLKLQLP FGLASYKFK  FWNSTG EEC KA++LW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW

Query:  QDADNWLRSLNVNHPDYRFFASHTSSGR
        QDAD+WLR LNVNHPDYRFFASH S  R
Subjt:  QDADNWLRSLNVNHPDYRFFASHTSSGR

A0A6J1G3Q7 uncharacterized protein LOC111450487 isoform X1; e value: 1.5e-239; % identity: 99.77
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST

Query:  NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRGADSDA
        NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL RRRGADSDA
Subjt:  NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSAL-RRRGADSDA

Query:  ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC
        ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC
Subjt:  ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSC

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLW

Query:  QDADNWLRSLNVNHPDYRFFASHTSSGR
        QDADNWLRSLNVNHPDYRFFASHTSSGR
Subjt:  QDADNWLRSLNVNHPDYRFFASHTSSGR

A0A6J1G3S2 uncharacterized protein LOC111450487 isoform X2; e value: 6.3e-241; % identity: 100
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST

Query:  NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRRGADSDAE
        NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRRGADSDAE
Subjt:  NLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRRGADSDAE

Query:  SSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCD
        SSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCD
Subjt:  SSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCD

Query:  LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLWQ
        LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLWQ
Subjt:  LSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLWQ

Query:  DADNWLRSLNVNHPDYRFFASHTSSGR
        DADNWLRSLNVNHPDYRFFASHTSSGR
Subjt:  DADNWLRSLNVNHPDYRFFASHTSSGR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789); e value: 2.5e-80; % identity: 46.81
Query:  ELEKSEVD---ECRSWSTR-----SDCSVSDRGVADSTNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLN
        +L+++++D    CRS  T+     S         A S+N++RFL+  TP VPA   SKT ++     +V    PYF+LGD+WESF EWSAYG G+PL LN
Subjt:  ELEKSEVD---ECRSWSTR-----SDCSVSDRGVADSTNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLN

Query:  GS-DSVVQYYVPYLSGIQLY--IDPSKSSALRRRGADSDAESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAE-SSSDESDSCY
         + D V QYYVP LSGIQ+Y  +D   SS   RR  +      ++++S+GSS        S + +        ++     L +     E SSSD+ +   
Subjt:  GS-DSVVQYYVPYLSGIQLY--IDPSKSSALRRRGADSDAESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAE-SSSDESDSCY

Query:  RQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREV
         QG+L+FEY+E D P+ REP  DK++ LASRFPELKT RSCDL PSSW SVAWYPIY+IPTGPTL+ LDACFLT+HSL T FQG G      H  + RE 
Subjt:  RQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREV

Query:  HTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFF
               K++LP FGLASYK + S W S G      AN+L+Q ADNWLR   VNHPD+ FF
Subjt:  HTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFF

AT2G01260.1 Protein of unknown function (DUF789); e value: 1.4e-78; % identity: 46.58
Query:  RIDELEKSEVDECRSWSTRSDCSVSDRGVAD--STNLDRFLEYTTPVVPAQCFSKTSLKGWR-NREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS
        RID+L +++ D     S+           +D  S+NLDRFLE  TP VPAQ  SKT L+  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN +
Subjt:  RIDELEKSEVDECRSWSTRSDCSVSDRGVAD--STNLDRFLEYTTPVVPAQCFSKTSLKGWR-NREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS

Query:  -DSVVQYYVPYLSGIQLY-----IDPSKSSALRRRGADSDA---ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESD
         D V+QYYVP LS IQ+Y     +D S  S  RR G  SD+   +SS + +SD  S     +    +L+D+  +D                  SSSD+ +
Subjt:  -DSVVQYYVPYLSGIQLY-----IDPSKSSALRRRGADSDA---ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESD

Query:  SCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTD-GLQFHWPR
            QG+L+FEY+E D P+ REP  DK+  LA++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+HSL T+F G G++  +    PR
Subjt:  SCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTD-GLQFHWPR

Query:  VREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFF
          E        K+ LP FGLASYKF+ S W   G  E    N+L+Q AD WL S +V+HPD+ FF
Subjt:  VREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFF

AT2G01260.2 Protein of unknown function (DUF789); e value: 4.4e-61; % identity: 47.74
Query:  RIDELEKSEVDECRSWSTRSDCSVSDRGVAD--STNLDRFLEYTTPVVPAQCFSKTSLKGWR-NREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS
        RID+L +++ D     S+           +D  S+NLDRFLE  TP VPAQ  SKT L+  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN +
Subjt:  RIDELEKSEVDECRSWSTRSDCSVSDRGVAD--STNLDRFLEYTTPVVPAQCFSKTSLKGWR-NREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGS

Query:  -DSVVQYYVPYLSGIQLY-----IDPSKSSALRRRGADSDA---ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESD
         D V+QYYVP LS IQ+Y     +D S  S  RR G  SD+   +SS + +SD  S     +    +L+D+  +D                  SSSD+ +
Subjt:  -DSVVQYYVPYLSGIQLY-----IDPSKSSALRRRGADSDA---ESSKETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESD

Query:  SCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQG
            QG+L+FEY+E D P+ REP  DK+  LA++FPEL T RSCDL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+HSL T+F G
Subjt:  SCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQG

AT4G16100.1 Protein of unknown function (DUF789); e value: 1.7e-92; % identity: 49.88
Query:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST-------NLD
        RIRGENRFY+PP MR+     QQ++++++ + ++ ++++++ +  LD K        +++E E  + +EC    + SDCSV  R  + +T       NL 
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADST-------NLD

Query:  RFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRR-GADSDAESS
        RFL+ TTP+V  Q    TS KGWR RE  E  PYF+L DLW+SF+EWSAYG G+PLLLNG DSVVQYYVPYLSGIQLY DPS++   RRR G +SD +S 
Subjt:  RFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRR-GADSDAESS

Query:  KETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESD-SCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDL
        ++ +SDGS++C   ++ S  L              RA     P   SSSDES+ S    G+LVFEY+E   PF REPLTDKI+ L+S+FP L+TYRSCDL
Subjt:  KETTSDGSSNCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESD-SCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWN-STGVEECPKANTLWQ
        SPSSW+SVAWYPIYRIP G +LQ+LDACFLTFHSLST  +G   +  Q      + V +A LP    LPTFGLASYKFK S W+  + V+E  +  TL +
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWN-STGVEECPKANTLWQ

Query:  DADNWLRSLNVNHPDYRFFASHTSS
         A+ WLR L V  PD+R F SH+ S
Subjt:  DADNWLRSLNVNHPDYRFFASHTSS

AT5G49220.1 Protein of unknown function (DUF789); e value: 2.0e-90; % identity: 48.55
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSV-------
        MS SGGVSIAR  IRGENRFY+PP MRR      QQ+ Q QQQ +++Q++  + +  +D +   AAT A     +   V E +S    S   V       
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSV-------

Query:  ---SDRGVADSTNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYID
           S R ++D +NLDRFLE+TTPVVPA+ F   S    + RE S+   YFVL DLWESF EWSAYGAG+     PL ++G+DS VQYYVPYLSGIQLY+D
Subjt:  ---SDRGVADSTNLDRFLEYTTPVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYID

Query:  PSKSSALRRRGADSDAESSKETTSDGSS---NCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLT
        P K    + R    D E S E +S+  +   +  +G+    +L+D+     S+TGS             SS E++    QG+L+FEY+E +PPF REPL 
Subjt:  PSKSSALRRRGADSDAESSKETTSDGSS---NCGMGKKTSTALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLT

Query:  DKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTA--FQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYK
        +KI+ LASR PEL TYRSCDL PSSW+SV+WYPIYRIP GPTLQ+LDACFLTFHSLSTA     +G    Q                KL LPTFGLASYK
Subjt:  DKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHSLSTA--FQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYK

Query:  FKFSFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFFASHTSSGR
         K S WN   ++E  K  +L Q AD WL+ L V+HPDYRFF S++   R
Subjt:  FKFSFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFFASHTSSGR
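
The alignment blocks above follow the usual BLAST pairwise layout: the unlabelled middle row repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. Since the "Subjt" row on this page appears to repeat the query, the sketch below counts identities from the middle row alone. The function name `block_identity` is hypothetical, and the input is the last block of the Cucumis sativus (XP_011651067.2) alignment. A per-block count is not the whole-alignment % identity reported for each hit.

```python
def block_identity(query, midline):
    """Count (identical, positive, columns) for one alignment block.

    Assumes BLAST midline conventions: a letter marks an identical
    residue, '+' a conservative substitution, a space anything else.
    Both rows must be passed without their 'Query:' style prefixes.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for q, m in zip(query, midline)
                    if m.isalpha() and m == q)
    positive = midline.count("+")
    return identical, positive, len(query)


QUERY = "QDADNWLRSLNVNHPDYRFFASHTSSGR"
MIDLINE = "QDAD+WLR LNVNHPDYRFFASH S  R"

identical, positive, columns = block_identity(QUERY, MIDLINE)
print(identical, positive, columns)  # 23 1 28
print(round(100 * identical / columns, 2))
```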


Sequences
CDS sequence
ATGTCAGTCTCCGGTGGTGTTTCGATTGCCCGAATCCGTGGGGAGAATCGGTTCTATCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAGCAGCAGCAACAGCA
ACAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGAAGCAAACTGCCTTGGATTTGAAGGAGGTTGCTGCTGCTACTACTGCGAGGATCGATGAGTTGGAGA
AGAGTGAGGTTGATGAGTGTCGTTCTTGGTCTACTCGCTCGGATTGCTCTGTTTCTGATCGTGGAGTTGCTGATTCTACTAATTTGGATCGGTTCTTGGAGTACACTACT
CCCGTTGTTCCGGCTCAATGTTTTTCTAAGACGAGCCTAAAGGGATGGAGAAATCGTGAAGTCTCAGAGGCACCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTT
CAAGGAATGGAGTGCATACGGAGCGGGTATCCCTCTATTGTTAAATGGTAGCGACTCTGTAGTACAGTACTACGTTCCATATCTGTCGGGCATACAACTCTATATAGATC
CATCAAAATCCTCTGCCTTAAGAAGGCGTGGTGCTGATAGTGATGCTGAGTCCTCAAAGGAAACAACTAGTGATGGAAGTAGTAATTGTGGGATGGGAAAAAAAACTAGT
ACGGCTCTTCAGGATGAGTGGATACAGGACTCCAGTGTTACTGGGTCACGAAGAGCGCTTCAAATGAATGTACCCTCTGCCGAGTCATCAAGTGATGAAAGTGACTCTTG
CTACCGTCAAGGTCAGCTTGTGTTCGAATACATGGAGCTTGATCCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTATCCTTGCATCTCGTTTTCCTGAATTGA
AAACATATAGAAGCTGTGATTTATCTCCTTCCAGTTGGATATCTGTGGCATGGTATCCAATTTATAGGATTCCCACGGGTCCAACTCTACAAAGTCTAGATGCTTGCTTT
TTGACCTTCCATTCTCTGTCTACAGCATTTCAAGGCATCGGCACCGACGGGTTGCAATTCCATTGGCCAAGAGTTCGAGAGGTGCACACTGCAAATTTGCCTCTCAAACT
ACAGTTGCCAACATTTGGACTTGCTTCCTACAAGTTCAAATTTTCGTTTTGGAATTCAACTGGTGTGGAGGAATGTCCAAAGGCTAACACATTGTGGCAAGATGCCGACA
ACTGGCTCAGGTCATTAAACGTGAACCATCCCGATTACAGATTTTTTGCATCTCATACTTCGTCCGGGAGATGA
mRNA sequence
GCGTCTGCTCCTAAAATCTCCCTCTCTAATCCAATCCTCTCCCTTCCTTCTCTCTCTAATTTCGACAAATGGGGCTCAAGGGTTTCTCTTAGCTATCTTCCCTCGCTAGA
TTTTGCAAAACCCTAGTTTCTTCATCGTCTTTATCACTGTATATATCCAAGGAAACCTGGTCTTCCTTTTGTGTACGCTGCCACGGTTTTGGTGTTTGTTTGATTCTCTG
TGTGGTTTTCTGCAATGTCAGTCTCCGGTGGTGTTTCGATTGCCCGAATCCGTGGGGAGAATCGGTTCTATCATCCACCTGCGATGCGGCGTCGTTTGCAGCAGCAGCAG
CAGCAGCAACAGCAACAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGAAGCAAACTGCCTTGGATTTGAAGGAGGTTGCTGCTGCTACTACTGCGAGGAT
CGATGAGTTGGAGAAGAGTGAGGTTGATGAGTGTCGTTCTTGGTCTACTCGCTCGGATTGCTCTGTTTCTGATCGTGGAGTTGCTGATTCTACTAATTTGGATCGGTTCT
TGGAGTACACTACTCCCGTTGTTCCGGCTCAATGTTTTTCTAAGACGAGCCTAAAGGGATGGAGAAATCGTGAAGTCTCAGAGGCACCTCCTTATTTTGTGCTCGGTGAT
CTCTGGGAATCTTTCAAGGAATGGAGTGCATACGGAGCGGGTATCCCTCTATTGTTAAATGGTAGCGACTCTGTAGTACAGTACTACGTTCCATATCTGTCGGGCATACA
ACTCTATATAGATCCATCAAAATCCTCTGCCTTAAGAAGGCGTGGTGCTGATAGTGATGCTGAGTCCTCAAAGGAAACAACTAGTGATGGAAGTAGTAATTGTGGGATGG
GAAAAAAAACTAGTACGGCTCTTCAGGATGAGTGGATACAGGACTCCAGTGTTACTGGGTCACGAAGAGCGCTTCAAATGAATGTACCCTCTGCCGAGTCATCAAGTGAT
GAAAGTGACTCTTGCTACCGTCAAGGTCAGCTTGTGTTCGAATACATGGAGCTTGATCCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTATCCTTGCATCTCG
TTTTCCTGAATTGAAAACATATAGAAGCTGTGATTTATCTCCTTCCAGTTGGATATCTGTGGCATGGTATCCAATTTATAGGATTCCCACGGGTCCAACTCTACAAAGTC
TAGATGCTTGCTTTTTGACCTTCCATTCTCTGTCTACAGCATTTCAAGGCATCGGCACCGACGGGTTGCAATTCCATTGGCCAAGAGTTCGAGAGGTGCACACTGCAAAT
TTGCCTCTCAAACTACAGTTGCCAACATTTGGACTTGCTTCCTACAAGTTCAAATTTTCGTTTTGGAATTCAACTGGTGTGGAGGAATGTCCAAAGGCTAACACATTGTG
GCAAGATGCCGACAACTGGCTCAGGTCATTAAACGTGAACCATCCCGATTACAGATTTTTTGCATCTCATACTTCGTCCGGGAGATGA
Protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQQQQQQQKQTALDLKEVAAATTARIDELEKSEVDECRSWSTRSDCSVSDRGVADSTNLDRFLEYTT
PVVPAQCFSKTSLKGWRNREVSEAPPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYIDPSKSSALRRRGADSDAESSKETTSDGSSNCGMGKKTS
TALQDEWIQDSSVTGSRRALQMNVPSAESSSDESDSCYRQGQLVFEYMELDPPFCREPLTDKITILASRFPELKTYRSCDLSPSSWISVAWYPIYRIPTGPTLQSLDACF
LTFHSLSTAFQGIGTDGLQFHWPRVREVHTANLPLKLQLPTFGLASYKFKFSFWNSTGVEECPKANTLWQDADNWLRSLNVNHPDYRFFASHTSSGR
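
One way to check that the CDS and protein sequences above agree is to translate the CDS and compare the result with the listed protein. The sketch below is a minimal version that assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last few codons of the CDS are used as input.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten and last four codons of the CDS sequence listed above.
print(translate("ATGTCAGTCTCCGGTGGTGTTTCGATTGCC"))  # MSVSGGVSIA
print(translate("TCCGGGAGATGA"))  # SGR
```

Passing the full CDS text, line breaks included, to `translate` and comparing the result with the concatenated protein lines would extend the same check to the whole record.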