; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G03090 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G03090
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionS1 motif domain-containing protein
Genome locationClcChr01:2838906..2853087
RNA-Seq ExpressionClc01G03090
SyntenyClc01G03090
Gene Ontology termsGO:0034337 - RNA folding (biological process)
GO:1901259 - chloroplast rRNA processing (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0019843 - rRNA binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042780.1 protein MLP1 [Cucumis melo var. makuwa]0.0e+0090.04Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF PIDLLRPRR AVRNLCFNGR SKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKS+E
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKKEMKLKAANKPP PD+KKPSQAVGKV VSPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSLKMSNV TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS
        KPEPMTSNE+IDE EKLSGDGYVD V   ENRA+KGS+SDR+DDFTLSKKPEIGGD+T LE+EN          DTVD KE+N I+DL++L++PLN+MS 
Subjt:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS

Query:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF
        +SEET VGSSTNEN KDIDYS IGLQLHEPSD+DY E+PAALSESF+DILDSTIEVSKKATLLGKPRRVDHSS+E  KL REEASTPETD+NGA+ET +F
Subjt:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPALEEHELADWTKAEDLAKSGDRADVE+ISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPS YKQNLGTIGS+ GG+QAFASTR 
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        + EIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKL+FS+RPKEREELVEKKRSLM TLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE
        EGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKEL+NTEGIE
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE

Query:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
        +VSK RFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
Subjt:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR

KAE8647864.1 hypothetical protein Csa_000384 [Cucumis sativus]0.0e+0076.99Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFTPIDLLRPRR AVRNLCFNGRPSKFSVL+SKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKLKAANKPP PD+KKPSQAVGKV VSPKGRVPNVILRKPT YNEDDVEDKPSRIRMKPNLSLKMSNVSTKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS
        KPEPMTSNE+IDE EKLSGDGYVD V   EN A+K  TSDR+DDFTLSKKPEIGGD+TRLE          SESD VD KE+N I+DL++LK+PLN+MS 
Subjt:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS

Query:  MSEETGVGSSTNENRKDIDYSPIGL-QLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETET
        +SEET VGSSTNEN KDIDYS IGL QLHEPSD+DY E+PAALSESFSDILD TIE SKKATLLGKPRRVDHSSKE  KL REE STPETDVNGA ETE 
Subjt:  MSEETGVGSSTNENRKDIDYSPIGL-QLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETET

Query:  FSAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
        FSAIPALEEHELADWTKAEDLAKSGDRADVE+ISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
Subjt:  FSAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR

Query:  PNLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE
        P+ EIDVKDGGELTPDMKLEDLLQIY++EKIKFLSSFVGQKIKVNVVLANRKSRKLIFS+RPKER++LV+KKRSLM TLQVGDVVKCCIKKIAYFGIFVE
Subjt:  PNLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE

Query:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGI
        IEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDS E+DTEWADVESL+KEL+N EGI
Subjt:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGI

Query:  ESVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE-----------------------------------------------------LMVET
        E+VSK RFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE                                                     L + T
Subjt:  ESVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE-----------------------------------------------------LMVET

Query:  SLDKETMKSVI---------------------------LTCTNRFLAILVIIFGIWMGTHHDGCRKSLTLPVMGL-------------------------
         L  +   SV                            +     FLAILVIIFGIWMGTHHDGCRKSLTLPVMGL                         
Subjt:  SLDKETMKSVI---------------------------LTCTNRFLAILVIIFGIWMGTHHDGCRKSLTLPVMGL-------------------------

Query:  ----------------VSDHAVHYIGGDPRVYG----------------ASLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPP
                        V    V   G    V G                  LNN++NWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPP
Subjt:  ----------------VSDHAVHYIGGDPRVYG----------------ASLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPP

Query:  SECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKAGVAQYMKTEWRLVAIFNVILFVVL
        SECGYPAVNASYYDLSFHPVNSNHDCK+YKNSKAVKCYDCDSCKAGVAQYMKTEWRLVAIFNVILFVVL
Subjt:  SECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKAGVAQYMKTEWRLVAIFNVILFVVL

XP_008437231.1 PREDICTED: uncharacterized protein LOC103482723 isoform X1 [Cucumis melo]0.0e+0090.04Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF PIDLLRPRR AVRNLCFNGR SKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKS+E
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKKEMKLKAANKPP PD+KKPSQAVGKV VSPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSLKMSNV TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS
        KPEPMTSNE+IDE EKLSGDGYVD V   ENRA+KGS+SDR+DDFTLSKKPEIGGD+T LE+EN          DTVD KE+N I+DL++L++PLN+MS 
Subjt:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS

Query:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF
        +SEET VGSSTNEN KDIDYS IGLQLHEPSD+DY E+PAALSESF+DILDSTIEVSKKATLLGKPRRVDHSS+E  KL REEASTPETD+NGA+ET +F
Subjt:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPALEEHELADWTKAEDLAKSGDRADVE+ISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPS YKQNLGTIGSS GGSQAFASTR 
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        + EIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKV VVLANRKSRKL+FS+RPKEREELVEKKRSLM TLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE
        EGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHD MDGRLDSAEVDTEWADVESLIKEL+NTEGIE
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE

Query:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
        +VSK RFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
Subjt:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR

XP_011654764.1 uncharacterized protein LOC101217667 [Cucumis sativus]0.0e+0090.18Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFTPIDLLRPRR AVRNLCFNGRPSKFSVL+SKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKLKAANKPP PD+KKPSQAVGKV VSPKGRVPNVILRKPT YNEDDVEDKPSRIRMKPNLSLKMSNVSTKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS
        KPEPMTSNE+IDE EKLSGDGYVD V   EN A+K  TSDR+DDFTLSKKPEIGGD+TRLE          SESD VD KE+N I+DL++LK+PLN+MS 
Subjt:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS

Query:  MSEETGVGSSTNENRKDIDYSPIGL-QLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETET
        +SEET VGSSTNEN KDIDYS IGL QLHEPSD+DY E+PAALSESFSDILD TIE SKKATLLGKPRRVDHSSKE  KL REE STPETDVNGA ETE 
Subjt:  MSEETGVGSSTNENRKDIDYSPIGL-QLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETET

Query:  FSAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
        FSAIPALEEHELADWTKAEDLAKSGDRADVE+ISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
Subjt:  FSAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR

Query:  PNLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE
        P+ EIDVKDGGELTPDMKLEDLLQIY++EKIKFLSSFVGQKIKVNVVLANRKSRKLIFS+RPKER++LV+KKRSLM TLQVGDVVKCCIKKIAYFGIFVE
Subjt:  PNLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE

Query:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGI
        IEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDS E+DTEWADVESL+KEL+N EGI
Subjt:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGI

Query:  ESVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
        E+VSK RFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
Subjt:  ESVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR

XP_038874720.1 uncharacterized protein LOC120067263 [Benincasa hispida]0.0e+0089.38Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFTPIDLLRPRR  VRNLCFNGR SKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKK NPD SYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKLKAA+KPPAPDLKKPSQ VGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSN STKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS
        KPEPMTSNE+IDEKEKLSG+GYVD+V   ENR +KGSTSD  D FTLSKKPEIGGD+TR           ++ES+TVDGK END+ED+++LK+PLN MS 
Subjt:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS

Query:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF
        MSEET VGSSTNEN KDIDYS I          DY +D A LSESFSDILDST++VSKKATLLGKPRRVDHSSKE +KL REEASTPETDV+GAVETE F
Subjt:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPALEEHELADWTKAEDLAKSGDRADVE+ISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPS+YKQNLGTIGSSDG SQ FASTR 
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        + EIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE
        EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGR DSAEVDTEW+DVESL+KEL+NT+GIE
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE

Query:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
        +VSK RFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
Subjt:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR

TrEMBL top hitse value%identityAlignment
A0A0A0KKL1 S1 motif domain-containing protein0.0e+0090.18Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFTPIDLLRPRR AVRNLCFNGRPSKFSVL+SKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKLKAANKPP PD+KKPSQAVGKV VSPKGRVPNVILRKPT YNEDDVEDKPSRIRMKPNLSLKMSNVSTKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS
        KPEPMTSNE+IDE EKLSGDGYVD V   EN A+K  TSDR+DDFTLSKKPEIGGD+TRLE          SESD VD KE+N I+DL++LK+PLN+MS 
Subjt:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS

Query:  MSEETGVGSSTNENRKDIDYSPIGL-QLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETET
        +SEET VGSSTNEN KDIDYS IGL QLHEPSD+DY E+PAALSESFSDILD TIE SKKATLLGKPRRVDHSSKE  KL REE STPETDVNGA ETE 
Subjt:  MSEETGVGSSTNENRKDIDYSPIGL-QLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETET

Query:  FSAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
        FSAIPALEEHELADWTKAEDLAKSGDRADVE+ISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
Subjt:  FSAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR

Query:  PNLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE
        P+ EIDVKDGGELTPDMKLEDLLQIY++EKIKFLSSFVGQKIKVNVVLANRKSRKLIFS+RPKER++LV+KKRSLM TLQVGDVVKCCIKKIAYFGIFVE
Subjt:  PNLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE

Query:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGI
        IEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDS E+DTEWADVESL+KEL+N EGI
Subjt:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGI

Query:  ESVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
        E+VSK RFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
Subjt:  ESVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR

A0A1S3AU41 uncharacterized protein LOC103482723 isoform X10.0e+0090.04Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF PIDLLRPRR AVRNLCFNGR SKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKS+E
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKKEMKLKAANKPP PD+KKPSQAVGKV VSPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSLKMSNV TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS
        KPEPMTSNE+IDE EKLSGDGYVD V   ENRA+KGS+SDR+DDFTLSKKPEIGGD+T LE+EN          DTVD KE+N I+DL++L++PLN+MS 
Subjt:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS

Query:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF
        +SEET VGSSTNEN KDIDYS IGLQLHEPSD+DY E+PAALSESF+DILDSTIEVSKKATLLGKPRRVDHSS+E  KL REEASTPETD+NGA+ET +F
Subjt:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPALEEHELADWTKAEDLAKSGDRADVE+ISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPS YKQNLGTIGSS GGSQAFASTR 
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        + EIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKV VVLANRKSRKL+FS+RPKEREELVEKKRSLM TLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE
        EGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHD MDGRLDSAEVDTEWADVESLIKEL+NTEGIE
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE

Query:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
        +VSK RFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
Subjt:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR

A0A5A7THK9 Protein MLP10.0e+0090.04Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF PIDLLRPRR AVRNLCFNGR SKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKS+E
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKKEMKLKAANKPP PD+KKPSQAVGKV VSPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSLKMSNV TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS
        KPEPMTSNE+IDE EKLSGDGYVD V   ENRA+KGS+SDR+DDFTLSKKPEIGGD+T LE+EN          DTVD KE+N I+DL++L++PLN+MS 
Subjt:  KPEPMTSNEIIDEKEKLSGDGYVDVV---ENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS

Query:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF
        +SEET VGSSTNEN KDIDYS IGLQLHEPSD+DY E+PAALSESF+DILDSTIEVSKKATLLGKPRRVDHSS+E  KL REEASTPETD+NGA+ET +F
Subjt:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPALEEHELADWTKAEDLAKSGDRADVE+ISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPS YKQNLGTIGS+ GG+QAFASTR 
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        + EIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKL+FS+RPKEREELVEKKRSLM TLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE
        EGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKEL+NTEGIE
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE

Query:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
        +VSK RFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
Subjt:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR

A0A6J1E0R7 uncharacterized protein LOC111026097 isoform X20.0e+0083.62Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFT IDLLRPRRGAVRNLC NGRPS+FSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKK+MKLKAA KPP  D+KKPSQAV K AVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSL M+N  TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEIIDEKEKLSGDGYVDVVEN---RATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS
        KPEPMTS+E I++KEK SGD YVD+ +N   +A+KGS+SD VDDFTLSKKPEI GD+ RL NEND KNL YSESDT+DGK+EN IE+L++LK+PLN+MS 
Subjt:  KPEPMTSNEIIDEKEKLSGDGYVDVVEN---RATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS

Query:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF
         S ET   +S+ ENR+DID   + LQ HEPS++ Y EDPAA SE FSDILD ++++S +A LLGKP     S++EA+       +TPETDVN AVET+ F
Subjt:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPAL+EHELADWT+AEDLAK GDRA+VE+ISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGS+D GSQ F+STRP
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        +LE DVK+GGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVL NRKSRKLIFS+RPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  NLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE
        EGVPAL+HQTEVSWDATLNPAS++KIGQVVEAKVHQLDFSLERIFLSLKQITPDPL E LESVV DHDPMDGRL+ AEVDTEWADVESL KEL+N EG+E
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIE

Query:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
        +VSK RFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMV+TSLDKETMKS ILTCTNR
Subjt:  SVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR

A0A6J1E4Z1 uncharacterized protein LOC111026097 isoform X10.0e+0084.44Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFT IDLLRPRRGAVRNLC NGRPS+FSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKK+MKLKAA KPP  D+KKPSQAV K AVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSL M+N  TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEIIDEKEKLSGDGYVDVVEN---RATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS
        KPEPMTS+E I++KEK SGD YVD+ +N   +A+KGS+SD VDDFTLSKKPEI GD+ RL NEND KNL YSESDT+DGK+EN IE+L++LK+PLN+MS 
Subjt:  KPEPMTSNEIIDEKEKLSGDGYVDVVEN---RATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSS

Query:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEA--STPETDVNGAVETE
         S ET   +S+ ENR+DID   + LQ HEPS++ Y EDPAA SE FSDILD ++++S +A LLGKP R+DH S E LKL  EEA  +TPETDVN AVET+
Subjt:  MSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEA--STPETDVNGAVETE

Query:  TFSAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFAST
         FSAIPAL+EHELADWT+AEDLAK GDRA+VE+ISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGS+D GSQ F+ST
Subjt:  TFSAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFAST

Query:  RPNLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFV
        RP+LE DVK+GGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVL NRKSRKLIFS+RPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFV
Subjt:  RPNLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFV

Query:  EIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEG
        EIEGVPAL+HQTEVSWDATLNPAS++KIGQVVEAKVHQLDFSLERIFLSLKQITPDPL E LESVV DHDPMDGRL+ AEVDTEWADVESL KEL+N EG
Subjt:  EIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAEVDTEWADVESLIKELENTEG

Query:  IESVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
        +E+VSK RFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMV+TSLDKETMKS ILTCTNR
Subjt:  IESVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR

SwissProt top hitse value%identityAlignment
F4I214 Tetraspanin-109.6e-6150.2Show/hide
Query:  LAILVIIFGIWMGTHHDGCRKSLTLPVMGL----VSDHAVHYIGGDPR------VYGA------------------------------------------
        LA+ VIIFG+WM TH+DGCR+SLT PV+ L         + ++G   R      +Y A                                          
Subjt:  LAILVIIFGIWMGTHHDGCRKSLTLPVMGL----VSDHAVHYIGGDPR------VYGA------------------------------------------

Query:  -----SLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKA
              LNN+ NW+RLKSCLVKSE C  LSK+YKT+KQ K A+LTP+EAGCCRPPSECGYPAVNASYYDLSFH ++SN DCKLYKN + +KCY+CDSCKA
Subjt:  -----SLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKA

Query:  GVAQYMKTEWRLVAIFNVILFVVLSDAALDEKL---QKIGLKASGIAVVLNRLIK
        GVAQYMKTEWRLVAIFNV+LFVVL  + L  +    Q  GL  +G+  + N   K
Subjt:  GVAQYMKTEWRLVAIFNVILFVVLSDAALDEKL---QKIGLKASGIAVVLNRLIK

Q8S8Q6 Tetraspanin-85.3e-1935.83Show/hide
Query:  LNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAK--LTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKAGVAQ
        + N  NW +++SCLV+S+ C+ L  ++  +      K  LT +++GCC+P  ECG+  VN + +  +    ++N DC+ + N+K   C+DC SCKAG+  
Subjt:  LNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAK--LTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKAGVAQ

Query:  YMKTEWRLVAIFNVILFVVL
         +K+ W+ VAI N++  V L
Subjt:  YMKTEWRLVAIFNVILFVVL

Q9M1E7 Tetraspanin-31.4e-1932.79Show/hide
Query:  LNNSDNWMRLKSCLVKSEDCNNLSKRY----KTLKQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKAGV
        +++   W ++ SCL  S  C  + + +    +T   + L +L+P+E+GCC+PP++CG+  VN + +D     +  N DC ++ N +++ CY C SCKAGV
Subjt:  LNNSDNWMRLKSCLVKSEDCNNLSKRY----KTLKQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKAGV

Query:  AQYMKTEWRLVAIFNVILFVVL
           +K  WR V++ N+++ ++L
Subjt:  AQYMKTEWRLVAIFNVILFVVL

Q9SUD4 Tetraspanin-72.9e-1732.5Show/hide
Query:  LNNSDNWMRLKSCLVKSEDCNNLSKRYKTL--KQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKAGVAQ
        +NN+ NW R++SCL+ S+ C+    RY ++  + +  + L  +++GCC+P ++C +  VN + +  +  P   N DC ++ N     CYDC++CKAG+  
Subjt:  LNNSDNWMRLKSCLVKSEDCNNLSKRYKTL--KQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKAGVAQ

Query:  YMKTEWRLVAIFNVILFVVL
         +K  W+ VA  N++  + L
Subjt:  YMKTEWRLVAIFNVILFVVL

Q9ZUN5 Tetraspanin-24.8e-2032.8Show/hide
Query:  KETMKSVILTCTNRFLA-ILVIIFGIWMGTHHDGCRKSLTLPVMGLVSDHAVHYIGGDPRVYGASLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQ-YK
        KET+ +V L C    +  +LV++   ++ T  DG   S  +P  G    +  + + G       ++ +S NW RL++CL  +  C  L++ + T  Q + 
Subjt:  KETMKSVILTCTNRFLA-ILVIIFGIWMGTHHDGCRKSLTLPVMGLVSDHAVHYIGGDPRVYGASLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQ-YK

Query:  LAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVN--SNHDCKLYKNSKAVKCYDCDSCKAGVAQYMKTEWRLVAIFNVILFVVL
         +K+TP+++GCC+PP+ CGY  VN +   L  +P N  ++ DC L+ N ++  CY+C+SCKAG+   ++ EWR   +  +I  VVL
Subjt:  LAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVN--SNHDCKLYKNSKAVKCYDCDSCKAGVAQYMKTEWRLVAIFNVILFVVL

Arabidopsis top hitse value%identityAlignment
AT1G12800.1 Nucleic acid-binding, OB-fold-like protein1.5e-19753.54Show/hide
Query:  ALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASK-EEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNEVEE
        A  A  F +    +R RR     L    +  KF V ASK EE +L+ WDQMEL FGRL+GEDPKLTLAKI+++K++P+AS++++EKSFY+ KGK  EVEE
Subjt:  ALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASK-EEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNEVEE

Query:  L--------------SLDGLNLVRPQLKKEMKL-KAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVS
        +              SLDGL LV+P LK  +K  +   K P+P LKKP   V  VA     R+PNVILRKP+ +   + +D+ S++R+KPNL+LKM N  
Subjt:  L--------------SLDGLNLVRPQLKKEMKL-KAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVS

Query:  TKEQYSDMTLLRKPEPMTSNEIIDEKEKLSGDGYVDVVENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKE--ENDIEDLH
          E++SDMTLLRKPEP+  + + +E++K   D     +E    +G T      +TL +KPE       +E E         +S  V+  E   N I+   
Subjt:  TKEQYSDMTLLRKPEPMTSNEIIDEKEKLSGDGYVDVVENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKE--ENDIEDLH

Query:  VLKKPLNMMSSMSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTI-EVSKKATLLGKPRRVDHSSKEALKLRREEASTPE
           +  N+   +++   + SS  EN       P  +QL+        E+    S+    I    I +   +A+L GKP+R+D SS E       + S   
Subjt:  VLKKPLNMMSSMSEETGVGSSTNENRKDIDYSPIGLQLHEPSDVDYGEDPAALSESFSDILDSTI-EVSKKATLLGKPRRVDHSSKEALKLRREEASTPE

Query:  TDVNGAVETETFSAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSS
         +  G   +      P     E  DW KAE L K+  RADVE+ISSSTRGF VS+GSL+GF+PYRNLAAKWKFLAFESWLR+KG+DPS Y+QNLG IG  
Subjt:  TDVNGAVETETFSAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSFGSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSS

Query:  DGGSQA-FASTRPNLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCC
        D  S++    +  + E+     GE++ DMKLEDLL +YDREK KFLSSFVGQKIKVNVV+ANR SRKLIFS+RP+E EE VEKKR+LMA L+VGDVVKCC
Subjt:  DGGSQA-FASTRPNLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSVRPKEREELVEKKRSLMATLQVGDVVKCC

Query:  IKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVV-GDHDPMDGRLDSAEVDTEWADV
        IKKI YFGIF E+EGVPAL+HQ+EVSWDATL+PASYFKIGQ+VEAKVHQLDF+LERIFLSLK+ITPDPL EALESVV GD+D + GRL +AE+D EW DV
Subjt:  IKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVV-GDHDPMDGRLDSAEVDTEWADV

Query:  ESLIKELENTEGIESVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR
        ESLIKELE  EGI+SVSK RFFLSPGLAPTFQVYMA M+ENQYKLLAR+GN+VQEL+VE SL KE MKS I++CTNR
Subjt:  ESLIKELENTEGIESVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNR

AT1G63260.1 tetraspanin106.8e-6250.2Show/hide
Query:  LAILVIIFGIWMGTHHDGCRKSLTLPVMGL----VSDHAVHYIGGDPR------VYGA------------------------------------------
        LA+ VIIFG+WM TH+DGCR+SLT PV+ L         + ++G   R      +Y A                                          
Subjt:  LAILVIIFGIWMGTHHDGCRKSLTLPVMGL----VSDHAVHYIGGDPR------VYGA------------------------------------------

Query:  -----SLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKA
              LNN+ NW+RLKSCLVKSE C  LSK+YKT+KQ K A+LTP+EAGCCRPPSECGYPAVNASYYDLSFH ++SN DCKLYKN + +KCY+CDSCKA
Subjt:  -----SLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKA

Query:  GVAQYMKTEWRLVAIFNVILFVVLSDAALDEKL---QKIGLKASGIAVVLNRLIK
        GVAQYMKTEWRLVAIFNV+LFVVL  + L  +    Q  GL  +G+  + N   K
Subjt:  GVAQYMKTEWRLVAIFNVILFVVLSDAALDEKL---QKIGLKASGIAVVLNRLIK

AT1G63260.2 tetraspanin101.2e-5853.21Show/hide
Query:  LAILVIIFGIWMGTHHDGCRKSLTLPVMGL----VSDHAVHYIGGDPR------VYGA------------------------------------------
        LA+ VIIFG+WM TH+DGCR+SLT PV+ L         + ++G   R      +Y A                                          
Subjt:  LAILVIIFGIWMGTHHDGCRKSLTLPVMGL----VSDHAVHYIGGDPR------VYGA------------------------------------------

Query:  -----SLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKA
              LNN+ NW+RLKSCLVKSE C  LSK+YKT+KQ K A+LTP+EAGCCRPPSECGYPAVNASYYDLSFH ++SN DCKLYKN + +KCY+CDSCKA
Subjt:  -----SLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKA

Query:  GVAQYMKTEWRLVAIFNV
        GVAQYMKTEWRLVAIFNV
Subjt:  GVAQYMKTEWRLVAIFNV

AT1G63260.3 tetraspanin108.9e-6251.87Show/hide
Query:  LAILVIIFGIWMGTHHDGCRKSLTLPVMGL----VSDHAVHYIGGDPR------VYGA------------------------------------------
        LA+ VIIFG+WM TH+DGCR+SLT PV+ L         + ++G   R      +Y A                                          
Subjt:  LAILVIIFGIWMGTHHDGCRKSLTLPVMGL----VSDHAVHYIGGDPR------VYGA------------------------------------------

Query:  -----SLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKA
              LNN+ NW+RLKSCLVKSE C  LSK+YKT+KQ K A+LTP+EAGCCRPPSECGYPAVNASYYDLSFH ++SN DCKLYKN + +KCY+CDSCKA
Subjt:  -----SLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVNSNHDCKLYKNSKAVKCYDCDSCKA

Query:  GVAQYMKTEWRLVAIFNVILFVVLSDAALDEKL---QKIGL
        GVAQYMKTEWRLVAIFNV+LFVVL  + L  +    Q  GL
Subjt:  GVAQYMKTEWRLVAIFNVILFVVLSDAALDEKL---QKIGL

AT2G19580.1 tetraspanin23.4e-2132.8Show/hide
Query:  KETMKSVILTCTNRFLA-ILVIIFGIWMGTHHDGCRKSLTLPVMGLVSDHAVHYIGGDPRVYGASLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQ-YK
        KET+ +V L C    +  +LV++   ++ T  DG   S  +P  G    +  + + G       ++ +S NW RL++CL  +  C  L++ + T  Q + 
Subjt:  KETMKSVILTCTNRFLA-ILVIIFGIWMGTHHDGCRKSLTLPVMGLVSDHAVHYIGGDPRVYGASLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQ-YK

Query:  LAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVN--SNHDCKLYKNSKAVKCYDCDSCKAGVAQYMKTEWRLVAIFNVILFVVL
         +K+TP+++GCC+PP+ CGY  VN +   L  +P N  ++ DC L+ N ++  CY+C+SCKAG+   ++ EWR   +  +I  VVL
Subjt:  LAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVN--SNHDCKLYKNSKAVKCYDCDSCKAGVAQYMKTEWRLVAIFNVILFVVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGTCGCGCTCTAACGGCCTCTTCCTTCTTCACTCCCATTGATTTATTGCGACCCAGAAGAGGAGCTGTTAGAAATCTGTGTTTTAATGGCAGACCCAGTAAGTT
TTCTGTTCTTGCTTCCAAAGAAGAGGCTGAACTCGACCGTTGGGACCAAATGGAGCTCAAGTTCGGCCGCTTGATTGGGGAAGACCCCAAATTAACATTGGCCAAGATAA
TGAGCAAAAAAATGAATCCAGATGCTTCTTATCTTGAAGTTGAGAAATCATTTTACCAGAAGAAGGGAAAGTCTAACGAGGTAGAGGAACTTTCTCTTGATGGGTTGAAT
TTGGTCAGACCTCAGCTAAAGAAGGAAATGAAGTTAAAGGCTGCCAATAAGCCACCTGCACCAGATTTAAAGAAACCAAGCCAAGCAGTTGGAAAGGTAGCAGTTAGTCC
TAAAGGTAGGGTTCCCAATGTTATTTTGAGAAAACCGACGATTTATAATGAGGACGATGTTGAAGATAAGCCGTCACGGATAAGGATGAAGCCAAATTTATCATTGAAAA
TGAGCAATGTATCAACAAAGGAGCAATATAGTGATATGACACTGTTGAGGAAGCCAGAACCAATGACTTCAAATGAAATTATTGATGAGAAGGAAAAGCTCTCTGGTGAT
GGGTATGTCGATGTTGTTGAAAATCGGGCTACCAAGGGATCAACAAGTGACCGAGTTGATGATTTTACTCTTTCTAAGAAGCCAGAAATAGGAGGTGACCAAACAAGACT
TGAAAATGAAAATGATCATAAAAATCTGGCTTATTCAGAAAGTGATACGGTTGATGGTAAAGAGGAGAATGACATTGAAGACTTACATGTCCTAAAAAAGCCTTTAAATA
TGATGTCTAGCATGTCTGAAGAAACTGGAGTAGGCTCGTCAACAAATGAAAACAGGAAAGATATTGATTATTCTCCTATAGGATTACAGCTACATGAGCCAAGTGATGTA
GATTATGGTGAGGACCCAGCAGCTTTGAGCGAATCATTCAGTGATATTTTGGATTCAACAATAGAGGTGTCCAAAAAAGCTACATTATTGGGTAAACCAAGAAGAGTTGA
TCATTCTTCCAAGGAAGCACTAAAACTCCGTAGAGAAGAGGCCTCCACCCCTGAAACAGACGTCAATGGTGCCGTTGAGACAGAGACCTTCTCTGCTATCCCTGCTTTGG
AGGAACACGAACTTGCTGACTGGACTAAAGCAGAGGATCTGGCTAAGTCAGGAGACAGAGCTGACGTGGAGATAATAAGCTCGAGTACTCGAGGTTTTGTTGTATCATTT
GGCTCCCTAGTAGGATTTATTCCATATCGTAATCTTGCTGCCAAGTGGAAGTTCTTGGCTTTTGAGTCATGGCTAAGACAGAAAGGTTTGGATCCATCAATATACAAGCA
AAACTTAGGAACTATTGGAAGTAGTGATGGTGGAAGCCAGGCCTTTGCCAGTACTAGGCCGAATCTGGAAATTGATGTAAAAGATGGGGGAGAACTTACACCTGATATGA
AATTGGAGGATCTTCTTCAAATTTATGATCGAGAGAAAATCAAATTCTTGTCATCATTTGTTGGCCAGAAAATCAAAGTAAATGTGGTGTTGGCAAACAGAAAATCAAGG
AAGCTTATATTTTCCGTAAGGCCAAAAGAGAGGGAGGAATTGGTCGAGAAAAAGAGGAGTTTAATGGCGACGCTGCAAGTAGGTGATGTTGTCAAATGCTGCATTAAGAA
GATTGCTTATTTTGGTATCTTTGTTGAGATCGAAGGAGTGCCTGCTTTGATTCATCAGACAGAGGTTTCCTGGGACGCTACTTTAAACCCTGCATCATATTTTAAAATTG
GTCAGGTTGTAGAGGCCAAAGTTCATCAGTTGGATTTTTCACTCGAACGCATATTCTTATCATTGAAGCAGATTACACCAGACCCGCTAGCGGAAGCATTGGAGTCTGTG
GTTGGTGATCATGATCCCATGGATGGAAGATTAGATTCAGCCGAAGTAGATACCGAGTGGGCTGATGTAGAATCTCTCATCAAGGAACTGGAAAATACTGAAGGTATTGA
GTCTGTATCCAAAGAGCGTTTTTTCCTTAGCCCTGGATTGGCTCCAACCTTTCAGGTATATATGGCTTCCATGTATGAAAATCAGTACAAATTACTTGCTCGATCTGGAA
ACAAAGTACAAGAGCTGATGGTTGAGACATCATTGGATAAAGAAACAATGAAATCTGTTATCTTAACTTGCACCAACAGGTTTCTGGCTATTCTTGTCATAATATTTGGG
ATATGGATGGGCACACACCACGATGGTTGTCGAAAATCTCTCACTCTTCCTGTTATGGGACTTGTATCTGATCATGCTGTGCATTACATTGGTGGGGATCCTCGTGTTTA
CGGTGCTAGCCTTAATAATTCTGATAATTGGATGCGTCTGAAAAGCTGTCTTGTCAAATCTGAGGATTGCAATAATCTTTCTAAAAGATACAAGACTCTCAAGCAGTACA
AATTGGCAAAGCTGACGCCAATGGAAGCGGGTTGTTGCAGACCCCCATCAGAATGTGGTTATCCTGCTGTTAATGCATCATACTATGACTTGAGCTTTCATCCTGTCAAT
TCAAACCATGATTGCAAACTATACAAGAATTCCAAGGCAGTCAAGTGTTACGATTGTGATTCGTGCAAGGCCGGGGTCGCACAGTACATGAAAACTGAATGGCGATTGGT
TGCAATCTTCAATGTCATTCTATTTGTTGTCTTGTCGGATGCTGCGCTAGACGAAAAGCTGCAAAAAATCGGTCTAAAGGCTAGCGGCATAGCAGTTGTGCTCAACAGAC
TGATTAAACACTGA
mRNA sequenceShow/hide mRNA sequence
ATTTTTCTGGCGGATAAACGCCACGGTCCCAAACTTCAGCACCTTCTATCGTCTTCTTCCTACTCCCACAGTTTCAGAAACACACAAAGAAAGGCTCCATAGCTCAATCT
CATACCCATAATCTCCTGAATTCAACCTCATCCGCCTTCGATTACATTGAGTTTAATGACACCGACCACTGTCAAACAAGTGTAATTTTCACTGCAATTCCCTTTCCAAG
AAATGGATGGTCGCGCTCTAACGGCCTCTTCCTTCTTCACTCCCATTGATTTATTGCGACCCAGAAGAGGAGCTGTTAGAAATCTGTGTTTTAATGGCAGACCCAGTAAG
TTTTCTGTTCTTGCTTCCAAAGAAGAGGCTGAACTCGACCGTTGGGACCAAATGGAGCTCAAGTTCGGCCGCTTGATTGGGGAAGACCCCAAATTAACATTGGCCAAGAT
AATGAGCAAAAAAATGAATCCAGATGCTTCTTATCTTGAAGTTGAGAAATCATTTTACCAGAAGAAGGGAAAGTCTAACGAGGTAGAGGAACTTTCTCTTGATGGGTTGA
ATTTGGTCAGACCTCAGCTAAAGAAGGAAATGAAGTTAAAGGCTGCCAATAAGCCACCTGCACCAGATTTAAAGAAACCAAGCCAAGCAGTTGGAAAGGTAGCAGTTAGT
CCTAAAGGTAGGGTTCCCAATGTTATTTTGAGAAAACCGACGATTTATAATGAGGACGATGTTGAAGATAAGCCGTCACGGATAAGGATGAAGCCAAATTTATCATTGAA
AATGAGCAATGTATCAACAAAGGAGCAATATAGTGATATGACACTGTTGAGGAAGCCAGAACCAATGACTTCAAATGAAATTATTGATGAGAAGGAAAAGCTCTCTGGTG
ATGGGTATGTCGATGTTGTTGAAAATCGGGCTACCAAGGGATCAACAAGTGACCGAGTTGATGATTTTACTCTTTCTAAGAAGCCAGAAATAGGAGGTGACCAAACAAGA
CTTGAAAATGAAAATGATCATAAAAATCTGGCTTATTCAGAAAGTGATACGGTTGATGGTAAAGAGGAGAATGACATTGAAGACTTACATGTCCTAAAAAAGCCTTTAAA
TATGATGTCTAGCATGTCTGAAGAAACTGGAGTAGGCTCGTCAACAAATGAAAACAGGAAAGATATTGATTATTCTCCTATAGGATTACAGCTACATGAGCCAAGTGATG
TAGATTATGGTGAGGACCCAGCAGCTTTGAGCGAATCATTCAGTGATATTTTGGATTCAACAATAGAGGTGTCCAAAAAAGCTACATTATTGGGTAAACCAAGAAGAGTT
GATCATTCTTCCAAGGAAGCACTAAAACTCCGTAGAGAAGAGGCCTCCACCCCTGAAACAGACGTCAATGGTGCCGTTGAGACAGAGACCTTCTCTGCTATCCCTGCTTT
GGAGGAACACGAACTTGCTGACTGGACTAAAGCAGAGGATCTGGCTAAGTCAGGAGACAGAGCTGACGTGGAGATAATAAGCTCGAGTACTCGAGGTTTTGTTGTATCAT
TTGGCTCCCTAGTAGGATTTATTCCATATCGTAATCTTGCTGCCAAGTGGAAGTTCTTGGCTTTTGAGTCATGGCTAAGACAGAAAGGTTTGGATCCATCAATATACAAG
CAAAACTTAGGAACTATTGGAAGTAGTGATGGTGGAAGCCAGGCCTTTGCCAGTACTAGGCCGAATCTGGAAATTGATGTAAAAGATGGGGGAGAACTTACACCTGATAT
GAAATTGGAGGATCTTCTTCAAATTTATGATCGAGAGAAAATCAAATTCTTGTCATCATTTGTTGGCCAGAAAATCAAAGTAAATGTGGTGTTGGCAAACAGAAAATCAA
GGAAGCTTATATTTTCCGTAAGGCCAAAAGAGAGGGAGGAATTGGTCGAGAAAAAGAGGAGTTTAATGGCGACGCTGCAAGTAGGTGATGTTGTCAAATGCTGCATTAAG
AAGATTGCTTATTTTGGTATCTTTGTTGAGATCGAAGGAGTGCCTGCTTTGATTCATCAGACAGAGGTTTCCTGGGACGCTACTTTAAACCCTGCATCATATTTTAAAAT
TGGTCAGGTTGTAGAGGCCAAAGTTCATCAGTTGGATTTTTCACTCGAACGCATATTCTTATCATTGAAGCAGATTACACCAGACCCGCTAGCGGAAGCATTGGAGTCTG
TGGTTGGTGATCATGATCCCATGGATGGAAGATTAGATTCAGCCGAAGTAGATACCGAGTGGGCTGATGTAGAATCTCTCATCAAGGAACTGGAAAATACTGAAGGTATT
GAGTCTGTATCCAAAGAGCGTTTTTTCCTTAGCCCTGGATTGGCTCCAACCTTTCAGGTATATATGGCTTCCATGTATGAAAATCAGTACAAATTACTTGCTCGATCTGG
AAACAAAGTACAAGAGCTGATGGTTGAGACATCATTGGATAAAGAAACAATGAAATCTGTTATCTTAACTTGCACCAACAGGTTTCTGGCTATTCTTGTCATAATATTTG
GGATATGGATGGGCACACACCACGATGGTTGTCGAAAATCTCTCACTCTTCCTGTTATGGGACTTGTATCTGATCATGCTGTGCATTACATTGGTGGGGATCCTCGTGTT
TACGGTGCTAGCCTTAATAATTCTGATAATTGGATGCGTCTGAAAAGCTGTCTTGTCAAATCTGAGGATTGCAATAATCTTTCTAAAAGATACAAGACTCTCAAGCAGTA
CAAATTGGCAAAGCTGACGCCAATGGAAGCGGGTTGTTGCAGACCCCCATCAGAATGTGGTTATCCTGCTGTTAATGCATCATACTATGACTTGAGCTTTCATCCTGTCA
ATTCAAACCATGATTGCAAACTATACAAGAATTCCAAGGCAGTCAAGTGTTACGATTGTGATTCGTGCAAGGCCGGGGTCGCACAGTACATGAAAACTGAATGGCGATTG
GTTGCAATCTTCAATGTCATTCTATTTGTTGTCTTGTCGGATGCTGCGCTAGACGAAAAGCTGCAAAAAATCGGTCTAAAGGCTAGCGGCATAGCAGTTGTGCTCAACAG
ACTGATTAAACACTGA
Protein sequenceShow/hide protein sequence
MDGRALTASSFFTPIDLLRPRRGAVRNLCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNEVEELSLDGLN
LVRPQLKKEMKLKAANKPPAPDLKKPSQAVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLRKPEPMTSNEIIDEKEKLSGD
GYVDVVENRATKGSTSDRVDDFTLSKKPEIGGDQTRLENENDHKNLAYSESDTVDGKEENDIEDLHVLKKPLNMMSSMSEETGVGSSTNENRKDIDYSPIGLQLHEPSDV
DYGEDPAALSESFSDILDSTIEVSKKATLLGKPRRVDHSSKEALKLRREEASTPETDVNGAVETETFSAIPALEEHELADWTKAEDLAKSGDRADVEIISSSTRGFVVSF
GSLVGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRPNLEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSR
KLIFSVRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESV
VGDHDPMDGRLDSAEVDTEWADVESLIKELENTEGIESVSKERFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQELMVETSLDKETMKSVILTCTNRFLAILVIIFG
IWMGTHHDGCRKSLTLPVMGLVSDHAVHYIGGDPRVYGASLNNSDNWMRLKSCLVKSEDCNNLSKRYKTLKQYKLAKLTPMEAGCCRPPSECGYPAVNASYYDLSFHPVN
SNHDCKLYKNSKAVKCYDCDSCKAGVAQYMKTEWRLVAIFNVILFVVLSDAALDEKLQKIGLKASGIAVVLNRLIKH