; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G095980 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G095980
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionUnknown protein
Genome locationCiama_Chr05:27127161..27130807
RNA-Seq ExpressionCaUC05G095980
SyntenyCaUC05G095980
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145517.1 uncharacterized protein LOC101216814 isoform X2 [Cucumis sativus]9.5e-6638.3Show/hide
Query:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSAT
        DQWL+AAMAD+T+VA+LL RLKQSQAVLPSKS L M++PFTWGI+QPRSR+STA A   ATV VRC DVVL RN KDVDSTRCSPTTPLSWSGGASPSAT
Subjt:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSAT

Query:  LDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRR
        LDG+EESSRPATLS AASRF                                                                                
Subjt:  LDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRR

Query:  GFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSS
                                                                                                            
Subjt:  GFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSS

Query:  IADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTG
                                                                                                 KG A NES  G
Subjt:  IADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTG

Query:  NATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN----
        N TKRLRRKK               TFAELKEEES+LLKEK+HLKMELATLRAN EEQR+KNESLKKMK           VD N K TEKF TNSN    
Subjt:  NATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN----

Query:  ---IMPEESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED
           +MPEESSSTLTHQRESS+  T+PF   GSGS EAQSQKN KSTEE+CVFLLPDLNM PSED
Subjt:  ---IMPEESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED

XP_016901256.1 PREDICTED: uncharacterized protein LOC103493773 isoform X1 [Cucumis melo]3.3e-6638.08Show/hide
Query:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSAT
        DQWL+AAMADDT+VA+LL RLKQSQAVLPSKS L M++PFTWGI+QPRSR+STA A   ATV+VRC DVVL RN KDVDSTRCSPTTPLSWSGGASPSAT
Subjt:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSAT

Query:  LDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRR
        LDG+EESSRPATLS AASRFK                                                                               
Subjt:  LDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRR

Query:  GFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSS
                                                                                                            
Subjt:  GFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSS

Query:  IADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTG
                                                                                                    A NES  G
Subjt:  IADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTG

Query:  NATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN----
        N TKRLRRKK               TFAELKEEES+LLKEK+HLKMELATLRAN EEQR+KNE LKKMK           VD NLK  EKFS NSN    
Subjt:  NATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN----

Query:  -IMPEESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED
         +MPEESSSTLTHQRESS+  T+PF   GSGS +AQSQKN KSTEE+CVFLLPDLNM PSED
Subjt:  -IMPEESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED

XP_022976190.1 uncharacterized protein LOC111476653 [Cucurbita maxima]4.4e-6337.72Show/hide
Query:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASP
        M++D+WLTAAMA+D+VVA+LL RLKQSQA  PSKS   M LPF WG+RQPRSR  TAAA  MA V VRC DVVL RN KDVDSTRCSPTTPLSWSGGASP
Subjt:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASP

Query:  SATLDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGR
        SATLDGYE SSR ATL H ASRF                                                                             
Subjt:  SATLDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGR

Query:  RRRGFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYN
                                                                                                            
Subjt:  RRRGFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYN

Query:  LSSIADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANES
                                                                                                    KG  ANES
Subjt:  LSSIADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANES

Query:  VTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSNI
             TKRLRRKK               TFAELKEEE+ LLKEKLHLKMELATL+A+FEEQR+KNESLKKMK           VDFNLK  EKF+TNSN+
Subjt:  VTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSNI

Query:  MPEESSSTLTHQRESSNIE----TIPFKGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED
        M EESSSTLTHQRESSNIE    T+PF GSGS EAQSQK SKSTEE+ VFLLPDLNMTPSED
Subjt:  MPEESSSTLTHQRESSNIE----TIPFKGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED

XP_031739969.1 uncharacterized protein LOC101216814 isoform X1 [Cucumis sativus]2.3e-6738.65Show/hide
Query:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSAT
        DQWL+AAMAD+T+VA+LL RLKQSQAVLPSKS L M++PFTWGI+QPRSR+STA A   ATV VRC DVVL RN KDVDSTRCSPTTPLSWSGGASPSAT
Subjt:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSAT

Query:  LDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRR
        LDG+EESSRPATLS AASRFKVL+                                                                            
Subjt:  LDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRR

Query:  GFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSS
                                                                                                            
Subjt:  GFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSS

Query:  IADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTG
                                                                                                  G A NES  G
Subjt:  IADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTG

Query:  NATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN----
        N TKRLRRKK               TFAELKEEES+LLKEK+HLKMELATLRAN EEQR+KNESLKKMK           VD N K TEKF TNSN    
Subjt:  NATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN----

Query:  ---IMPEESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED
           +MPEESSSTLTHQRESS+  T+PF   GSGS EAQSQKN KSTEE+CVFLLPDLNM PSED
Subjt:  ---IMPEESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED

XP_038897737.1 uncharacterized protein LOC120085676 [Benincasa hispida]3.7e-7841.08Show/hide
Query:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRIST--------AAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPL
        + DDQWLTAAMADDT+VAELLFRLKQSQAVLPSKS L +T+PFTWGIRQPRSR+ST        AAA  MATVAVRC DVVLHRN KDVDSTRCSPTTPL
Subjt:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRIST--------AAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPL

Query:  SWSGGASPSATLDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWC
        SWSGGASPSATLDG+E+SSRPATLS AASRF                                                                     
Subjt:  SWSGGASPSATLDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWC

Query:  RLRILGGRRRRGFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVT
                                                                                                            
Subjt:  RLRILGGRRRRGFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVT

Query:  HEIFPIYNLSSIADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNL
                                                                                                            
Subjt:  HEIFPIYNLSSIADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNL

Query:  KGVAANESVTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTE
        KG A NESV GN TKRLRRKK               TFAELKEEESMLLKEKLHLKMELATLRANFEEQR+KNESLKKMK           VDFNLK TE
Subjt:  KGVAANESVTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTE

Query:  KFSTNSNIMPEESSSTLTHQRESSNIE----TIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED
        KFSTNSN+MPEES+STLTHQRESSNIE    T+PF   GSGSFEAQSQKN KSTEE+C FLLPDLNMTPSED
Subjt:  KFSTNSNIMPEESSSTLTHQRESSNIE----TIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED

TrEMBL top hitse value%identityAlignment
A0A0A0L0G9 Uncharacterized protein5.2e-6237.7Show/hide
Query:  MADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSATLDGYEES
        MAD+T+VA+LL RLKQSQAVLPSKS L M++PFTWGI+QPRSR+STA A   ATV VRC DVVL RN KDVDSTRCSPTTPLSWSGGASPSATLDG+EES
Subjt:  MADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSATLDGYEES

Query:  SRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRRGFSDENI
        SRPATLS AASRF                                                                                       
Subjt:  SRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRRGFSDENI

Query:  LISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSSIADGLRR
                                                                                                            
Subjt:  LISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSSIADGLRR

Query:  DHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTGNATKRLR
                                                                                          KG A NES  GN TKRLR
Subjt:  DHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTGNATKRLR

Query:  RKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN-------IMPE
        RKK               TFAELKEEES+LLKEK+HLKMELATLRAN EEQR+KNESLKKMK           VD N K TEKF TNSN       +MPE
Subjt:  RKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN-------IMPE

Query:  ESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED
        ESSSTLTHQRESS+  T+PF   GSGS EAQSQKN KSTEE+CVFLLPDLNM PSED
Subjt:  ESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED

A0A1S4DZ38 uncharacterized protein LOC103493773 isoform X11.6e-6638.08Show/hide
Query:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSAT
        DQWL+AAMADDT+VA+LL RLKQSQAVLPSKS L M++PFTWGI+QPRSR+STA A   ATV+VRC DVVL RN KDVDSTRCSPTTPLSWSGGASPSAT
Subjt:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSAT

Query:  LDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRR
        LDG+EESSRPATLS AASRFK                                                                               
Subjt:  LDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRR

Query:  GFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSS
                                                                                                            
Subjt:  GFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSS

Query:  IADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTG
                                                                                                    A NES  G
Subjt:  IADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTG

Query:  NATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN----
        N TKRLRRKK               TFAELKEEES+LLKEK+HLKMELATLRAN EEQR+KNE LKKMK           VD NLK  EKFS NSN    
Subjt:  NATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN----

Query:  -IMPEESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED
         +MPEESSSTLTHQRESS+  T+PF   GSGS +AQSQKN KSTEE+CVFLLPDLNM PSED
Subjt:  -IMPEESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED

A0A5A7VE69 Uncharacterized protein1.6e-6638.08Show/hide
Query:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSAT
        DQWL+AAMADDT+VA+LL RLKQSQAVLPSKS L M++PFTWGI+QPRSR+STA A   ATV+VRC DVVL RN KDVDSTRCSPTTPLSWSGGASPSAT
Subjt:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSAT

Query:  LDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRR
        LDG+EESSRPATLS AASRFK                                                                               
Subjt:  LDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRR

Query:  GFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSS
                                                                                                            
Subjt:  GFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSS

Query:  IADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTG
                                                                                                    A NES  G
Subjt:  IADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTG

Query:  NATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN----
        N TKRLRRKK               TFAELKEEES+LLKEK+HLKMELATLRAN EEQR+KNE LKKMK           VD NLK  EKFS NSN    
Subjt:  NATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSN----

Query:  -IMPEESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED
         +MPEESSSTLTHQRESS+  T+PF   GSGS +AQSQKN KSTEE+CVFLLPDLNM PSED
Subjt:  -IMPEESSSTLTHQRESSNIETIPF--KGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED

A0A6J1FA47 uncharacterized protein LOC1114436683.6e-6338.01Show/hide
Query:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASP
        M++D+WLTAAMA+D+VVA+LL RLKQSQA  PSKS   M LPF WG+RQPRSR  TAAA  MA V +RC DVVL RN KDVDSTRCSPTTPLSWSGGASP
Subjt:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASP

Query:  SATLDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGR
        SATLDGYE SSR ATL H ASRF                                                                             
Subjt:  SATLDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGR

Query:  RRRGFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYN
                                                                                                            
Subjt:  RRRGFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYN

Query:  LSSIADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANES
                                                                                                    KG  ANES
Subjt:  LSSIADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANES

Query:  VTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSNI
          G  TKRLRRKK               TFAELKEEE+ LLKEKLHLKMELATL+A+FEEQRSKNESLKKMK           VDFNLK  EKFSTNSN+
Subjt:  VTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSNI

Query:  MPEESSSTLTHQRESS-NIE----TIPFKGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED
        M EESSSTLTHQRESS NIE    TIPF GSGS EAQSQK SKSTEE+ VFLLPDLNM PSED
Subjt:  MPEESSSTLTHQRESS-NIE----TIPFKGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED

A0A6J1IIT2 uncharacterized protein LOC1114766532.1e-6337.72Show/hide
Query:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASP
        M++D+WLTAAMA+D+VVA+LL RLKQSQA  PSKS   M LPF WG+RQPRSR  TAAA  MA V VRC DVVL RN KDVDSTRCSPTTPLSWSGGASP
Subjt:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASP

Query:  SATLDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGR
        SATLDGYE SSR ATL H ASRF                                                                             
Subjt:  SATLDGYEESSRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGR

Query:  RRRGFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYN
                                                                                                            
Subjt:  RRRGFSDENILISEKPRNLIPYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYN

Query:  LSSIADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANES
                                                                                                    KG  ANES
Subjt:  LSSIADGLRRDHSKRLETSLQPLSFLFPLFLTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANES

Query:  VTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSNI
             TKRLRRKK               TFAELKEEE+ LLKEKLHLKMELATL+A+FEEQR+KNESLKKMK           VDFNLK  EKF+TNSN+
Subjt:  VTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSNI

Query:  MPEESSSTLTHQRESSNIE----TIPFKGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED
        M EESSSTLTHQRESSNIE    T+PF GSGS EAQSQK SKSTEE+ VFLLPDLNMTPSED
Subjt:  MPEESSSTLTHQRESSNIE----TIPFKGSGSFEAQSQKNSKSTEEECVFLLPDLNMTPSED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15800.1 unknown protein2.7e-1037.8Show/hide
Query:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLP-SKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWS----
        M    W+  AM DD++VAE L  L  ++  LP +KS  +  L   W +RQPR++ +T                   R   D D TR SPTTPLSWS    
Subjt:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLP-SKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWS----

Query:  ---GGASPSATLDGYEESSRPATLSHA
           GG   +A +DG+EESS    LS A
Subjt:  ---GGASPSATLDGYEESSRPATLSHA

AT1G15800.1 unknown protein4.8e-0740.59Show/hide
Query:  SVTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKA--SENNKSYFMQVDFNLKCTEKFSTN
        SVT +  KR R+KK               T A+LKEEES+LLKE+  L+ ELAT++   ++QR++NESLKK++A   +N+ S F+  D N+      S  
Subjt:  SVTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKA--SENNKSYFMQVDFNLKCTEKFSTN

Query:  S
        S
Subjt:  S

AT1G80610.1 unknown protein1.6e-0734.15Show/hide
Query:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASP
        M  + W+  AM+DD++VAE L RL+ S+   P     +  L   W +RQ RS                          K  D TR SPTTPLSWSG  S 
Subjt:  MMDDQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASP

Query:  S------------ATLDGYEESS
        S             T++G EESS
Subjt:  S------------ATLDGYEESS

AT1G80610.1 unknown protein6.9e-0656.6Show/hide
Query:  RISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENN
        R  +T AELKEEE MLLKE   LK ELA +R   E+QR++N +LKKMKA   +
Subjt:  RISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENN

AT4G32030.1 unknown protein6.9e-1442.98Show/hide
Query:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGG------
        D W+  A+ DD +V ELL RLK +  V+    ++ +  P  WGIRQ RSR S         V+++          KDVDS R SP TPLSWSGG      
Subjt:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGG------

Query:  -ASPSATLDGYEESSRPATLS
         ASPSA  DG+E++SR A+ S
Subjt:  -ASPSATLDGYEESSRPATLS

AT4G32030.1 unknown protein1.4e-0328.48Show/hide
Query:  GRNLKGVAANESVTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNL
        G   K    NE +T   +KRL+++K                  ELK EE++ LKE+L L+ E+A+LRA F+EQ  +N+ LK++K   N+           
Subjt:  GRNLKGVAANESVTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESMLLKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNL

Query:  KCTEKFSTNSNIMPEESSSTLTHQRESSNIETIPFKGSGSFEAQSQKNSKSTEEECVF
               TN   +     S L   + S + +T   +  GSF      N   +EEE ++
Subjt:  KCTEKFSTNSNIMPEESSSTLTHQRESSNIETIPFKGSGSFEAQSQKNSKSTEEECVF

AT4G32030.2 unknown protein6.9e-1442.98Show/hide
Query:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGG------
        D W+  A+ DD +V ELL RLK +  V+    ++ +  P  WGIRQ RSR S         V+++          KDVDS R SP TPLSWSGG      
Subjt:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGG------

Query:  -ASPSATLDGYEESSRPATLS
         ASPSA  DG+E++SR A+ S
Subjt:  -ASPSATLDGYEESSRPATLS

AT5G25210.1 unknown protein1.4e-1138.97Show/hide
Query:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGG-----A
        D W   AM D  VVAELL +LK+++ +   K+ +   L   WGI+QPRSR                         +    +RCSP+TPLSWSGG     +
Subjt:  DQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGG-----A

Query:  SPSATLDGYEESSRPATLSHAASRFKVLDSF-SPFS
        SPS  +DGYE +SR   +S   SR K + S  SPFS
Subjt:  SPSATLDGYEESSRPATLSHAASRFKVLDSF-SPFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGACGACCAATGGCTTACGGCTGCCATGGCCGACGACACTGTCGTCGCGGAGTTATTGTTCAGGTTGAAGCAGTCGCAGGCTGTATTGCCTTCCAAATCTTCTCT
TTCTATGACGCTGCCGTTTACTTGGGGGATTAGGCAACCCAGGTCTAGGATTTCTACGGCAGCCGCGCCAACTATGGCTACGGTGGCTGTCAGATGTGCCGATGTTGTCT
TGCACCGGAATACTAAGGATGTTGACTCCACTAGATGTAGTCCTACGACGCCGCTTTCATGGAGCGGTGGAGCTTCTCCTTCCGCTACGCTTGACGGTTATGAGGAGTCG
AGCCGTCCCGCTACTCTCTCCCATGCTGCTTCCAGATTTAAGGTTCTTGACTCTTTCTCTCCATTCTCCGTTATGCCGTCTGTTTTGGATTTTTGTAGTTTCGTCTCTTT
TCTCGGCTCTGTGTGCTCTGCCTTTCCGATCTTTCTGATTTTTGTCTTTTCTATTTTTATTTTTTTGAATTTTGATACGAGTTTTCAAATCTTAACGCTGCCGTTCTTAA
TTGCGTTTTTTTCTCGGCGGTGGTGTCGGCTGCGGATTTTAGGCGGCCGGCGGCGGCGTGGATTCTCAGATGAAAACATTCTTATATCGGAGAAGCCTCGTAATCTGATT
CCGTATCGTGAAATCTCTCCTGATTTCCTCTTCATCAGAAGAAGGTCAGCGGATGAACGCTGGTTGAAAATACTTCGCGTTTTCTTCTCGCCTCTTTTCAGCTCCGTCTG
TCGATTCGAAATCCCTTATTTCTTCAAATTGACCAAAACGCCCTTCAAATCCCTCGCTAGATTCCAAACCTTTCCAAATCGTACATCGAGTCGCTGTTTCGTCACGCACG
AGATTTTTCCAATATATAATCTCTCTTCCATCGCCGACGGTCTACGCAGGGACCATAGCAAGCGCCTCGAAACTTCCCTACAGCCATTGTCCTTTCTTTTTCCCCTCTTT
TTGACCGTATTGCCCCTCGACGCCCGGTCTGGTCGAACCGCCGGTCAGATTCGACCGTCTACGACGCTTCAAAGTGAGGGCCGTAATCGGATCAGTTGTTGGGCTGAGAG
GAGGAAAAGGAAGATCCACGTGACCGAGTCACGCGAGGTGGGAAAGCAAGAATGGTTCGGGTGTGGTCGCAATCTTAAAGGTGTTGCTGCAAATGAGTCGGTGACTGGTA
ATGCTACAAAGAGGTTAAGACGAAAGAAGGAAAAATCAATTATGGATTTGTTGGGGTTTGGCCGTATAAGCCAAACATTTGCTGAACTTAAGGAAGAGGAAAGCATGCTT
TTGAAGGAGAAACTTCATCTTAAAATGGAACTGGCCACACTACGGGCAAACTTCGAAGAACAAAGATCCAAGAATGAAAGTTTGAAGAAAATGAAGGCTTCTGAGAACAA
TAAATCCTACTTTATGCAGGTGGATTTCAACTTAAAATGCACAGAGAAATTCAGTACAAACTCCAATATAATGCCGGAGGAATCTTCATCCACACTAACCCATCAAAGGG
AAAGCTCTAACATTGAAACAATTCCATTCAAAGGATCAGGTTCTTTCGAGGCTCAATCACAGAAGAACAGCAAATCAACCGAAGAAGAATGTGTCTTTCTTTTACCAGAT
TTAAATATGACACCTTCAGAGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGGACGACCAATGGCTTACGGCTGCCATGGCCGACGACACTGTCGTCGCGGAGTTATTGTTCAGGTTGAAGCAGTCGCAGGCTGTATTGCCTTCCAAATCTTCTCT
TTCTATGACGCTGCCGTTTACTTGGGGGATTAGGCAACCCAGGTCTAGGATTTCTACGGCAGCCGCGCCAACTATGGCTACGGTGGCTGTCAGATGTGCCGATGTTGTCT
TGCACCGGAATACTAAGGATGTTGACTCCACTAGATGTAGTCCTACGACGCCGCTTTCATGGAGCGGTGGAGCTTCTCCTTCCGCTACGCTTGACGGTTATGAGGAGTCG
AGCCGTCCCGCTACTCTCTCCCATGCTGCTTCCAGATTTAAGGTTCTTGACTCTTTCTCTCCATTCTCCGTTATGCCGTCTGTTTTGGATTTTTGTAGTTTCGTCTCTTT
TCTCGGCTCTGTGTGCTCTGCCTTTCCGATCTTTCTGATTTTTGTCTTTTCTATTTTTATTTTTTTGAATTTTGATACGAGTTTTCAAATCTTAACGCTGCCGTTCTTAA
TTGCGTTTTTTTCTCGGCGGTGGTGTCGGCTGCGGATTTTAGGCGGCCGGCGGCGGCGTGGATTCTCAGATGAAAACATTCTTATATCGGAGAAGCCTCGTAATCTGATT
CCGTATCGTGAAATCTCTCCTGATTTCCTCTTCATCAGAAGAAGGTCAGCGGATGAACGCTGGTTGAAAATACTTCGCGTTTTCTTCTCGCCTCTTTTCAGCTCCGTCTG
TCGATTCGAAATCCCTTATTTCTTCAAATTGACCAAAACGCCCTTCAAATCCCTCGCTAGATTCCAAACCTTTCCAAATCGTACATCGAGTCGCTGTTTCGTCACGCACG
AGATTTTTCCAATATATAATCTCTCTTCCATCGCCGACGGTCTACGCAGGGACCATAGCAAGCGCCTCGAAACTTCCCTACAGCCATTGTCCTTTCTTTTTCCCCTCTTT
TTGACCGTATTGCCCCTCGACGCCCGGTCTGGTCGAACCGCCGGTCAGATTCGACCGTCTACGACGCTTCAAAGTGAGGGCCGTAATCGGATCAGTTGTTGGGCTGAGAG
GAGGAAAAGGAAGATCCACGTGACCGAGTCACGCGAGGTGGGAAAGCAAGAATGGTTCGGGTGTGGTCGCAATCTTAAAGGTGTTGCTGCAAATGAGTCGGTGACTGGTA
ATGCTACAAAGAGGTTAAGACGAAAGAAGGAAAAATCAATTATGGATTTGTTGGGGTTTGGCCGTATAAGCCAAACATTTGCTGAACTTAAGGAAGAGGAAAGCATGCTT
TTGAAGGAGAAACTTCATCTTAAAATGGAACTGGCCACACTACGGGCAAACTTCGAAGAACAAAGATCCAAGAATGAAAGTTTGAAGAAAATGAAGGCTTCTGAGAACAA
TAAATCCTACTTTATGCAGGTGGATTTCAACTTAAAATGCACAGAGAAATTCAGTACAAACTCCAATATAATGCCGGAGGAATCTTCATCCACACTAACCCATCAAAGGG
AAAGCTCTAACATTGAAACAATTCCATTCAAAGGATCAGGTTCTTTCGAGGCTCAATCACAGAAGAACAGCAAATCAACCGAAGAAGAATGTGTCTTTCTTTTACCAGAT
TTAAATATGACACCTTCAGAGGATTGA
Protein sequenceShow/hide protein sequence
MMDDQWLTAAMADDTVVAELLFRLKQSQAVLPSKSSLSMTLPFTWGIRQPRSRISTAAAPTMATVAVRCADVVLHRNTKDVDSTRCSPTTPLSWSGGASPSATLDGYEES
SRPATLSHAASRFKVLDSFSPFSVMPSVLDFCSFVSFLGSVCSAFPIFLIFVFSIFIFLNFDTSFQILTLPFLIAFFSRRWCRLRILGGRRRRGFSDENILISEKPRNLI
PYREISPDFLFIRRRSADERWLKILRVFFSPLFSSVCRFEIPYFFKLTKTPFKSLARFQTFPNRTSSRCFVTHEIFPIYNLSSIADGLRRDHSKRLETSLQPLSFLFPLF
LTVLPLDARSGRTAGQIRPSTTLQSEGRNRISCWAERRKRKIHVTESREVGKQEWFGCGRNLKGVAANESVTGNATKRLRRKKEKSIMDLLGFGRISQTFAELKEEESML
LKEKLHLKMELATLRANFEEQRSKNESLKKMKASENNKSYFMQVDFNLKCTEKFSTNSNIMPEESSSTLTHQRESSNIETIPFKGSGSFEAQSQKNSKSTEEECVFLLPD
LNMTPSED