CuGenDBv2

Carg19237 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg19237
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: transcription factor SPT20 homolog isoform X1
Genome location: Carg_Chr19:8484843..8491875
RNA-Seq Expression: Carg19237
Synteny: Carg19237
Gene Ontology terms: NA
InterPro domains: IPR010820 - UBA-like domain DUF1421


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6589219.1 hypothetical protein SDJN03_17784, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 75.23)
Query:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS
        MASGSAGRPNSGSK FDFG++D+LCSYEDYGNQESSNG+H+DLSVANSSKDFHKSR+STVYPAAAY QPEDS+KQDVISTVENSMKKYSDNILRFLEGIS
Subjt:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS

Query:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI
        SRLSQLELN YNLDKSVGEMRSD+IRDHEE DLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHL+QKES  S HSHSNEERASP A DP KNE 
Subjt:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI

Query:  PSKNHNQQLALALPHQIVPQQH---PPPPAALPENVPQQQPYY------------IQHPQSQHQM----TNAHAQLSQTPPPPPQQFSQYQQQW------
        PS+NHNQQLALALPHQ++ QQ+   PPPPAALP+N+PQQQ YY            IQH Q Q+Q      +   Q SQT  PPPQQF+QY QQW      
Subjt:  PSKNHNQQLALALPHQIVPQQH---PPPPAALPENVPQQQPYY------------IQHPQSQHQM----TNAHAQLSQTPPPPPQQFSQYQQQW------

Query:  PQQPPQQAQPP-QQHPSMQPQIRLPPTSVYSSYSMNQPTSMPET----PPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEG
         QQPPQ  QPP QQ PSMQPQIR  P+SVY SYSMNQPTSMPET     PMQ +FSP+PQPGSSR+DTV YGY GS  T+PQQPPQVKNAF  GP AGEG
Subjt:  PQQPPQQAQPP-QQHPSMQPQIRLPPTSVYSSYSMNQPTSMPET----PPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEG

Query:  YLPSGPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPAN--VQIPQHPSGPHVMARNHPNQSHFMRNQ
        YLPSGPQ  LSSGG+YM+YDRE+GR               PHH PQPQ         QQPHFNQ  YPPAN  +QIPQ  SGPHV+ARN P+ +H MRNQ
Subjt:  YLPSGPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPAN--VQIPQHPSGPHVMARNHPNQSHFMRNQ

Query:  NHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLSNSG-----------------------------GRPNSAPKSFDFGSDNILCSFED
        +HPYGEIV+KLVGMGFRSDHI SVIHRMEESGQPIDFNAVLDGLSN G                             GR NSAPK+FDFGSD+ILCS+ED
Subjt:  NHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLSNSG-----------------------------GRPNSAPKSFDFGSDNILCSFED

Query:  YTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAA-YGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSDLARD
        Y KQ+ SNGSHS+PVSV NSSKDFHK RMST FP AA YGQPDDSI QDVI+AVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSD+ARD
Subjt:  YTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAA-YGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSDLARD

Query:  HEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEER---VSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITAP
        HEE +SKLKS+EKH+QEVHRSVQIIRDKQELAETQKDLAKLQV QKEPS SSHSQ+NEER   V++DPKKNEN SEIH QQLALALPHQIVPQQNPI AP
Subjt:  HEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEER---VSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITAP

Query:  PSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQT-PQPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPP
         SA LP NVP QQQSYYIS +QL G QP HIQHA  QYIS D QHRA QPQDVS  TNPQLSQ+ PQPFNQYQQQWAQ PSQ  QPPQQ+SMQPQIRPPP
Subjt:  PSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQT-PQPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPP

Query:  TSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRESGR
        TS YP    PPNQP+S+PETLSS+    MSFASIP PGSSR D VPYGYAAASGGS+PQQPPQVKN YGPATGEGY+PPGQ        AYMMYDRESGR
Subjt:  TSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRESGR

Query:  PPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQR
        PPHH PQQP        HF+QSGYPPANAPHQ+ PQA T P VS+RNPSHSHLIEKLVGMGFRGDHV +IIQRMED G+ VDFN VLDRLS+ A PGPQR
Subjt:  PPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQR

Query:  AW
        AW
Subjt:  AW

KAG7011974.1 hypothetical protein SDJN02_26882, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; % identity: 100)
Query:  KIEIRFYKITLNQIHGEFIRGKSHKEKLSFSIIAKKKKKIIYFSIFCRSLNRSPFYFRLSDFPLRSMASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQE
        KIEIRFYKITLNQIHGEFIRGKSHKEKLSFSIIAKKKKKIIYFSIFCRSLNRSPFYFRLSDFPLRSMASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQE
Subjt:  KIEIRFYKITLNQIHGEFIRGKSHKEKLSFSIIAKKKKKIIYFSIFCRSLNRSPFYFRLSDFPLRSMASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQE

Query:  SSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLK
        SSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLK
Subjt:  SSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLK

Query:  LKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVP
        LKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVP
Subjt:  LKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVP

Query:  QQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSS
        QQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSS
Subjt:  QQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSS

Query:  RMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQ
        RMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQ
Subjt:  RMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQ

Query:  SGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLSNSGGRPNSAPKSFDFGSDNIL
        SGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLSNSGGRPNSAPKSFDFGSDNIL
Subjt:  SGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLSNSGGRPNSAPKSFDFGSDNIL

Query:  CSFEDYTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSD
        CSFEDYTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSD
Subjt:  CSFEDYTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSD

Query:  LARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITA
        LARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITA
Subjt:  LARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITA

Query:  PPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPP
        PPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPP
Subjt:  PPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPP

Query:  TSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRESGR
        TSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRESGR
Subjt:  TSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRESGR

Query:  PPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQR
        PPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQR
Subjt:  PPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQR

Query:  AW
        AW
Subjt:  AW

KAG7022919.1 hypothetical protein SDJN02_16655, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; % identity: 75.62)
Query:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS
        MASGSAGRPNSGSK FDFG++D+LCSYEDYGNQESSNG+H+DLSVANSSKDFHKSR+STVYPAAAY QPEDS+KQDVISTVENSMKKYSDNILRFLEGIS
Subjt:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS

Query:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI
        SRLSQLELN YNLDKSVGEMRSD+IRDHEE DLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHL+QKES  S HSHSNEERASP A DP KNE 
Subjt:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI

Query:  PSKNHNQQLALALPHQIVPQQH---PPPPAALPENVPQQQPYY------------IQHPQSQHQM----TNAHAQLSQTPPPPPQQFSQYQQQW-----P
        PS+NHNQQLALALPHQ++ QQ+   PPPPAALP+NVPQQQ YY            IQH Q Q+Q      +   Q SQT  PPPQQF+QY QQW      
Subjt:  PSKNHNQQLALALPHQIVPQQH---PPPPAALPENVPQQQPYY------------IQHPQSQHQM----TNAHAQLSQTPPPPPQQFSQYQQQW-----P

Query:  QQPPQQAQPP-QQHPSMQPQIRLPPTSVYSSYSMNQPTSMPET----PPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGY
        QQPPQ  QPP QQ PSMQPQIR  P+SVY SYSMNQPTSMPET     PMQ +FSP+PQPGSSR+DTV YGY GS  T+PQQPPQVKNAF  GP AGEGY
Subjt:  QQPPQQAQPP-QQHPSMQPQIRLPPTSVYSSYSMNQPTSMPET----PPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGY

Query:  LPSGPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPAN--VQIPQHPSGPHVMARNHPNQSHFMRNQN
        LPSGPQ  LSSGG+YM+YDRE+GR               PHH PQPQ         QQPHFNQ  YPPAN  +QIPQ  SGPHV+ARN P+ +H MRNQ+
Subjt:  LPSGPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPAN--VQIPQHPSGPHVMARNHPNQSHFMRNQN

Query:  HPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLSNSG------------GRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSNPVSVA
        HPYGEIV+KLVGMGFRSDHI SVIHRMEESGQPIDFNAVLDGLSN G            GR NSAPK+FDFGSD+ILCS+EDY KQ+ SNGSHS+PVSV 
Subjt:  HPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLSNSG------------GRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSNPVSVA

Query:  NSSKDFHKSRMSTVFPGAA-YGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEV
        NSSKDFHK RMST FP AA YGQPDDSI QDVI+AVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSD+ARDHE              EV
Subjt:  NSSKDFHKSRMSTVFPGAA-YGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEV

Query:  HRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEER---VSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQSYYI
        HRSVQIIRDKQELAETQKDLAKLQV QKEPS SSHSQ+NEER   V++DPKKNEN SEIH QQLALALPHQIVPQQNPI AP SA LP NVP QQQSYYI
Subjt:  HRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEER---VSTDPKKNENPSEIHNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQSYYI

Query:  SSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQT-PQPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMP
        S +QL G QP HIQHA  QYIS D QHRA QPQDVS  TNPQLSQ+ PQPFNQYQQQWAQ PSQ  QPPQQ+SMQPQIRPPPTS YP    PPNQP+S+P
Subjt:  SSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQT-PQPFNQYQQQWAQPPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMP

Query:  ETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHHPSQQSH
        ETLSS+    MSFASIP PGSSR D VPYGYAAASGGS+PQQPPQVKN YGPATGEGY+PPGQ        AYMMYDRESGRPPHH PQQP        H
Subjt:  ETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHHPSQQSH

Query:  FSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRA
        F+QSGYPPANAPHQ+ PQA T P VS+RNPSHSHLIEKLVGMGFRGDHV +IIQRMED G+ VDFN VLDRLS+ A PGPQRA
Subjt:  FSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQRA

XP_022952329.1 class E vacuolar protein-sorting machinery protein hse1-like [Cucurbita moschata] (e value: 2.2e-284; % identity: 98.66)
Query:  SNSGGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISS
        S S GRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHS+PVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISS
Subjt:  SNSGGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISS

Query:  RLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEI
        RLSQLELYCYNLDKSVGEMRSDLARDHEEA+SKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEI
Subjt:  RLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEI

Query:  HNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQ
        HNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQ
Subjt:  HNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQ

Query:  PPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMP
        PPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMP
Subjt:  PPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMP

Query:  PGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSG
        PGQQPALSSGGAYMMYDRESGRPPHHLPQQP HHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHV +IIQRMEDSG
Subjt:  PGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSG

Query:  QTVDFNAVLDRLSTPAGPGPQRAW
        QTVDFNAVLDRLSTPAGPGPQRAW
Subjt:  QTVDFNAVLDRLSTPAGPGPQRAW

XP_022952330.1 trithorax group protein osa-like isoform X1 [Cucurbita moschata] (e value: 5.8e-285; % identity: 99.23)
Query:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS
        MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS
Subjt:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS

Query:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI
        SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQEL ETQKDLAKLHLLQKESSLS HSHSNEERASPGAFDPKKNEI
Subjt:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI

Query:  PSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPP
        PSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQW QQPPQQAQPPQQHPSMQPQIRLPP
Subjt:  PSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPP

Query:  TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHP
        TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRE+GRPPHHP
Subjt:  TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHP

Query:  PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESG
        PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESG
Subjt:  PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESG

Query:  QPIDFNAVLDGLSNSGG
        QPIDFNAVLDGLSNSGG
Subjt:  QPIDFNAVLDGLSNSGG

TrEMBL top hits (columns: e value, % identity, alignment)
A0A6J1GK45 trithorax group protein osa-like isoform X3 (e value: 7.2e-281; % identity: 98.45)
Query:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS
        MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS
Subjt:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS

Query:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI
        SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQEL ETQKDLAKLHLLQKESSLS HSHSNEERASPGAFDPKKNEI
Subjt:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI

Query:  PSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPP
        PSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQW QQPPQQAQPPQQHPSMQPQIRLPP
Subjt:  PSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPP

Query:  TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHP
        TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRE+GR    P
Subjt:  TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHP

Query:  PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESG
        PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESG
Subjt:  PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESG

Query:  QPIDFNAVLDGLSNSGG
        QPIDFNAVLDGLSNSGG
Subjt:  QPIDFNAVLDGLSNSGG

A0A6J1GLD5 class E vacuolar protein-sorting machinery protein hse1-like (e value: 1.1e-284; % identity: 98.66)
Query:  SNSGGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISS
        S S GRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHS+PVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISS
Subjt:  SNSGGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISS

Query:  RLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEI
        RLSQLELYCYNLDKSVGEMRSDLARDHEEA+SKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEI
Subjt:  RLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEI

Query:  HNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQ
        HNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQ
Subjt:  HNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQ

Query:  PPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMP
        PPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMP
Subjt:  PPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMP

Query:  PGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSG
        PGQQPALSSGGAYMMYDRESGRPPHHLPQQP HHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHV +IIQRMEDSG
Subjt:  PGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSG

Query:  QTVDFNAVLDRLSTPAGPGPQRAW
        QTVDFNAVLDRLSTPAGPGPQRAW
Subjt:  QTVDFNAVLDRLSTPAGPGPQRAW

A0A6J1GLG8 trithorax group protein osa-like isoform X1 (e value: 2.8e-285; % identity: 99.23)
Query:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS
        MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS
Subjt:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS

Query:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI
        SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQEL ETQKDLAKLHLLQKESSLS HSHSNEERASPGAFDPKKNEI
Subjt:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI

Query:  PSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPP
        PSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQW QQPPQQAQPPQQHPSMQPQIRLPP
Subjt:  PSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPP

Query:  TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHP
        TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRE+GRPPHHP
Subjt:  TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHP

Query:  PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESG
        PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESG
Subjt:  PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESG

Query:  QPIDFNAVLDGLSNSGG
        QPIDFNAVLDGLSNSGG
Subjt:  QPIDFNAVLDGLSNSGG

A0A6J1HZW1 ataxin-2 homolog (e value: 2.2e-277; % identity: 97.14)
Query:  SNSGGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISS
        S S GRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHS+PVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIA VENSMKKHSDNLLRFLEGISS
Subjt:  SNSGGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVENSMKKHSDNLLRFLEGISS

Query:  RLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEI
        RLSQLELYCYNLDKSVGEMRSDLARDHEEA+SKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEI
Subjt:  RLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPKKNENPSEI

Query:  HNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQ
        HNQQLALALPHQIVPQQNP+T PPSAALPQNVPQQ QSYYISSSQLPGQQPSHIQHAQNQYISSDS HRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQ
Subjt:  HNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQ

Query:  PPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMP
        PPSQPAQPPQQASMQPQIRPPPTSVYPSPY PPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMP
Subjt:  PPSQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMP

Query:  PGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSG
        PGQQPALSSGGAYMMYDRESGRPPHHLPQQP HHPSQQSHF+QSGYPPANAP QVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHV +IIQRMEDSG
Subjt:  PGQQPALSSGGAYMMYDRESGRPPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSG

Query:  QTVDFNAVLDRLSTPAGPGPQRAW
        QTVDFNAVLDRLSTPAGPGPQRAW
Subjt:  QTVDFNAVLDRLSTPAGPGPQRAW

A0A6J1I1G6 RNA-binding protein 33-like (e value: 6.8e-271; % identity: 96.52)
Query:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS
        MASGS GRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS
Subjt:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGIS

Query:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI
        SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQEL ETQKDLAKLHLLQKESSLS HSHSNEERASPGAFDPKKNEI
Subjt:  SRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEI

Query:  PSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPP
        PSKN NQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTN  AQLSQT PPPPQQFSQYQQQW QQPPQQAQPPQQHPSMQPQIRLPP
Subjt:  PSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPP

Query:  TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHP
        TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSR+DTVQYGYVGSAGTMPQ PPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRE+GR    P
Subjt:  TSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHP

Query:  PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESG
        PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNH N SHFMRNQNHPYGEIVDKLVGMGFRSDHI SVIHRMEESG
Subjt:  PHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESG

Query:  QPIDFNAVLDGLSNSGG
        QPIDFNAVLDGLSNSGG
Subjt:  QPIDFNAVLDGLSNSGG

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G01560.1 Protein of unknown function (DUF1421) (e value: 1.1e-15; % identity: 27.39)
Query:  GSKGFDFGTDDVLCSYEDYGNQESSNG--SHSDLSVANSSKDFHKSR-----ISTVYPAAAYGQPED--------SMKQDVIST------VENSMKKYSD
        G  G D   + ++ SY+ +  + ++    SHS L +A S+   + S      +ST  P   +G  +            Q+V +T      ++ +MKK++D
Subjt:  GSKGFDFGTDDVLCSYEDYGNQESSNG--SHSDLSVANSSKDFHKSR-----ISTVYPAAAYGQPED--------SMKQDVIST------VENSMKKYSD

Query:  NILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASP
         +L  +EG+S+RLSQLE   +NL+  V +++  +   H   D K++ L+  L EV   VQ+++DKQE+ E Q              LS H  SN+     
Subjt:  NILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASP

Query:  GAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQ--LSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQ
                   +K H+               H  P A  P  VP QQ      PQ     T A +Q   SQ PP  P QFS  Q+ +   PP   QPP  
Subjt:  GAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQ--LSQTPPPPPQQFSQYQQQWPQQPPQQAQPPQQ

Query:  HPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMI
        +P        PP     + + +QP+   ++PP Q  +   P P S      Q  Y     + P  PP+ +   G  P + + Y P  PQ  +        
Subjt:  HPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMI

Query:  YDRENGRP-PHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNH-PYGEIVDKLVGMGFRS
        YD   GR     P  +  +P      P    +PPH          N +GYP  +   P   + P V A +    S   R+++  P  +++D++  MGF  
Subjt:  YDRENGRP-PHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNH-PYGEIVDKLVGMGFRS

Query:  DHIGSVIHRMEESGQPIDFNAVLDGLSNSGGRP
        D + + + ++ E+GQ +D N VLD L N GG P
Subjt:  DHIGSVIHRMEESGQPIDFNAVLDGLSNSGGRP

AT4G28300.1 Protein of unknown function (DUF1421) (e value: 1.3e-101; % identity: 49.09)
Query:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVA--NSSKDFHKSRI--STVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFL
        MASGS+GR NSGSKGFDFG+DD+LCSY+DY NQ+SSNG HSD ++A  NS+K+FHK+R+  S+V+P ++Y  PEDS+ QD+  TVE +MK Y+DN++RFL
Subjt:  MASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLSVA--NSSKDFHKSRI--STVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFL

Query:  EGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPK
        EG+SSRLSQLEL  YNLDK++GEMRS+L   HE+AD+KL+SL+KHLQEVHRSVQI+RDKQELA+TQK+LAKL L+QKESS S HS   E+R +    +PK
Subjt:  EGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPK

Query:  KNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTP-------------PPPP----------QQFSQYQQ
        K+E  S  HNQQLALALPHQI PQ     P   P+  PQQ  YY+  P +Q Q T A   +S  P             PPPP          Q F QYQQ
Subjt:  KNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTP-------------PPPP----------QQFSQYQQ

Query:  QWPQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQP--TSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGY
         WP QP  + Q    +P+  P    PP         NQP   S+P +  MQ  +S  PQ          YGY   A   PQ PPQ +      PQ G+GY
Subjt:  QWPQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQP--TSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGY

Query:  LPSGPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHP
        LPSGP     SG A  +Y  E GR  + PP   PQ QQ   H  Q  Q   + PQP Q      G PP                         +R++   
Subjt:  LPSGPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHP

Query:  YGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLS--NSGGRP
        YGE+++KLV MGFR DH+ +VI RMEESGQPIDFN +LD LS  +SGG P
Subjt:  YGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLS--NSGGRP

AT4G28300.2 Protein of unknown function (DUF1421) (e value: 2.1e-78; % identity: 46.63)
Query:  STVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQE
        S+V+P ++Y  PEDS+ QD+  TVE +MK Y+DN++RFLEG+SSRLSQLEL  YNLDK++GEMRS+L   HE+AD+KL+SL+KHLQEVHRSVQI+RDKQE
Subjt:  STVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRDKQE

Query:  LAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQL
        LA+TQK+LAKL L+QKESS S HS   E+R +    +PKK+E  S  HNQQLALALPHQI PQ     P   P+  PQQ  YY+  P +Q Q T A   +
Subjt:  LAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQL

Query:  SQTP-------------PPPP----------QQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQP--TSMPETPPMQVSFSPIPQPG
        S  P             PPPP          Q F QYQQ WP QP  + Q    +P+  P    PP         NQP   S+P +  MQ  +S  PQ  
Subjt:  SQTP-------------PPPP----------QQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQP--TSMPETPPMQVSFSPIPQPG

Query:  SSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHF
                YGY   A   PQ PPQ +      PQ G+GYLPSGP     SG A  +Y  E GR  + PP   PQ QQ   H  Q  Q   + PQP Q   
Subjt:  SSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHF

Query:  NQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLS--NSGGRP
           G PP                         +R++   YGE+++KLV MGFR DH+ +VI RMEESGQPIDFN +LD LS  +SGG P
Subjt:  NQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLS--NSGGRP

AT5G14540.1 Protein of unknown function (DUF1421) (e value: 1.4e-18; % identity: 28.4)
Query:  SDLSVANSSKDFHKSRISTVYPAAAYGQPE-DSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLE
        SD    ++S       + ++ P+  + + + +S +  +IS ++ +MK ++D +L  +EG+S+RL+QLE    +L+  V +++  +   H + D KL+ LE
Subjt:  SDLSVANSSKDFHKSRISTVYPAAAYGQPE-DSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLE

Query:  KHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPY
          + EV   VQ+++DKQE+ E Q  L+KL L +       HS   E  A P A  P+                      P     PP+   + +P QQ  
Subjt:  KHLQEVHRSVQIIRDKQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPY

Query:  YIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQ-QAQPPQQHPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDT
        +IQ P SQH ++    QL    P  P QFS   QQ P  PP  Q+QPP   P++QP  + PP     + S++QP   P  PP Q  +   P P       
Subjt:  YIQHPQSQHQMTNAHAQLSQTPPPPPQQFSQYQQQWPQQPPQ-QAQPPQQHPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDT

Query:  VQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAY--------MIYDRENGR-----PPHHPPHHPPQPQQPPHHPPQPQQPPHHPP
        +Q+     +G  P++PP  + ++   P       PS P  G +    Y         +YD   GR     P  + P   P    P  +   P   P H  
Subjt:  VQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPSGPQSGLSSGGAY--------MIYDRENGR-----PPHHPPHHPPQPQQPPHHPPQPQQPPHHPP

Query:  QPQQPHFNQSGYP--PANVQIPQH-PSGPHVMARNHPNQSHFMRNQNH-PYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLSN
           Q       YP  P    +PQ  P    + +      S   R+ N  P  +++DK+V MGF  D +   +  + E+GQ +D N VLD L N
Subjt:  QPQQPHFNQSGYP--PANVQIPQH-PSGPHVMARNHPNQSHFMRNQNH-PYGEIVDKLVGMGFRSDHIGSVIHRMEESGQPIDFNAVLDGLSN


Sequences
CDS sequence
AAAATCGAAATTCGTTTCTATAAGATTACTCTGAACCAAATCCATGGGGAGTTCATTCGCGGAAAGAGCCATAAGGAGAAACTTTCCTTCTCGATAATCGCAAAGAAAAA
AAAAAAAATCATCTATTTCTCTATCTTTTGTCGTTCTTTGAATCGGTCCCCGTTTTACTTCCGTCTTTCCGATTTCCCACTGCGATCCATGGCGTCTGGTTCAGCTGGTC
GCCCTAATTCGGGCTCTAAGGGGTTTGATTTTGGTACCGATGATGTTCTTTGTTCTTATGAGGACTACGGCAACCAGGAATCTTCCAACGGTAGCCATAGCGATCTCTCC
GTCGCGAATTCTAGCAAGGATTTTCACAAAAGCAGAATATCTACCGTTTATCCTGCTGCTGCTTATGGTCAGCCAGAAGATTCCATGAAACAAGATGTGATTTCTACTGT
TGAGAACAGCATGAAAAAGTATTCTGATAACATTTTGCGTTTTCTCGAGGGAATAAGCTCGCGCCTATCACAACTTGAACTGAATTACTACAACCTTGATAAATCTGTTG
GAGAAATGCGATCTGACTTGATTCGTGACCACGAAGAGGCAGATTTGAAGCTTAAATCTCTCGAGAAGCATCTACAAGAAGTCCACAGGTCTGTGCAGATTATAAGAGAC
AAGCAAGAGCTCGCCGAGACTCAGAAAGACTTAGCCAAACTTCATCTTCTGCAGAAAGAGTCGTCTTTGTCGGGCCATTCGCATTCAAACGAGGAGAGAGCTTCACCTGG
TGCCTTTGATCCTAAGAAGAATGAAATTCCGTCCAAGAATCACAATCAGCAACTAGCTCTTGCCCTGCCGCACCAGATTGTCCCACAGCAACATCCACCTCCTCCAGCAG
CTTTGCCGGAGAATGTGCCGCAACAGCAACCTTATTACATCCAGCATCCTCAGAGCCAACATCAAATGACCAATGCCCATGCCCAGCTAAGTCAAACTCCACCACCACCA
CCACAACAGTTCAGTCAGTATCAACAACAATGGCCGCAGCAGCCACCTCAACAGGCACAACCACCACAACAGCATCCTTCTATGCAACCTCAGATCAGGCTGCCGCCTAC
TTCAGTCTACTCTTCTTATTCGATGAATCAACCGACTTCTATGCCAGAGACTCCGCCTATGCAAGTGTCATTTTCACCTATTCCTCAACCGGGTTCGAGCCGCATGGACA
CCGTGCAATATGGATATGTTGGAAGTGCTGGTACTATGCCCCAGCAACCTCCTCAAGTCAAAAATGCTTTTGGTGGAGGACCACAAGCCGGAGAAGGATATTTACCTTCT
GGACCACAATCTGGGCTTTCCTCGGGAGGTGCATATATGATATATGATAGGGAAAATGGAAGACCACCGCACCATCCACCGCACCATCCTCCGCAACCTCAGCAACCACC
ACACCATCCTCCGCAACCACAACAACCGCCACACCATCCTCCTCAGCCCCAACAACCACACTTCAACCAAAGTGGATACCCTCCGGCCAATGTTCAGATTCCTCAGCATC
CATCAGGTCCGCACGTTATGGCCAGGAATCATCCGAATCAGTCGCATTTTATGCGTAACCAGAACCATCCTTACGGCGAAATAGTTGATAAACTTGTTGGGATGGGTTTC
AGGAGTGACCACATTGGCAGTGTAATTCATAGGATGGAGGAGAGCGGGCAACCTATCGACTTCAACGCTGTTTTAGACGGGTTGAGTAATTCTGGAGGTCGCCCTAATTC
CGCCCCCAAATCCTTTGATTTTGGTTCTGATAATATCCTTTGCTCATTTGAGGACTACACTAAACAGGAACCTTCAAACGGTAGCCATAGCAATCCAGTCTCCGTTGCCA
ATTCTAGCAAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGGTGCTGCATATGGTCAACCAGATGATTCCATTAATCAAGATGTGATTGCTGCTGTTGAGAAC
AGCATGAAAAAGCATTCCGATAACCTTTTGCGTTTTCTCGAGGGAATAAGTTCACGCCTATCACAGCTTGAACTATATTGCTACAACCTTGATAAATCTGTTGGAGAAAT
GCGGTCTGACTTAGCCCGTGACCATGAAGAGGCAGAATCCAAGCTTAAATCTATTGAGAAGCATGTACAAGAGGTTCACAGATCTGTACAGATTATCAGAGACAAGCAAG
AGCTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAGGTCCCACAGAAAGAGCCATCTTTGTCGAGCCATTCGCAGACAAATGAGGAGAGGGTTTCAACCGATCCTAAA
AAGAACGAAAATCCATCTGAGATTCACAACCAGCAATTAGCTTTGGCCTTGCCACATCAGATCGTCCCGCAGCAAAATCCTATTACAGCACCCCCTTCAGCAGCTTTGCC
TCAGAATGTGCCTCAACAACAGCAATCTTACTACATTTCTTCATCCCAATTGCCTGGTCAACAACCATCCCATATCCAGCATGCTCAGAACCAGTATATCTCATCCGACT
CCCAACACCGGGCATCACAACCTCAAGACGTTTCGCAGATGACCAATCCCCAGCTAAGTCAAACTCCACAACCATTTAATCAGTATCAACAACAATGGGCGCAGCCACCA
TCTCAGCCGGCACAACCACCACAACAGGCTTCTATGCAACCTCAGATCAGACCACCGCCTACTTCAGTCTACCCTTCTCCTTACCCCCCACCAAATCAACCAACTTCTAT
GCCAGAGACTCTGTCAAGCAGCATGCCTATGCAGATGTCTTTTGCATCTATTCCTCAACCTGGTTCAAGCCGTGCGGATGCAGTGCCTTATGGGTATGCTGCTGCAAGTG
GTGGTTCTGCTCCACAGCAACCTCCTCAAGTGAAAAATGCTTATGGACCAGCAACAGGCGAGGGCTATATGCCTCCTGGACAACAGCCTGCGCTATCCTCTGGAGGAGCA
TATATGATGTATGATAGGGAAAGCGGAAGACCCCCACACCATCTCCCTCAACAGCCACATCATCATCCGTCTCAGCAATCCCACTTCAGTCAAAGTGGATATCCTCCGGC
CAATGCACCTCATCAGGTTCCTCCTCAAGCTCCAACAGGCCCCCATGTCTCAGCCAGGAATCCAAGCCATTCACATTTAATCGAAAAACTGGTTGGCATGGGTTTCAGGG
GCGACCATGTTGGCAATATAATTCAGAGAATGGAGGACAGTGGGCAAACTGTTGACTTCAACGCAGTTCTAGACAGATTGAGTACTCCTGCAGGTCCAGGGCCACAAAGA
GCGTGGTGA
mRNA sequence
AAAATCGAAATTCGTTTCTATAAGATTACTCTGAACCAAATCCATGGGGAGTTCATTCGCGGAAAGAGCCATAAGGAGAAACTTTCCTTCTCGATAATCGCAAAGAAAAA
AAAAAAAATCATCTATTTCTCTATCTTTTGTCGTTCTTTGAATCGGTCCCCGTTTTACTTCCGTCTTTCCGATTTCCCACTGCGATCCATGGCGTCTGGTTCAGCTGGTC
GCCCTAATTCGGGCTCTAAGGGGTTTGATTTTGGTACCGATGATGTTCTTTGTTCTTATGAGGACTACGGCAACCAGGAATCTTCCAACGGTAGCCATAGCGATCTCTCC
GTCGCGAATTCTAGCAAGGATTTTCACAAAAGCAGAATATCTACCGTTTATCCTGCTGCTGCTTATGGTCAGCCAGAAGATTCCATGAAACAAGATGTGATTTCTACTGT
TGAGAACAGCATGAAAAAGTATTCTGATAACATTTTGCGTTTTCTCGAGGGAATAAGCTCGCGCCTATCACAACTTGAACTGAATTACTACAACCTTGATAAATCTGTTG
GAGAAATGCGATCTGACTTGATTCGTGACCACGAAGAGGCAGATTTGAAGCTTAAATCTCTCGAGAAGCATCTACAAGAAGTCCACAGGTCTGTGCAGATTATAAGAGAC
AAGCAAGAGCTCGCCGAGACTCAGAAAGACTTAGCCAAACTTCATCTTCTGCAGAAAGAGTCGTCTTTGTCGGGCCATTCGCATTCAAACGAGGAGAGAGCTTCACCTGG
TGCCTTTGATCCTAAGAAGAATGAAATTCCGTCCAAGAATCACAATCAGCAACTAGCTCTTGCCCTGCCGCACCAGATTGTCCCACAGCAACATCCACCTCCTCCAGCAG
CTTTGCCGGAGAATGTGCCGCAACAGCAACCTTATTACATCCAGCATCCTCAGAGCCAACATCAAATGACCAATGCCCATGCCCAGCTAAGTCAAACTCCACCACCACCA
CCACAACAGTTCAGTCAGTATCAACAACAATGGCCGCAGCAGCCACCTCAACAGGCACAACCACCACAACAGCATCCTTCTATGCAACCTCAGATCAGGCTGCCGCCTAC
TTCAGTCTACTCTTCTTATTCGATGAATCAACCGACTTCTATGCCAGAGACTCCGCCTATGCAAGTGTCATTTTCACCTATTCCTCAACCGGGTTCGAGCCGCATGGACA
CCGTGCAATATGGATATGTTGGAAGTGCTGGTACTATGCCCCAGCAACCTCCTCAAGTCAAAAATGCTTTTGGTGGAGGACCACAAGCCGGAGAAGGATATTTACCTTCT
GGACCACAATCTGGGCTTTCCTCGGGAGGTGCATATATGATATATGATAGGGAAAATGGAAGACCACCGCACCATCCACCGCACCATCCTCCGCAACCTCAGCAACCACC
ACACCATCCTCCGCAACCACAACAACCGCCACACCATCCTCCTCAGCCCCAACAACCACACTTCAACCAAAGTGGATACCCTCCGGCCAATGTTCAGATTCCTCAGCATC
CATCAGGTCCGCACGTTATGGCCAGGAATCATCCGAATCAGTCGCATTTTATGCGTAACCAGAACCATCCTTACGGCGAAATAGTTGATAAACTTGTTGGGATGGGTTTC
AGGAGTGACCACATTGGCAGTGTAATTCATAGGATGGAGGAGAGCGGGCAACCTATCGACTTCAACGCTGTTTTAGACGGGTTGAGTAATTCTGGAGGTCGCCCTAATTC
CGCCCCCAAATCCTTTGATTTTGGTTCTGATAATATCCTTTGCTCATTTGAGGACTACACTAAACAGGAACCTTCAAACGGTAGCCATAGCAATCCAGTCTCCGTTGCCA
ATTCTAGCAAGGATTTTCACAAGAGTAGAATGTCTACTGTATTCCCTGGTGCTGCATATGGTCAACCAGATGATTCCATTAATCAAGATGTGATTGCTGCTGTTGAGAAC
AGCATGAAAAAGCATTCCGATAACCTTTTGCGTTTTCTCGAGGGAATAAGTTCACGCCTATCACAGCTTGAACTATATTGCTACAACCTTGATAAATCTGTTGGAGAAAT
GCGGTCTGACTTAGCCCGTGACCATGAAGAGGCAGAATCCAAGCTTAAATCTATTGAGAAGCATGTACAAGAGGTTCACAGATCTGTACAGATTATCAGAGACAAGCAAG
AGCTTGCTGAGACTCAAAAAGACTTGGCTAAACTTCAGGTCCCACAGAAAGAGCCATCTTTGTCGAGCCATTCGCAGACAAATGAGGAGAGGGTTTCAACCGATCCTAAA
AAGAACGAAAATCCATCTGAGATTCACAACCAGCAATTAGCTTTGGCCTTGCCACATCAGATCGTCCCGCAGCAAAATCCTATTACAGCACCCCCTTCAGCAGCTTTGCC
TCAGAATGTGCCTCAACAACAGCAATCTTACTACATTTCTTCATCCCAATTGCCTGGTCAACAACCATCCCATATCCAGCATGCTCAGAACCAGTATATCTCATCCGACT
CCCAACACCGGGCATCACAACCTCAAGACGTTTCGCAGATGACCAATCCCCAGCTAAGTCAAACTCCACAACCATTTAATCAGTATCAACAACAATGGGCGCAGCCACCA
TCTCAGCCGGCACAACCACCACAACAGGCTTCTATGCAACCTCAGATCAGACCACCGCCTACTTCAGTCTACCCTTCTCCTTACCCCCCACCAAATCAACCAACTTCTAT
GCCAGAGACTCTGTCAAGCAGCATGCCTATGCAGATGTCTTTTGCATCTATTCCTCAACCTGGTTCAAGCCGTGCGGATGCAGTGCCTTATGGGTATGCTGCTGCAAGTG
GTGGTTCTGCTCCACAGCAACCTCCTCAAGTGAAAAATGCTTATGGACCAGCAACAGGCGAGGGCTATATGCCTCCTGGACAACAGCCTGCGCTATCCTCTGGAGGAGCA
TATATGATGTATGATAGGGAAAGCGGAAGACCCCCACACCATCTCCCTCAACAGCCACATCATCATCCGTCTCAGCAATCCCACTTCAGTCAAAGTGGATATCCTCCGGC
CAATGCACCTCATCAGGTTCCTCCTCAAGCTCCAACAGGCCCCCATGTCTCAGCCAGGAATCCAAGCCATTCACATTTAATCGAAAAACTGGTTGGCATGGGTTTCAGGG
GCGACCATGTTGGCAATATAATTCAGAGAATGGAGGACAGTGGGCAAACTGTTGACTTCAACGCAGTTCTAGACAGATTGAGTACTCCTGCAGGTCCAGGGCCACAAAGA
GCGTGGTGA
Protein sequence
KIEIRFYKITLNQIHGEFIRGKSHKEKLSFSIIAKKKKKIIYFSIFCRSLNRSPFYFRLSDFPLRSMASGSAGRPNSGSKGFDFGTDDVLCSYEDYGNQESSNGSHSDLS
VANSSKDFHKSRISTVYPAAAYGQPEDSMKQDVISTVENSMKKYSDNILRFLEGISSRLSQLELNYYNLDKSVGEMRSDLIRDHEEADLKLKSLEKHLQEVHRSVQIIRD
KQELAETQKDLAKLHLLQKESSLSGHSHSNEERASPGAFDPKKNEIPSKNHNQQLALALPHQIVPQQHPPPPAALPENVPQQQPYYIQHPQSQHQMTNAHAQLSQTPPPP
PQQFSQYQQQWPQQPPQQAQPPQQHPSMQPQIRLPPTSVYSSYSMNQPTSMPETPPMQVSFSPIPQPGSSRMDTVQYGYVGSAGTMPQQPPQVKNAFGGGPQAGEGYLPS
GPQSGLSSGGAYMIYDRENGRPPHHPPHHPPQPQQPPHHPPQPQQPPHHPPQPQQPHFNQSGYPPANVQIPQHPSGPHVMARNHPNQSHFMRNQNHPYGEIVDKLVGMGF
RSDHIGSVIHRMEESGQPIDFNAVLDGLSNSGGRPNSAPKSFDFGSDNILCSFEDYTKQEPSNGSHSNPVSVANSSKDFHKSRMSTVFPGAAYGQPDDSINQDVIAAVEN
SMKKHSDNLLRFLEGISSRLSQLELYCYNLDKSVGEMRSDLARDHEEAESKLKSIEKHVQEVHRSVQIIRDKQELAETQKDLAKLQVPQKEPSLSSHSQTNEERVSTDPK
KNENPSEIHNQQLALALPHQIVPQQNPITAPPSAALPQNVPQQQQSYYISSSQLPGQQPSHIQHAQNQYISSDSQHRASQPQDVSQMTNPQLSQTPQPFNQYQQQWAQPP
SQPAQPPQQASMQPQIRPPPTSVYPSPYPPPNQPTSMPETLSSSMPMQMSFASIPQPGSSRADAVPYGYAAASGGSAPQQPPQVKNAYGPATGEGYMPPGQQPALSSGGA
YMMYDRESGRPPHHLPQQPHHHPSQQSHFSQSGYPPANAPHQVPPQAPTGPHVSARNPSHSHLIEKLVGMGFRGDHVGNIIQRMEDSGQTVDFNAVLDRLSTPAGPGPQR
AW
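
The protein listing above appears to be a direct frame-1 translation of the CDS listing under the standard genetic code: it begins with the residues encoded upstream of the first ATG and ends at the terminal TGA stop. The sketch below is one minimal way to check that relationship. The function name `translate` and the choice of NCBI translation table 1 are assumptions made for illustration, not something the database page states.

```python
# Minimal sketch: translate a nucleotide string in frame 1 with the
# standard genetic code (NCBI translation table 1, assumed here).
# `translate` is a hypothetical helper, not part of any database tooling.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(nucleotides: str) -> str:
    """Translate complete codons from position 0; unknown codons become 'X'."""
    seq = nucleotides.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )


if __name__ == "__main__":
    # First 24 and last 9 nucleotides of the CDS listing above.
    print(translate("AAAATCGAAATTCGTTTCTATAAG"))
    print(translate("GCGTGGTGA"))
```

Running the same function over the full CDS listing, with line breaks removed, and comparing the result (minus the trailing `*`) with the protein listing would confirm or refute the frame-1 assumption for the whole sequence.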