; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0021423 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0021423
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRubredoxin
Genome locationchr05:1409108..1409944
RNA-Seq ExpressionPay0021423
SyntenyPay0021423
Gene Ontology termsGO:0010207 - photosystem II assembly (biological process)
GO:0022900 - electron transport chain (biological process)
GO:0043448 - alkane catabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0009055 - electron transfer activity (molecular function)
InterPro domainsIPR018527 - Rubredoxin, iron-binding site
IPR024934 - Rubredoxin-like domain
IPR024935 - Rubredoxin domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33910.1 rubredoxin [Cucumis melo subsp. melo]1.4e-10599Show/hide
Query:  MAVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVL
        MAVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPS  PPPVVAVEKEEEKFDKRRLEERFAVL
Subjt:  MAVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVL

Query:  NTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
        NTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
Subjt:  NTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ

XP_004139218.2 uncharacterized protein LOC101219075 [Cucumis sativus]3.0e-9793.1Show/hide
Query:  MAVYSSARP-TLSFHLSQPSLPIPRFNFKPPVAAT--IPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERF
        MAVYSSARP TLSFHLSQPSLPIPRFNFKPP+ AT  IPPL R SAARPISIFTLNSIDVSKEDKPTSDDPNT  P P PVVAVE+EEEKFDKRRLEE+F
Subjt:  MAVYSSARP-TLSFHLSQPSLPIPRFNFKPPVAAT--IPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERF

Query:  AVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGY
        AVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSL FFFALFLSGY
Subjt:  AVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGY

Query:  FLQ
        FLQ
Subjt:  FLQ

XP_016901839.1 PREDICTED: uncharacterized protein LOC103499548 [Cucumis melo]3.6e-15199.28Show/hide
Query:  MSKNKAQLNTLQSSSFPTTTPYPPPPLSLTEAKTSIYPFPIPWILLILLLLLLPSEFPFFFKHTYENQNLQTQIPPPSMAVYSSARPTLSFHLSQPSLPI
        MSKNKAQLNTLQSSSFPTTTPYPPPPLSLTEAKTSIYPFPIPWILLILLLLLLPSEFPFFFKHTYENQNLQTQIPPPSMAVYSSARPTLSFHLSQPSLPI
Subjt:  MSKNKAQLNTLQSSSFPTTTPYPPPPLSLTEAKTSIYPFPIPWILLILLLLLLPSEFPFFFKHTYENQNLQTQIPPPSMAVYSSARPTLSFHLSQPSLPI

Query:  PRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVLNTGIYECRSCGHKFDEAVGDPT
        PRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPS  PPPVVAVEKEEEKFDKRRLEERFAVLNTGIYECRSCGHKFDEAVGDPT
Subjt:  PRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVLNTGIYECRSCGHKFDEAVGDPT

Query:  YPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
        YPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
Subjt:  YPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ

XP_022946609.1 uncharacterized protein LOC111450624 [Cucurbita moschata]1.8e-8683.92Show/hide
Query:  AVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVLN
        A Y+SARPTLSFHLSQPS+P PRFNFKPP+ A  PPLQR +AARPI IFTLNSIDVSKEDKPTSDDP+  +       A    EEK D+RR+EE+FAVLN
Subjt:  AVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVLN

Query:  TGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
        TGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYG+L FFFALFLSGYFLQ
Subjt:  TGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ

XP_038891199.1 rubredoxin [Benincasa hispida]1.2e-9689.42Show/hide
Query:  MAVYSSARPTLSFHLSQPSLPIPRFNFKPPVAAT--------IPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRR
        MA YSSARPTLSFHLSQPSLPIPRFNFKPP +A+        +PPLQR +AARP SIFTLNSIDVSKEDKPTSDDPNTPS  PPPVVAV+ EEEKFDKRR
Subjt:  MAVYSSARPTLSFHLSQPSLPIPRFNFKPPVAAT--------IPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRR

Query:  LEERFAVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFAL
        LEE+FAVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSL FFFAL
Subjt:  LEERFAVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFAL

Query:  FLSGYFLQ
        FLSGYFLQ
Subjt:  FLSGYFLQ

TrEMBL top hitse value%identityAlignment
A0A0A0LJE4 Rubredoxin-like domain-containing protein1.5e-9793.1Show/hide
Query:  MAVYSSARP-TLSFHLSQPSLPIPRFNFKPPVAAT--IPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERF
        MAVYSSARP TLSFHLSQPSLPIPRFNFKPP+ AT  IPPL R SAARPISIFTLNSIDVSKEDKPTSDDPNT  P P PVVAVE+EEEKFDKRRLEE+F
Subjt:  MAVYSSARP-TLSFHLSQPSLPIPRFNFKPPVAAT--IPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERF

Query:  AVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGY
        AVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSL FFFALFLSGY
Subjt:  AVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGY

Query:  FLQ
        FLQ
Subjt:  FLQ

A0A1S4E0U3 uncharacterized protein LOC1034995481.8e-15199.28Show/hide
Query:  MSKNKAQLNTLQSSSFPTTTPYPPPPLSLTEAKTSIYPFPIPWILLILLLLLLPSEFPFFFKHTYENQNLQTQIPPPSMAVYSSARPTLSFHLSQPSLPI
        MSKNKAQLNTLQSSSFPTTTPYPPPPLSLTEAKTSIYPFPIPWILLILLLLLLPSEFPFFFKHTYENQNLQTQIPPPSMAVYSSARPTLSFHLSQPSLPI
Subjt:  MSKNKAQLNTLQSSSFPTTTPYPPPPLSLTEAKTSIYPFPIPWILLILLLLLLPSEFPFFFKHTYENQNLQTQIPPPSMAVYSSARPTLSFHLSQPSLPI

Query:  PRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVLNTGIYECRSCGHKFDEAVGDPT
        PRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPS  PPPVVAVEKEEEKFDKRRLEERFAVLNTGIYECRSCGHKFDEAVGDPT
Subjt:  PRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVLNTGIYECRSCGHKFDEAVGDPT

Query:  YPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
        YPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
Subjt:  YPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ

A0A5A7VDM5 Rubredoxin6.6e-10699Show/hide
Query:  MAVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVL
        MAVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPS  PPPVVAVEKEEEKFDKRRLEERFAVL
Subjt:  MAVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVL

Query:  NTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
        NTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
Subjt:  NTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ

A0A6J1G4B1 uncharacterized protein LOC1114506248.9e-8783.92Show/hide
Query:  AVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVLN
        A Y+SARPTLSFHLSQPS+P PRFNFKPP+ A  PPLQR +AARPI IFTLNSIDVSKEDKPTSDDP+  +       A    EEK D+RR+EE+FAVLN
Subjt:  AVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVLN

Query:  TGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
        TGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYG+L FFFALFLSGYFLQ
Subjt:  TGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ

E5GBR8 Rubredoxin6.6e-10699Show/hide
Query:  MAVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVL
        MAVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPS  PPPVVAVEKEEEKFDKRRLEERFAVL
Subjt:  MAVYSSARPTLSFHLSQPSLPIPRFNFKPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVL

Query:  NTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
        NTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
Subjt:  NTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ

SwissProt top hitse value%identityAlignment
P00270 Rubredoxin1.3e-1054Show/hide
Query:  IYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESK
        IY C  CG+++D A GDP   I PG  FE LP+DW CP CGA+K  FE +
Subjt:  IYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESK

P73068 Rubredoxin1.0e-1842.73Show/hide
Query:  KRRLEERFAVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIA--GFAQNQQYGLGGNTLTSGQKAVLIYGSLL
        +R  E+  A L +  +ECR+CG+ +  + GD    ++PG PFE LP +W+CP CGA +++F S     A  GFA+N  YG G N ++ G+K +LI+GSL 
Subjt:  KRRLEERFAVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIA--GFAQNQQYGLGGNTLTSGQKAVLIYGSLL

Query:  FFFALFLSGY
          F  FLS Y
Subjt:  FFFALFLSGY

Q9FDN6 High molecular weight rubredoxin2.2e-1040Show/hide
Query:  SPPPPPVVAVEKEEEKFDKRRLEERFAVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFE
        +P   P   V +E++K            L +  Y+C  C + +D   GDP + IAPG PF  LPEDW CP CGA K  FE
Subjt:  SPPPPPVVAVEKEEEKFDKRRLEERFAVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFE

Q9WWN1 Rubredoxin1.9e-1745.79Show/hide
Query:  AVLNTGI--YECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFES--KSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALF
        AV NT +  +ECRSCG+ ++   GD  + IAP  PF +LP +WRCP C A K+ F +   +   +GF +N  YGLG N LT  QK +LI+G+L   F  F
Subjt:  AVLNTGI--YECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFES--KSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALF

Query:  LSGYFLQ
        +S Y LQ
Subjt:  LSGYFLQ

Q9XBL8 Rubredoxin1.1e-1745.79Show/hide
Query:  AVLNTGI--YECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFES--KSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALF
        AV NT +  +ECRSCG+ ++   GD  + IAP  PF +LP +WRCP C A K+ F +   +   +GF +N  YGLG N LT  QK +LI+G+L   F  F
Subjt:  AVLNTGI--YECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFES--KSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALF

Query:  LSGYFLQ
        +S Y LQ
Subjt:  LSGYFLQ

Arabidopsis top hitse value%identityAlignment
AT1G54500.1 Rubredoxin-like superfamily protein3.5e-5170.8Show/hide
Query:  DDPNTP--SPPPPPVVAVEKEEEKFDKRRLEERFAVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQN
        DD  TP  S         ++ E+  +KRR+EE+FAVLNTGIYECRSCG+K+DE+ GDP+YPI PG  F++LPEDWRCPTCGAA+SFFESK VEIAGFAQN
Subjt:  DDPNTP--SPPPPPVVAVEKEEEKFDKRRLEERFAVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFEQLPEDWRCPTCGAAKSFFESKSVEIAGFAQN

Query:  QQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ
        QQYGLGGN LTSGQK  LI+GSLL FFALFLSGYF+Q
Subjt:  QQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAAAAATAAAGCCCAATTGAACACACTCCAATCTTCTTCCTTCCCCACCACCACACCTTATCCTCCACCTCCTCTTTCTCTCACAGAAGCGAAAACCAGC
ATATACCCTTTCCCCATTCCATGGATCCTTCTGATTCTTCTTCTTCTACTTCTTCCTTCTGAATTTCCCTTCTTTTTCAAACACACATACGAAAATCAAAACCTT
CAAACACAAATTCCACCTCCATCCATGGCTGTTTATTCCTCAGCTAGACCCACTCTCTCCTTCCATCTCTCCCAACCTTCCCTTCCAATTCCCAGATTCAACTTC
AAACCCCCCGTCGCCGCCACCATTCCGCCGTTGCAGAGATTATCTGCGGCGAGACCCATCTCCATTTTCACCCTCAACTCTATCGACGTCTCTAAGGAGGACAAA
CCCACTTCTGATGACCCAAATACGCCGTCGCCGCCGCCGCCGCCGGTTGTGGCGGTTGAAAAGGAGGAGGAGAAGTTTGATAAGCGACGGCTGGAGGAGAGATTT
GCAGTGTTGAATACAGGAATATATGAGTGCAGGTCATGTGGACATAAGTTCGATGAGGCCGTGGGGGATCCGACGTATCCAATAGCGCCGGGACTGCCGTTTGAG
CAGCTGCCGGAGGACTGGCGGTGTCCGACGTGTGGGGCGGCGAAGAGCTTCTTTGAGAGTAAGAGTGTGGAGATTGCTGGGTTTGCTCAGAATCAGCAGTATGGA
CTTGGAGGAAACACTTTGACTTCTGGGCAGAAGGCTGTGCTTATATATGGAAGCTTGTTGTTCTTCTTTGCTCTATTCCTCTCCGGCTATTTCTTGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTAAAAATAAAGCCCAATTGAACACACTCCAATCTTCTTCCTTCCCCACCACCACACCTTATCCTCCACCTCCTCTTTCTCTCACAGAAGCGAAAACCAGC
ATATACCCTTTCCCCATTCCATGGATCCTTCTGATTCTTCTTCTTCTACTTCTTCCTTCTGAATTTCCCTTCTTTTTCAAACACACATACGAAAATCAAAACCTT
CAAACACAAATTCCACCTCCATCCATGGCTGTTTATTCCTCAGCTAGACCCACTCTCTCCTTCCATCTCTCCCAACCTTCCCTTCCAATTCCCAGATTCAACTTC
AAACCCCCCGTCGCCGCCACCATTCCGCCGTTGCAGAGATTATCTGCGGCGAGACCCATCTCCATTTTCACCCTCAACTCTATCGACGTCTCTAAGGAGGACAAA
CCCACTTCTGATGACCCAAATACGCCGTCGCCGCCGCCGCCGCCGGTTGTGGCGGTTGAAAAGGAGGAGGAGAAGTTTGATAAGCGACGGCTGGAGGAGAGATTT
GCAGTGTTGAATACAGGAATATATGAGTGCAGGTCATGTGGACATAAGTTCGATGAGGCCGTGGGGGATCCGACGTATCCAATAGCGCCGGGACTGCCGTTTGAG
CAGCTGCCGGAGGACTGGCGGTGTCCGACGTGTGGGGCGGCGAAGAGCTTCTTTGAGAGTAAGAGTGTGGAGATTGCTGGGTTTGCTCAGAATCAGCAGTATGGA
CTTGGAGGAAACACTTTGACTTCTGGGCAGAAGGCTGTGCTTATATATGGAAGCTTGTTGTTCTTCTTTGCTCTATTCCTCTCCGGCTATTTCTTGCAATGA
Protein sequenceShow/hide protein sequence
MSKNKAQLNTLQSSSFPTTTPYPPPPLSLTEAKTSIYPFPIPWILLILLLLLLPSEFPFFFKHTYENQNLQTQIPPPSMAVYSSARPTLSFHLSQPSLPIPRFNF
KPPVAATIPPLQRLSAARPISIFTLNSIDVSKEDKPTSDDPNTPSPPPPPVVAVEKEEEKFDKRRLEERFAVLNTGIYECRSCGHKFDEAVGDPTYPIAPGLPFE
QLPEDWRCPTCGAAKSFFESKSVEIAGFAQNQQYGLGGNTLTSGQKAVLIYGSLLFFFALFLSGYFLQ