; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0023730 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0023730
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionGATA transcription factor 16-like isoform X1
Genome locationchr01:25176991..25178731
RNA-Seq ExpressionPI0023730
SyntenyPI0023730
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046673.1 uncharacterized protein E6C27_scaffold427G00400 [Cucumis melo var. makuwa]1.1e-11889.56Show/hide
Query:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRL
        MQSRLRAIAP+SNWALWVTQ QGLRRGGLTT RTADPSVHANDD  NDP+VLSGEPERSQDNLEPDNAKANYEGR+DPKQG  NGPFG  KAQHASSPRL
Subjt:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRL

Query:  ETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEE-EDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGR
        ETTVVGQASKPITQQKRAHS+AIDDVSCIGVYGGPLE+GKE++TTEMK++E+EE+EEE ED RDYYKHHKASPLAEIEFVDTRKPITRATDGTA  G+G+
Subjt:  ETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEE-EDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGR

Query:  TVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        TVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  TVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

XP_008451516.1 PREDICTED: uncharacterized protein LOC103492778 [Cucumis melo]1.7e-11990.4Show/hide
Query:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRL
        MQSRLRAIAP+SNWALWVTQFQGLRRGGLTT RTADPSVHANDD  NDP+VLSGEPERSQDNLEPDNAKANYEGR+DPKQGD NGPFG  KAQHASSPRL
Subjt:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRL

Query:  ETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEM--KEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEG
        ETTVVGQASKPITQQKRAHS+AIDDVSCIGVYGGPLE+GKE++TTEM  KEEE+EE EE ED RDYYKHHKASPLAEIEFVDTRKPITRATDGTA  G+G
Subjt:  ETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEM--KEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEG

Query:  RTVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        +TVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  RTVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

XP_011659381.1 uncharacterized protein LOC105436175 [Cucumis sativus]1.1e-11387.65Show/hide
Query:  MQSRLRAIAPKSNWALWVTQFQGLRRGG--LTTCRTADPSVHAN---DDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASS
        MQS LRAIAPKSNWA WVTQFQ LRRGG  LTT RTADPS+HAN   DDNDPAVLSGEPERSQDNLEPDNAKANY+ R+DPKQGD  GPFG P AQHASS
Subjt:  MQSRLRAIAPKSNWALWVTQFQGLRRGG--LTTCRTADPSVHAN---DDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASS

Query:  PRLETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGE
        PRLETTVVGQASKPITQQKRAHS  IDDVSCIGVYGGPLEQGKEN+TTEM       KEEEED RDYYKHHKASPLAEIEF DTRKPITRATDGTAYDGE
Subjt:  PRLETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGE

Query:  GRTVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
          TVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
Subjt:  GRTVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

XP_038899333.1 uncharacterized protein LOC120086662 isoform X1 [Benincasa hispida]1.8e-10883.74Show/hide
Query:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRLET
        MQSRL AIAPKSNWAL + QFQ LRR  LTT RTADPSVHANDDNDPAVLSGEPE SQDNLEPDN KANYE R+DPK GD NGPFG PKAQHASSPRLET
Subjt:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRLET

Query:  TVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGRTVI
         VVGQASKPITQQKR  S   D+VSCIGVYGGPLE+GKEN+TTEM       KE+EED RDYYKHHKASPLAEIEF DTRKPITRATDGTAYDG+G+ VI
Subjt:  TVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGRTVI

Query:  GWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        GWLPEQ+DTVDDSLRRATEIWKQNAMRGDPDAPQSR+LRALRGE+F
Subjt:  GWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

XP_038899334.1 uncharacterized protein LOC120086662 isoform X2 [Benincasa hispida]6.2e-10682.93Show/hide
Query:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRLET
        MQSRL AIAPKSNWAL + QFQ LRR  LTT RTADPSVHANDDNDPAVLSGEPE   DNLEPDN KANYE R+DPK GD NGPFG PKAQHASSPRLET
Subjt:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRLET

Query:  TVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGRTVI
         VVGQASKPITQQKR  S   D+VSCIGVYGGPLE+GKEN+TTEM       KE+EED RDYYKHHKASPLAEIEF DTRKPITRATDGTAYDG+G+ VI
Subjt:  TVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGRTVI

Query:  GWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        GWLPEQ+DTVDDSLRRATEIWKQNAMRGDPDAPQSR+LRALRGE+F
Subjt:  GWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

TrEMBL top hitse value%identityAlignment
A0A0A0K9G7 Uncharacterized protein1.8e-12287.22Show/hide
Query:  KTSPPDINVWKRSGRMQSRLRAIAPKSNWALWVTQFQGLRRGG--LTTCRTADPSVHAN---DDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGD
        +TSPPD NVWKR+GRMQS LRAIAPKSNWA WVTQFQ LRRGG  LTT RTADPS+HAN   DDNDPAVLSGEPERSQDNLEPDNAKANY+ R+DPKQGD
Subjt:  KTSPPDINVWKRSGRMQSRLRAIAPKSNWALWVTQFQGLRRGG--LTTCRTADPSVHAN---DDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGD

Query:  LNGPFGAPKAQHASSPRLETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTR
          GPFG P AQHASSPRLETTVVGQASKPITQQKRAHS  IDDVSCIGVYGGPLEQGKEN+TTEM       KEEEED RDYYKHHKASPLAEIEF DTR
Subjt:  LNGPFGAPKAQHASSPRLETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTR

Query:  KPITRATDGTAYDGEGRTVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        KPITRATDGTAYDGE  TVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
Subjt:  KPITRATDGTAYDGEGRTVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

A0A1S3BR22 uncharacterized protein LOC1034927788.2e-12090.4Show/hide
Query:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRL
        MQSRLRAIAP+SNWALWVTQFQGLRRGGLTT RTADPSVHANDD  NDP+VLSGEPERSQDNLEPDNAKANYEGR+DPKQGD NGPFG  KAQHASSPRL
Subjt:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRL

Query:  ETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEM--KEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEG
        ETTVVGQASKPITQQKRAHS+AIDDVSCIGVYGGPLE+GKE++TTEM  KEEE+EE EE ED RDYYKHHKASPLAEIEFVDTRKPITRATDGTA  G+G
Subjt:  ETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEM--KEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEG

Query:  RTVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        +TVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  RTVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

A0A5D3D3D5 Uncharacterized protein5.3e-11989.56Show/hide
Query:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRL
        MQSRLRAIAP+SNWALWVTQ QGLRRGGLTT RTADPSVHANDD  NDP+VLSGEPERSQDNLEPDNAKANYEGR+DPKQG  NGPFG  KAQHASSPRL
Subjt:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDD--NDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRL

Query:  ETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEE-EDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGR
        ETTVVGQASKPITQQKRAHS+AIDDVSCIGVYGGPLE+GKE++TTEMK++E+EE+EEE ED RDYYKHHKASPLAEIEFVDTRKPITRATDGTA  G+G+
Subjt:  ETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEE-EDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGR

Query:  TVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        TVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  TVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

A0A6J1GPT4 uncharacterized protein LOC1114563884.8e-9676.42Show/hide
Query:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRLET
        M SRL AIA K NW   + QFQ LRR GLTTCRTADPSVHANDDN PAV SGEPE+SQDNLEPD+AK+NYE R+D KQGD NGPF  PKAQ+ASSPRLET
Subjt:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRLET

Query:  TVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGRTVI
        T V QASKPITQQKRAHS  + DVSCIG  GGP          E K  EK+ KE+++D R+YYKHHKASPLAEIEFVDTRKPITRATDGTAYDG G+ +I
Subjt:  TVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGRTVI

Query:  GWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
        GWLPEQ DTVDDSL+RATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  GWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

A0A6J1JL82 uncharacterized protein LOC1114879531.4e-9577.24Show/hide
Query:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRLET
        M SRL AIA K NWA  + QFQ LRR GLTTCRTADPSVHANDDN PAV SGEPE+SQDNLEPD AKANY   +D KQGD NGPF  PKAQ+ASSPRLET
Subjt:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRLET

Query:  TVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGRTVI
        T V QASKPITQQKRAHS  + DVSCIG  GGP  +       E K  EK+ KE+EED R+YYKHHKASPLAEIEF DTRKPITRATDGTAYDG G+ VI
Subjt:  TVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGEGRTVI

Query:  GWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF
         WLPEQ DTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGE+F
Subjt:  GWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02700.1 unknown protein1.0e-4546.83Show/hide
Query:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHA-NDDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRLE
        MQSRL A A  +   +     + L  G  T+ RTADP +HA ND  DPA+   +PE   D   P  A A     + P+      P   PK+  A++ +LE
Subjt:  MQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHA-NDDNDPAVLSGEPERSQDNLEPDNAKANYEGREDPKQGDLNGPFGAPKAQHASSPRLE

Query:  TTVVGQASKPITQQKRAHSIA----IDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGE
        +T VG  S+P  QQKR +S A    +D VSC G+ G P  + +     E++E+ + E E E D +++YKHHKASPL+EIEF DTRKPIT+ATDGTAY   
Subjt:  TTVVGQASKPITQQKRAHSIA----IDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRKPITRATDGTAYDGE

Query:  GRTVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDA-PQSRVLRALRGEEF
        G+ VIGWLPEQ+DT ++SL +AT I+K+NA RGDP+  P SR+LR +RGE F
Subjt:  GRTVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDA-PQSRVLRALRGEEF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACGTAACCGGCTCGCCACGTGTTATCTTTTCAGTCTCATCTCTAGAAACTCCTTATTTTTTCCTTTTAAGACATCGCCGCCGGATATCAACGTTTGGAAACGAAG
TGGAAGGATGCAATCGAGACTGAGAGCGATCGCACCGAAATCGAATTGGGCCTTGTGGGTGACCCAATTCCAAGGCCTCCGCCGAGGTGGTCTGACTACATGTCGTACTG
CTGACCCTTCCGTTCACGCCAACGACGACAATGACCCCGCCGTTTTATCCGGTGAACCTGAGAGATCACAAGACAATTTAGAGCCAGATAATGCGAAAGCCAATTACGAA
GGGAGGGAGGACCCTAAACAAGGAGATTTAAATGGACCATTTGGGGCACCCAAGGCCCAACACGCCTCCTCCCCTCGTCTAGAAACAACAGTAGTGGGCCAGGCCTCGAA
GCCCATTACTCAACAAAAGAGAGCCCACAGTATAGCGATCGACGACGTGAGTTGTATCGGCGTCTATGGCGGGCCTCTGGAGCAGGGGAAAGAAAACAAAACAACTGAAA
TGAAAGAAGAAGAAAAAGAAGAAAAAGAAGAAGAAGAAGACTATAGAGATTATTACAAGCACCACAAGGCGTCTCCGTTAGCAGAGATCGAGTTTGTGGATACGCGTAAG
CCGATAACGAGAGCGACGGACGGGACGGCGTACGATGGGGAGGGAAGGACTGTGATTGGCTGGTTGCCAGAGCAGGTGGATACGGTGGACGATTCGCTCCGGAGAGCAAC
GGAGATTTGGAAACAAAATGCAATGCGTGGAGATCCTGATGCTCCACAGTCGAGAGTTCTTAGGGCTTTGCGTGGGGAAGAGTTTTAA
mRNA sequenceShow/hide mRNA sequence
TTCTCGAGAAATGAAGGATGGGAGTTTGTTCAGTTCTGAAGATTATTCCCCATTGTTCTAGGCGCATCTCGACAAAGACACAAACAAAGATAACAATAAAATAATCATTA
ATTAAAAACAAAATTAGAATAACCCAGAACTTACTACTCAAAAGTACACGTGGGTGCTGTCCTGTACATGCCACGTAACCGGCTCGCCACGTGTTATCTTTTCAGTCTCA
TCTCTAGAAACTCCTTATTTTTTCCTTTTAAGACATCGCCGCCGGATATCAACGTTTGGAAACGAAGTGGAAGGATGCAATCGAGACTGAGAGCGATCGCACCGAAATCG
AATTGGGCCTTGTGGGTGACCCAATTCCAAGGCCTCCGCCGAGGTGGTCTGACTACATGTCGTACTGCTGACCCTTCCGTTCACGCCAACGACGACAATGACCCCGCCGT
TTTATCCGGTGAACCTGAGAGATCACAAGACAATTTAGAGCCAGATAATGCGAAAGCCAATTACGAAGGGAGGGAGGACCCTAAACAAGGAGATTTAAATGGACCATTTG
GGGCACCCAAGGCCCAACACGCCTCCTCCCCTCGTCTAGAAACAACAGTAGTGGGCCAGGCCTCGAAGCCCATTACTCAACAAAAGAGAGCCCACAGTATAGCGATCGAC
GACGTGAGTTGTATCGGCGTCTATGGCGGGCCTCTGGAGCAGGGGAAAGAAAACAAAACAACTGAAATGAAAGAAGAAGAAAAAGAAGAAAAAGAAGAAGAAGAAGACTA
TAGAGATTATTACAAGCACCACAAGGCGTCTCCGTTAGCAGAGATCGAGTTTGTGGATACGCGTAAGCCGATAACGAGAGCGACGGACGGGACGGCGTACGATGGGGAGG
GAAGGACTGTGATTGGCTGGTTGCCAGAGCAGGTGGATACGGTGGACGATTCGCTCCGGAGAGCAACGGAGATTTGGAAACAAAATGCAATGCGTGGAGATCCTGATGCT
CCACAGTCGAGAGTTCTTAGGGCTTTGCGTGGGGAAGAGTTTTAAATGTGGAGTGGGTTTAAAATTAATAAAGGGTTGTAAGATGTGTTGAAGTTTTGATCTAAGAACTT
GGTTTTACGAGTATAAGAGCGAAACTTACCAAAAAGTTCTATTAAAAATTGATATTTTGTAAATACCTAGAATTTTACGATGTAGAGTTCAATTTTATATAATTTTGAAA
GACAGGTATTTATGTTGGTGACCCTACTGGGAATCGATCCTCCACAATTTGTTGTTGTCGATAGCTGGAAAGCCAGAAAAGGGAGGAATTAATGGAATAGGTGGCACGGT
GCCCTTTATACCCAACCTCTTATACCTGATTCCGATTCTCAGCCCCGATGTGGGACAACAAGTTGATTCCCCAACGGCAAATTAGGACATAATGGTGGATTGAGACCTTA
GATTTGGGTCTATTAAAACTTATGTAATGCTCTATATTCGAGTAGTTAGTTAAAAAGAAAAAAAATTGGAAGTTAATACCCATGGAGCAATTCTATGATGGCAAGTAATT
CTACTTGGTTTGCAAGTGAATGGAAAATCTTATATCGTATACGGGCAGCACATTTATGTGTGTATACAAATAGTCCGTCTTAATAAGTTTATGCTGATAT
Protein sequenceShow/hide protein sequence
MPRNRLATCYLFSLISRNSLFFPFKTSPPDINVWKRSGRMQSRLRAIAPKSNWALWVTQFQGLRRGGLTTCRTADPSVHANDDNDPAVLSGEPERSQDNLEPDNAKANYE
GREDPKQGDLNGPFGAPKAQHASSPRLETTVVGQASKPITQQKRAHSIAIDDVSCIGVYGGPLEQGKENKTTEMKEEEKEEKEEEEDYRDYYKHHKASPLAEIEFVDTRK
PITRATDGTAYDGEGRTVIGWLPEQVDTVDDSLRRATEIWKQNAMRGDPDAPQSRVLRALRGEEF