CuGenDBv2

PI0020051 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0020051
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr05:8532433..8534654
RNA-Seq Expression: PI0020051
Synteny: PI0020051
Gene Ontology terms: NA
InterPro domains: IPR025558 - Domain of unknown function DUF4283


Homology
GenBank top hits    e value    % identity    Alignment
KAA0062320.1 DUF4283 domain-containing protein [Cucumis melo var. makuwa]    1.0e-64    41.26
Query:  RRPIWRGINPKGLQREP--NSGEGLTRMRSMAAAAHGSFGEGNLEPVGSIKRVGSPEIGSEKV---TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGP
        R PI  G+ P+G + +    +G  + R+R  A       G G+L        VG  E G   V   +AE  G P         GP    + L G    G 
Subjt:  RRPIWRGINPKGLQREP--NSGEGLTRMRSMAAAAHGSFGEGNLEPVGSIKRVGSPEIGSEKV---TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGP

Query:  KSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRVVVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVG
        K+DG  NG  +D   L A+      D + P         NT  AG          V+  P +    +     K   ++      EG    LPYT P  +G
Subjt:  KSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRVVVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVG

Query:  SKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP------
         K+VV+P+E +I QG+R+WENSLVGQL+DA LPY+VIQRL+EKI GKIEMP IT+LEN LICFQFRR  S+EWILSRGPWHL GKPML  KW P      
Subjt:  SKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP------

Query:  --------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAV
                                              DLATKERRRLSYARVCVE+EGG+++P+E+TVNLRGVE +V V YEWKPR+CN C +FGH + 
Subjt:  --------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAV

Query:  RGSEIQGSPSRQ
        + S  + S + Q
Subjt:  RGSEIQGSPSRQ

TYK18951.1 uncharacterized protein E5676_scaffold418G00380 [Cucumis melo var. makuwa]    7.1e-58    41.05
Query:  PNSFGPNYDGLLLDGKQAVGPKSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFE-GNTGKAG--GPKVVGPSRVVVGGPVSKDKIYEGGSVK---
        P+  GP+ +     G +  G + DG  +    ++ ++          K+  +SS  + E G+   +G   P  V    V+     +   + + G  K   
Subjt:  PNSFGPNYDGLLLDGKQAVGPKSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFE-GNTGKAG--GPKVVGPSRVVVGGPVSKDKIYEGGSVK---

Query:  EAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEW
        ++ S   ++        L YT P ++G K+VV P E +I QG+++WENSLVGQL+D+ LPY VIQ L+EKIWGKIEMP IT+LEN LICFQFRR  S+EW
Subjt:  EAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEW

Query:  ILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRG
        ILSRGPWHL  K MLL KW P                                            DLATKERRRLSYARVCVE+E G+++P E+TV+LRG
Subjt:  ILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRG

Query:  VECSVLVTYEWKPRRCNSCHSFGH
        V+ +V V YEWKPR+CN C +FGH
Subjt:  VECSVLVTYEWKPRRCNSCHSFGH

TYK26656.1 DUF4283 domain-containing protein [Cucumis melo var. makuwa]    5.1e-64    41.02
Query:  RRPIWRGINPKGLQREP--NSGEGLTRMRSMAAAAHGSFGEGNLEPVGSIKRVGSPEIGSEKV---TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGP
        R PI  G+ P+G + +    +G  + R+R  A       G G+L        V   E G   V   +AE  G P         GP    + L G    G 
Subjt:  RRPIWRGINPKGLQREP--NSGEGLTRMRSMAAAAHGSFGEGNLEPVGSIKRVGSPEIGSEKV---TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGP

Query:  KSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRVVVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVG
        K+DG  NG  +D   L A+      D + P         NT  AG          V+  P +    +     K   ++      EG    LPYT P  +G
Subjt:  KSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRVVVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVG

Query:  SKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP------
         K+VV+P+E +I QG+R+WENSLVGQL+DA LPY+VIQRL+EKI GKIEMP IT+LEN LICFQFRR  S+EWILSRGPWHL GKPML  KW P      
Subjt:  SKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP------

Query:  --------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAV
                                              DLATKERRRLSYARVCVE+EGG+++P+E+TVNLRGVE +V V YEWKPR+CN C +FGH + 
Subjt:  --------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAV

Query:  RGSEIQGSPSRQ
        + S  + S + Q
Subjt:  RGSEIQGSPSRQ

XP_008460524.1 PREDICTED: uncharacterized protein LOC103499323 [Cucumis melo]    1.6e-65    42.19
Query:  TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGPKSDG---KLNGALMDKRILKAN---VGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRV-----
        TAE+ G P         GP   GL + G +  G +  G    LN A   +     N   +GP     V  ++  G    N G   GP++   + V     
Subjt:  TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGPKSDG---KLNGALMDKRILKAN---VGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRV-----

Query:  --------VVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGK
                V+  P +    +     K   ++      EG    LPYT P  +G K+VV+P+E +I QG+R+WENSLVGQL+DA LPY+VIQRL+EKI GK
Subjt:  --------VVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGK

Query:  IEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRR
        IEMP IT+LEN LICFQFRR  S+EWILSRGPWHL GKPML  KW P                                            DLATKERRR
Subjt:  IEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRR

Query:  LSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAVRGSEIQGSPSRQ
        LSYARVCVE+EGG+++P+E+TVNLRGVE +V V YEWKPR+CN C +FGH + + S  + S + Q
Subjt:  LSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAVRGSEIQGSPSRQ

XP_008463187.1 PREDICTED: uncharacterized protein LOC103501395 [Cucumis melo]    7.1e-58    41.05
Query:  PNSFGPNYDGLLLDGKQAVGPKSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFE-GNTGKAG--GPKVVGPSRVVVGGPVSKDKIYEGGSVK---
        P+  GP+ +     G +  G + DG  +    ++ ++          K+  +SS  + E G+   +G   P  V    V+     +   + + G  K   
Subjt:  PNSFGPNYDGLLLDGKQAVGPKSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFE-GNTGKAG--GPKVVGPSRVVVGGPVSKDKIYEGGSVK---

Query:  EAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEW
        ++ S   ++        L YT P ++G K+VV P E +I QG+++WENSLVGQL+D+ LPY VIQ L+EKIWGKIEMP IT+LEN LICFQFRR  S+EW
Subjt:  EAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEW

Query:  ILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRG
        ILSRGPWHL  K MLL KW P                                            DLATKERRRLSYARVCVE+E G+++P E+TV+LRG
Subjt:  ILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRG

Query:  VECSVLVTYEWKPRRCNSCHSFGH
        V+ +V V YEWKPR+CN C +FGH
Subjt:  VECSVLVTYEWKPRRCNSCHSFGH

TrEMBL top hits    e value    % identity    Alignment
A0A1S3CC83 uncharacterized protein LOC103499323    7.6e-66    42.19
Query:  TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGPKSDG---KLNGALMDKRILKAN---VGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRV-----
        TAE+ G P         GP   GL + G +  G +  G    LN A   +     N   +GP     V  ++  G    N G   GP++   + V     
Subjt:  TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGPKSDG---KLNGALMDKRILKAN---VGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRV-----

Query:  --------VVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGK
                V+  P +    +     K   ++      EG    LPYT P  +G K+VV+P+E +I QG+R+WENSLVGQL+DA LPY+VIQRL+EKI GK
Subjt:  --------VVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGK

Query:  IEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRR
        IEMP IT+LEN LICFQFRR  S+EWILSRGPWHL GKPML  KW P                                            DLATKERRR
Subjt:  IEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRR

Query:  LSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAVRGSEIQGSPSRQ
        LSYARVCVE+EGG+++P+E+TVNLRGVE +V V YEWKPR+CN C +FGH + + S  + S + Q
Subjt:  LSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAVRGSEIQGSPSRQ

A0A5A7TWG5 Reverse transcriptase domain-containing protein    3.4e-58    41.05
Query:  PNSFGPNYDGLLLDGKQAVGPKSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFE-GNTGKAG--GPKVVGPSRVVVGGPVSKDKIYEGGSVK---
        P+  GP+ +     G +  G + DG  +    ++ ++          K+  +SS  + E G+   +G   P  V    V+     +   + + G  K   
Subjt:  PNSFGPNYDGLLLDGKQAVGPKSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFE-GNTGKAG--GPKVVGPSRVVVGGPVSKDKIYEGGSVK---

Query:  EAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEW
        ++ S   ++        L YT P ++G K+VV P E +I QG+++WENSLVGQL+D+ LPY VIQ L+EKIWGKIEMP IT+LEN LICFQFRR  S+EW
Subjt:  EAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEW

Query:  ILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRG
        ILSRGPWHL  K MLL KW P                                            DLATKERRRLSYARVCVE+E G+++P E+TV+LRG
Subjt:  ILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRG

Query:  VECSVLVTYEWKPRRCNSCHSFGH
        V+ +V V YEWKPR+CN C +FGH
Subjt:  VECSVLVTYEWKPRRCNSCHSFGH

A0A5A7V507 DUF4283 domain-containing protein    4.9e-65    41.26
Query:  RRPIWRGINPKGLQREP--NSGEGLTRMRSMAAAAHGSFGEGNLEPVGSIKRVGSPEIGSEKV---TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGP
        R PI  G+ P+G + +    +G  + R+R  A       G G+L        VG  E G   V   +AE  G P         GP    + L G    G 
Subjt:  RRPIWRGINPKGLQREP--NSGEGLTRMRSMAAAAHGSFGEGNLEPVGSIKRVGSPEIGSEKV---TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGP

Query:  KSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRVVVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVG
        K+DG  NG  +D   L A+      D + P         NT  AG          V+  P +    +     K   ++      EG    LPYT P  +G
Subjt:  KSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRVVVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVG

Query:  SKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP------
         K+VV+P+E +I QG+R+WENSLVGQL+DA LPY+VIQRL+EKI GKIEMP IT+LEN LICFQFRR  S+EWILSRGPWHL GKPML  KW P      
Subjt:  SKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP------

Query:  --------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAV
                                              DLATKERRRLSYARVCVE+EGG+++P+E+TVNLRGVE +V V YEWKPR+CN C +FGH + 
Subjt:  --------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAV

Query:  RGSEIQGSPSRQ
        + S  + S + Q
Subjt:  RGSEIQGSPSRQ

A0A5D3D5X6 Reverse transcriptase domain-containing protein    3.4e-58    41.05
Query:  PNSFGPNYDGLLLDGKQAVGPKSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFE-GNTGKAG--GPKVVGPSRVVVGGPVSKDKIYEGGSVK---
        P+  GP+ +     G +  G + DG  +    ++ ++          K+  +SS  + E G+   +G   P  V    V+     +   + + G  K   
Subjt:  PNSFGPNYDGLLLDGKQAVGPKSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFE-GNTGKAG--GPKVVGPSRVVVGGPVSKDKIYEGGSVK---

Query:  EAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEW
        ++ S   ++        L YT P ++G K+VV P E +I QG+++WENSLVGQL+D+ LPY VIQ L+EKIWGKIEMP IT+LEN LICFQFRR  S+EW
Subjt:  EAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEW

Query:  ILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRG
        ILSRGPWHL  K MLL KW P                                            DLATKERRRLSYARVCVE+E G+++P E+TV+LRG
Subjt:  ILSRGPWHLSGKPMLLHKWVP--------------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRG

Query:  VECSVLVTYEWKPRRCNSCHSFGH
        V+ +V V YEWKPR+CN C +FGH
Subjt:  VECSVLVTYEWKPRRCNSCHSFGH

A0A5D3DSG9 DUF4283 domain-containing protein    2.5e-64    41.02
Query:  RRPIWRGINPKGLQREP--NSGEGLTRMRSMAAAAHGSFGEGNLEPVGSIKRVGSPEIGSEKV---TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGP
        R PI  G+ P+G + +    +G  + R+R  A       G G+L        V   E G   V   +AE  G P         GP    + L G    G 
Subjt:  RRPIWRGINPKGLQREP--NSGEGLTRMRSMAAAAHGSFGEGNLEPVGSIKRVGSPEIGSEKV---TAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGP

Query:  KSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRVVVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVG
        K+DG  NG  +D   L A+      D + P         NT  AG          V+  P +    +     K   ++      EG    LPYT P  +G
Subjt:  KSDGKLNGALMDKRILKANVGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRVVVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVG

Query:  SKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP------
         K+VV+P+E +I QG+R+WENSLVGQL+DA LPY+VIQRL+EKI GKIEMP IT+LEN LICFQFRR  S+EWILSRGPWHL GKPML  KW P      
Subjt:  SKLVVVPSEAIIAQGVRMWENSLVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVP------

Query:  --------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAV
                                              DLATKERRRLSYARVCVE+EGG+++P+E+TVNLRGVE +V V YEWKPR+CN C +FGH + 
Subjt:  --------------------------------------DLATKERRRLSYARVCVEVEGGADLPTEVTVNLRGVECSVLVTYEWKPRRCNSCHSFGHFAV

Query:  RGSEIQGSPSRQ
        + S  + S + Q
Subjt:  RGSEIQGSPSRQ

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
No hits found
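
In the alignments above, the middle line follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of how a percent-identity figure can be derived from such a midline is given below. The function name midline_identity is hypothetical, and the excerpt is only the final alignment block of the XP_008463187.1 hit, so the result is not expected to match the 41.05 reported for the full alignment.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline character is a letter.

    Assumes BLAST-style output: a letter in the midline means the query
    and subject residues are identical at that column.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Final alignment block of the XP_008463187.1 hit (24 columns).
query = "VECSVLVTYEWKPRRCNSCHSFGH"
midline = "V+ +V V YEWKPR+CN C +FGH"

print(f"{midline_identity(query, midline):.1f}")  # 15 of 24 columns identical -> 62.5
```

The last conserved block scores well above the overall figure, which is consistent with the full alignments: most of the identity sits in the C-terminal region, while the N-terminal region aligns poorly.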

Sequences
CDS sequence
ATGGCAATTCGACGTCCGATTTGGCGGGGTATTAATCCAAAAGGCTTACAAAGGGAACCTAATTCCGGCGAAGGTTTGACTAGGATGAGATCTATGGCGGCGGCGGCGCA
TGGAAGCTTTGGAGAGGGTAACCTAGAACCGGTCGGTTCAATTAAACGGGTTGGTTCTCCAGAAATAGGTTCTGAGAAGGTGACGGCTGAGATGCAAGGTGGGCCAACTA
AGGGTCTTCTGCCGAATTCATTTGGGCCTAACTATGATGGGCTTCTTTTGGATGGAAAACAAGCGGTTGGGCCTAAGTCTGATGGGAAGTTAAATGGGGCTTTGATGGAT
AAAAGGATTCTGAAGGCTAACGTTGGGCCTTCTTTGGTGGACAAGGTTGGGCCTGTTTCAAGTGTTGGGCAATTTGAGGGAAATACGGGCAAGGCTGGAGGGCCGAAAGT
AGTTGGTCCTTCAAGGGTGGTTGTTGGTGGGCCTGTGTCAAAAGATAAAATATATGAGGGAGGGTCGGTTAAGGAGGCTGGATCCAATGCACGGAATATAAATATTGAAG
GAAGGGTAAATTGTCTTCCGTATACTCCACCATATTTGGTTGGATCGAAATTAGTGGTTGTTCCTTCGGAGGCGATTATCGCTCAAGGTGTTCGGATGTGGGAAAACTCT
TTAGTGGGCCAACTTGTTGACGCTACTTTGCCATATGCAGTGATTCAACGGCTTATTGAGAAAATTTGGGGGAAAATCGAAATGCCAACCATTACGATGCTAGAGAATGG
GCTTATTTGCTTTCAATTTCGTCGTCCCAATTCGATAGAGTGGATTCTATCCCGTGGGCCATGGCATCTTAGTGGGAAACCTATGCTCCTCCACAAATGGGTTCCAGATT
TGGCCACTAAGGAGAGACGTAGACTGTCGTATGCTAGGGTGTGTGTTGAAGTAGAAGGGGGTGCTGATTTGCCTACTGAGGTCACAGTTAATTTGAGGGGTGTGGAATGC
AGTGTTCTGGTTACTTATGAGTGGAAACCACGTAGGTGTAATTCATGTCATTCGTTTGGTCATTTTGCTGTTAGAGGAAGTGAGATTCAGGGTTCTCCGAGTAGACAGGT
GTCTTCGACAATAATGGTGGGGGTGTGGTTATGGGAGATTTTAATGCAATTCGAGTGCACTCTGAAGCTTGTGGGTGGGAGTCCGGTTACTGGTGATATGGAGGAGTTTG
ATCTTGCTATTCGTGATGCTGACTTGGTTGAGCCAGCTGTTTAG
mRNA sequence
ATGGCAATTCGACGTCCGATTTGGCGGGGTATTAATCCAAAAGGCTTACAAAGGGAACCTAATTCCGGCGAAGGTTTGACTAGGATGAGATCTATGGCGGCGGCGGCGCA
TGGAAGCTTTGGAGAGGGTAACCTAGAACCGGTCGGTTCAATTAAACGGGTTGGTTCTCCAGAAATAGGTTCTGAGAAGGTGACGGCTGAGATGCAAGGTGGGCCAACTA
AGGGTCTTCTGCCGAATTCATTTGGGCCTAACTATGATGGGCTTCTTTTGGATGGAAAACAAGCGGTTGGGCCTAAGTCTGATGGGAAGTTAAATGGGGCTTTGATGGAT
AAAAGGATTCTGAAGGCTAACGTTGGGCCTTCTTTGGTGGACAAGGTTGGGCCTGTTTCAAGTGTTGGGCAATTTGAGGGAAATACGGGCAAGGCTGGAGGGCCGAAAGT
AGTTGGTCCTTCAAGGGTGGTTGTTGGTGGGCCTGTGTCAAAAGATAAAATATATGAGGGAGGGTCGGTTAAGGAGGCTGGATCCAATGCACGGAATATAAATATTGAAG
GAAGGGTAAATTGTCTTCCGTATACTCCACCATATTTGGTTGGATCGAAATTAGTGGTTGTTCCTTCGGAGGCGATTATCGCTCAAGGTGTTCGGATGTGGGAAAACTCT
TTAGTGGGCCAACTTGTTGACGCTACTTTGCCATATGCAGTGATTCAACGGCTTATTGAGAAAATTTGGGGGAAAATCGAAATGCCAACCATTACGATGCTAGAGAATGG
GCTTATTTGCTTTCAATTTCGTCGTCCCAATTCGATAGAGTGGATTCTATCCCGTGGGCCATGGCATCTTAGTGGGAAACCTATGCTCCTCCACAAATGGGTTCCAGATT
TGGCCACTAAGGAGAGACGTAGACTGTCGTATGCTAGGGTGTGTGTTGAAGTAGAAGGGGGTGCTGATTTGCCTACTGAGGTCACAGTTAATTTGAGGGGTGTGGAATGC
AGTGTTCTGGTTACTTATGAGTGGAAACCACGTAGGTGTAATTCATGTCATTCGTTTGGTCATTTTGCTGTTAGAGGAAGTGAGATTCAGGGTTCTCCGAGTAGACAGGT
GTCTTCGACAATAATGGTGGGGGTGTGGTTATGGGAGATTTTAATGCAATTCGAGTGCACTCTGAAGCTTGTGGGTGGGAGTCCGGTTACTGGTGATATGGAGGAGTTTG
ATCTTGCTATTCGTGATGCTGACTTGGTTGAGCCAGCTGTTTAG
Protein sequence
MAIRRPIWRGINPKGLQREPNSGEGLTRMRSMAAAAHGSFGEGNLEPVGSIKRVGSPEIGSEKVTAEMQGGPTKGLLPNSFGPNYDGLLLDGKQAVGPKSDGKLNGALMD
KRILKANVGPSLVDKVGPVSSVGQFEGNTGKAGGPKVVGPSRVVVGGPVSKDKIYEGGSVKEAGSNARNINIEGRVNCLPYTPPYLVGSKLVVVPSEAIIAQGVRMWENS
LVGQLVDATLPYAVIQRLIEKIWGKIEMPTITMLENGLICFQFRRPNSIEWILSRGPWHLSGKPMLLHKWVPDLATKERRRLSYARVCVEVEGGADLPTEVTVNLRGVEC
SVLVTYEWKPRRCNSCHSFGHFAVRGSEIQGSPSRQVSSTIMVGVWLWEILMQFECTLKLVGGSPVTGDMEEFDLAIRDADLVEPAV
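
The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code; the helper name translate is hypothetical, and only the first 24 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCAATTCGACGTCCGATTTGG"  # first 24 nt of the CDS
cds_end = "GAGCCAGCTGTTTAG"             # last 15 nt of the CDS, including the stop

print(translate(cds_start))  # MAIRRPIW
print(translate(cds_end))    # EPAV
```

Both fragments agree with the ends of the listed protein sequence (MAIRRPIW... and ...EPAV).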