CuGenDBv2

HG10002796 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10002796
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr11:12809644..12812063
RNA-Seq Expression: HG10002796
Synteny: HG10002796
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
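
The Genome location field uses a `chromosome:start..end` layout. The sketch below shows one way to split such a string and compute the span it covers. The names `Location` and `parse_location` are hypothetical helpers introduced for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'chrom:start..end' location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a string such as 'Chr11:12809644..12812063' into its parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("Chr11:12809644..12812063")
print(loc.chrom, loc.start, loc.end, loc.span)  # Chr11 12809644 12812063 2420
```

Under that assumption the locus covers 2420 bp.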


Homology
GenBank top hits (e-value, % identity, alignment)
XP_010686122.1 PREDICTED: uncharacterized protein LOC104900404 [Beta vulgaris subsp. vulgaris] (e-value: 4.5e-42; identity: 29.61%)
Query:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPLRKGILIQSENYPKGR--W
        FD+ L++L+   +  +PS+ +     FW+R+ +LP+G++++    +IG  IG   + + D   V W  + R++I +DI +PLR+   +Q  +   G    
Subjt:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPLRKGILIQSENYPKGR--W

Query:  FNLCYERLSDFCFGCGRIGHLLKECL-------------------------ESKEREIGGEEE------------------------------KKWRFTG
         ++ YERL  FC+ CG IGH+ ++CL                          SK R + G  +                              ++WRF G
Subjt:  FNLCYERLSDFCFGCGRIGHLLKECL-------------------------ESKEREIGGEEE------------------------------KKWRFTG

Query:  VYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTL-HGVRRNAGIWERLDRFVGTIE
        VYG+PE  NKH TW L+R +  E + P ++GGD NEI    EK GG  ++ R +  FREV+D C L DL   G  +T   G      I ERLDRF+ +  
Subjt:  VYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTL-HGVRRNAGIWERLDRFVGTIE

Query:  FIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDSFVPKRRR---RPFRFEECWINLPECRNIIEGAGDWELSNGHHSLQNSLKICSNRLLPWGRSLFNKRKV
        ++QLF +  V HL    S+H  I        +  PK ++   R F+FE  W+    C   +  A  W+ S G   +Q+ L + +  L+ W ++       
Subjt:  FIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDSFVPKRRR---RPFRFEECWINLPECRNIIEGAGDWELSNGHHSLQNSLKICSNRLLPWGRSLFNKRKV

Query:  EIARCKQMLKEA
        +I R ++ L  A
Subjt:  EIARCKQMLKEA

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e-value: 1.7e-36; identity: 29.14%)
Query:  RTGLLKQALGFFDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPLRKGILIQ
        R  + K     FD+ L+++  PV    PS+  F     WVR  DLPLG  T+ MA+++G+A+G  E+ D D     WG NLR+++ +DIS+PLR+GI + 
Subjt:  RTGLLKQALGFFDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPLRKGILIQ

Query:  SENYPKGRWFNLCYERLSDFCFGCG--------RIGHLLK-------------------------ECLESKEREIGG-----------------------
         +    G W  + YERL DFC+ CG        + G  L+                             S    +G                        
Subjt:  SENYPKGRWFNLCYERLSDFCFGCG--------RIGHLLK-------------------------ECLESKEREIGG-----------------------

Query:  EEEKK----------------------------------------------------------WRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGG
        E  KK                                                           RFTG YG+P A+ +HLTW LLRRI      PWL+GG
Subjt:  EEEKK----------------------------------------------------------WRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGG

Query:  DLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTLHGVRRNAG--IWERLDRFVGTIEFIQLF
        D+N I W+ E       D+  +E FR ++D C L D+GF+G  FT     R AG  +W+RLDRF+    F  +F
Subjt:  DLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTLHGVRRNAG--IWERLDRFVGTIEFIQLF

XP_027118730.1 uncharacterized protein LOC113735973 [Coffea arabica] (e-value: 2.6e-37; identity: 29.03%)
Query:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPLRKGILIQSENYPKGRWFN
        FD  L+V+   V + +PS  K +  SFWV++ +LPLG+     A  IG  +G  E FD     +  G+ LRI+++++++ PL++ + +  E       F 
Subjt:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPLRKGILIQSENYPKGRWFN

Query:  LCYERLSDFCFGCGRIGHLLKEC---------------------------------LESKEREIGGEEEK----------------------KWRFTGVY
          YERL   C  CGRIGH  ++C                                 + ++ R +  + ++                       WR TG Y
Subjt:  LCYERLSDFCFGCGRIGHLLKEC---------------------------------LESKEREIGGEEEK----------------------KWRFTGVY

Query:  GYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTLHGVRRNAGIWE-RLDRFVGTIEFI
        G+PEA  +  TW ++R++      PW+  GD NE+   +E  G   +    + NFR+ L DC L DLG EG+ FT    R        RLDR   + +F+
Subjt:  GYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTLHGVRRNAGIWE-RLDRFVGTIEFI

Query:  QLFEKGRVSHLDWLFSNHRPIEFTF-LFDDSFVPK---RRRRPFRFEECWINLPECRNIIEGAGDWELSNGHHSLQN---SLKICSNRLLPWGRSLF-NK
         LF   R+ H   LFS+H PI        D F  +   RR + F FE  WI   +C  II+ +  W+ S+   +  N       C   LL W +  F N 
Subjt:  QLFEKGRVSHLDWLFSNHRPIEFTF-LFDDSFVPK---RRRRPFRFEECWINLPECRNIIEGAGDWELSNGHHSLQN---SLKICSNRLLPWGRSLF-NK

Query:  RKV
         KV
Subjt:  RKV

XP_030922765.1 uncharacterized protein LOC115949628 [Quercus lobata] (e-value: 2.9e-36; identity: 37.44%)
Query:  EEKKWRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFT-LHGVRRNAGIWER
        +E++WRFTG YG P+   ++ +W  LRR++ + + PW+  GD NEI    EK GG  +  + +E FREVLD+C   DLGF GS +T   G   N  IWER
Subjt:  EEKKWRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFT-LHGVRRNAGIWER

Query:  LDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDSFVPKRRRRPFRFEECWINLPECRNIIEGA-GDWELSNGHHSLQNSLKICSNRLLPWGRS
        LDR V T ++I+LF   +V HL+   S+H+PI    +     + KRR++P+RFE+ W+  P C+ I+  A G + L      ++  ++ C  +L  W R 
Subjt:  LDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDSFVPKRRRRPFRFEECWINLPECRNIIEGA-GDWELSNGHHSLQNSLKICSNRLLPWGRS

Query:  LFNKRKVEIARCKQMLKEA
         F      +   K+ L++A
Subjt:  LFNKRKVEIARCKQMLKEA

XP_030925053.1 uncharacterized protein LOC115952114 [Quercus lobata] (e-value: 6.8e-38; identity: 40%)
Query:  IGGEEEKKWRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTLHGVRRNAGI
        + G  E  WR TG YG P+   ++  W++L+ +  +   PW V GD NE+   +EK GG  +   L++NFR+VLD C  VDLG+ G  FT HG RR   I
Subjt:  IGGEEEKKWRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTLHGVRRNAGI

Query:  WERLDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDSFVPKRRRRPFRFEECWINLPECRNIIEGAGDWELSNGHHSLQNSLKI--CSNRLLP
        WERLDR +   E++  F  GRV HL+   S+HRPI  + L       K RR+PFRFE  W + PEC+ +IE A D    NG+     + KI  C  +L  
Subjt:  WERLDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDSFVPKRRRRPFRFEECWINLPECRNIIEGAGDWELSNGHHSLQNSLKI--CSNRLLP

Query:  WGRSLFNKRKVEIARCKQMLKEAYDKLPEA
        W R       V     KQ +K+A ++L +A
Subjt:  WGRSLFNKRKVEIARCKQMLKEAYDKLPEA

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9E949 CCHC-type domain-containing protein (e-value: 2.9e-42; identity: 28.44%)
Query:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPL--RKGILIQSENYPKGRW
        FDK L++L+    D K    +   ASFW++I +LP     +  A  +G+A+GV E+ D   + + WGE +R+++RID+S PL  R+ + +  E   +  W
Subjt:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPL--RKGILIQSENYPKGRW

Query:  FNLCYERLSDFCFGCGRIGHLLKECL------ESKE----------------REIGG-------------------------------------------
          L YE+L  FC+ CG +GH  +EC       +SK+                R+ G                                            
Subjt:  FNLCYERLSDFCFGCGRIGHLLKECL------ESKE----------------REIGG-------------------------------------------

Query:  ------------------------------EEEKKWRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREV
                                      E EK WR TG YG PE   +  +W LL+ +  +   PW+V GD NEI  + EK G  L+    + +FRE 
Subjt:  ------------------------------EEEKKWRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREV

Query:  LDDCKLVDLGFEGSPFTLHGVRRN-AGIWERLDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDSFVPKRRRRPFRFEECWINLPECRNIIEG
        L+   L DLGF G  +T    R     I ERLDR V   E+  +F +  V HL    S+H PI    LF    +P RR+R FRFE+ W  +  C   ++ 
Subjt:  LDDCKLVDLGFEGSPFTLHGVRRN-AGIWERLDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDSFVPKRRRRPFRFEECWINLPECRNIIEG

Query:  AGDWELSNGH-HSLQNSLKICSNRLLPWGRSLFNKRKVEIARCKQMLKEA
        +   +    H   +  S+K C   L+ W RS++   ++   R  Q+ +EA
Subjt:  AGDWELSNGH-HSLQNSLKICSNRLLPWGRSLFNKRKVEIARCKQMLKEA

A0A2N9FNT0 RNase H domain-containing protein (e-value: 1.5e-43; identity: 29.05%)
Query:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGEN-LRIKIRIDISQPLRKGILIQSENYPKGRWF
        +DKFL+V +    D    D  F+  SFWV++ +LP+  +T+  A  IG +IG+ E      E+   GEN +R++IR++I++PL +G L++ E   KG W 
Subjt:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGEN-LRIKIRIDISQPLRKGILIQSENYPKGRWF

Query:  NLCYERLSDFCFGCGRIGHLLKEC---------------------------------------------------------------------LESKERE
           YERL +FC+ CG + H  K+C                                                                     L   +RE
Subjt:  NLCYERLSDFCFGCGRIGHLLKEC---------------------------------------------------------------------LESKERE

Query:  IG----------------------------GEEEKKWRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFRE
         G                               ++ WR T  YG PE + +  +W+LLR + G+   PW   GD NEI    EK G   +    ++ FR 
Subjt:  IG----------------------------GEEEKKWRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFRE

Query:  VLDDCKLVDLGFEGSPFTLHGVRR-NAGIWERLDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDSFVPKRRRRPFRFEECWINLPECRNIIE
        V+D+C  +DLGF G PFT    RR NA  W RLDRF+ T E++  F    V H++   S+H+PI          VP+ R++ FRFE+ W   P+C  ++ 
Subjt:  VLDDCKLVDLGFEGSPFTLHGVRR-NAGIWERLDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDSFVPKRRRRPFRFEECWINLPECRNIIE

Query:  GAGDW-ELSNGHHSLQNSLKI--CSNRLLPWGRSLFN------KRKVEIAR
         A  W   + G    Q   KI  C + L  W R+ F       K K E+ R
Subjt:  GAGDW-ELSNGHHSLQNSLKI--CSNRLLPWGRSLFN------KRKVEIAR

A0A2N9GPY1 Reverse transcriptase domain-containing protein (e-value: 7.8e-40; identity: 25.88%)
Query:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPLRKGILIQSENYPKGRWFN
        +DK L+VL+   ED    D  F   SFWV++  LP+       A+ IG ++G           V  G  +R+++ +DI++PL +G  ++ E   +  W  
Subjt:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPLRKGILIQSENYPKGRWFN

Query:  LCYERLSDFCFGCGRIGHLLKEC-----------------------------------------------------------------------------
          YERL +FC+ CG + H  K+C                                                                             
Subjt:  LCYERLSDFCFGCGRIGHLLKEC-----------------------------------------------------------------------------

Query:  ---------------------LESKEREIGG----------------------------EEEKKWRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVG
                               +K R  GG                             ++  WR TG YG PE  N+  +W+LLRR+  + + PW   
Subjt:  ---------------------LESKEREIGG----------------------------EEEKKWRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVG

Query:  GDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTLHGVRRNAGIWERLDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDS
        GD NE+   +EK G   +  R ++ FR+VLDD   +DLGF G PFT    R     WERLDR V T  ++ LF   RV HLD  +S+H+PI   ++  + 
Subjt:  GDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTLHGVRRNAGIWERLDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDS

Query:  FVPKRRRRPFRFEECWINLPECRNIIEGAGDWELSN---GHHSLQNSLKICSNRLLPWGRSLFNKRKVEIARCKQMLKEAYDK
         V    R+PFRFEE W +   C   IE +  W +       +++   +  C   L  W +  F   K++I   +  LK A  K
Subjt:  FVPKRRRRPFRFEECWINLPECRNIIEGAGDWELSN---GHHSLQNSLKICSNRLLPWGRSLFNKRKVEIARCKQMLKEAYDK

A0A2N9INH4 Reverse transcriptase domain-containing protein (e-value: 9.2e-41; identity: 31.61%)
Query:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGEN-LRIKIRIDISQPLRKGILIQSENYPKGRWF
        FDKFL+V E   ED    D  F+  +FWV+I +LP+   T+  A  IG  +G  E    D ++   GEN +R+++R+D++ PL +G +I+ E   K  W 
Subjt:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGEN-LRIKIRIDISQPLRKGILIQSENYPKGRWF

Query:  NLCYERLSDFCFGCGRIGHLLKECLESKERE-------------IGGEEEKKWRFTGVYGYPEAYNK----HLTWSLLR----RIQG--------EMNEP
           YERL +FC+ CG + H  K+C +  +++             +  E ++  R T +       +     H +  LL+    RI G        + + P
Subjt:  NLCYERLSDFCFGCGRIGHLLKECLESKERE-------------IGGEEEKKWRFTGVYGYPEAYNK----HLTWSLLR----RIQG--------EMNEP

Query:  WLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTLHGVRR-NAGIWERLDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTF
        W   GD NEI    E  G   ++ R ++ FR V+DDC+ +DLG+ G PFT    RR  A  W RLDRF+ T E++  F    V HL+   S+H+PI  T 
Subjt:  WLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTLHGVRR-NAGIWERLDRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTF

Query:  LFDDSFVPKRRRRPFRFEECWINLPECRNIIEGAGDWELSNGHH-SLQNSLKICSNRLLPWGRSLFNKRKVEIARCKQMLKEAYDK
            S  PK +   FRF+E W     C   I  A   ++       +Q+ +  C   L  W R  F      I + K  LK A ++
Subjt:  LFDDSFVPKRRRRPFRFEECWINLPECRNIIEGAGDWELSNGHH-SLQNSLKICSNRLLPWGRSLFNKRKVEIARCKQMLKEAYDK

A0A803N338 Uncharacterized protein (e-value: 7.8e-40; identity: 24.9%)
Query:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPLRKGILIQSENYPKGRWFN
        FD   ++L+    D +PSD  F+   FW+R++D+P G +T   A ++GD IG C + D + + + W E +R+K+ ++I++PLR+G+ +  E+    +W  
Subjt:  FDKFLIVLETPVEDWKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPLRKGILIQSENYPKGRWFN

Query:  LCYERLSDFCFGCGRIGHLLKECLESKERE------------------------IGGEEEKKW-------------------------------------
        L YERL DFC+ CG IGH  K+C+E ++ +                        +  E+E+KW                                     
Subjt:  LCYERLSDFCFGCGRIGHLLKECLESKERE------------------------IGGEEEKKW-------------------------------------

Query:  ----------------------------------------------------------------------------------RFTGVYGYPEAYNKHLTW
                                                                                          R  G+YG+PE   KH TW
Subjt:  ----------------------------------------------------------------------------------RFTGVYGYPEAYNKHLTW

Query:  SLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTL-HGVRRNAGIWERLDRFVGTIEFIQLFEKGRVSHLD
         L+  ++ + N P ++ GDLNEI    EK GG  +  RL++ FR  +D+C L DLGF+GS FT   G      + ERLDRF+    ++ LF +    +  
Subjt:  SLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTL-HGVRRNAGIWERLDRFVGTIEFIQLFEKGRVSHLD

Query:  WLFSNHRPIEFTFLFDDSFVPKRRRRPFRFEECWINLPECRNIIEGAGDWELSNGHHSLQNSLKICSNRLLPWGRSLFNKRKVEIARCKQMLKE
           S+H PI  +    +  V     + FRFE  W++  EC  I++ +  W  S     +   +  C   L  W    F   K +I + +Q  +E
Subjt:  WLFSNHRPIEFTFLFDDSFVPKRRRRPFRFEECWINLPECRNIIEGAGDWELSNGHHSLQNSLKICSNRLLPWGRSLFNKRKVEIARCKQMLKE

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATGGTTGAGGGAGCTTTCTTGAGTTTCAAGAATTCTTCAAGAATCCTCGGTTTACTGGAGGTCTTGTCAGAAGCTTTTTGCTGTCTGTTTGGTTTGTGTGGTCTACT
GAACTCTTTGGGAGTTGAAGTCATGGATGAAGAAGGGTTACTGAAGGATTGGGGTAAACTTCGTCTTACAGAAGGAGAAAAGAAGACGATATTAAGGAAAGAGGAAGTTA
TACTTTGCTCAATAAGTGATCATCTTGAGGGGACAAGAACTGGGTTATTAAAGCAGGCTCTTGGTTTTTTTGACAAATTCTTAATAGTTTTGGAAACTCCAGTTGAAGAT
TGGAAGCCTTCTGACTATAAGTTTAATTTTGCATCCTTTTGGGTGAGAATTATGGATCTTCCGTTGGGTTTTCAAACAAAGATGATGGCATTAAAGATAGGGGATGCTAT
AGGAGTATGTGAAGATTTTGATGGGGATCTAGAAGAGGTCTGTTGGGGTGAGAATCTGAGAATAAAGATTAGAATTGATATCTCACAACCTTTGCGCAAGGGTATTTTAA
TACAATCTGAAAATTATCCGAAAGGGAGGTGGTTCAATCTTTGTTATGAAAGATTATCAGATTTTTGCTTTGGTTGTGGACGGATTGGGCATTTGTTGAAGGAGTGTTTA
GAGTCGAAAGAAAGGGAGATTGGTGGAGAGGAAGAGAAGAAGTGGAGATTTACGGGTGTCTATGGATACCCAGAAGCCTACAACAAGCACTTGACTTGGAGTTTACTGAG
GAGAATTCAAGGGGAGATGAATGAGCCATGGCTTGTTGGGGGTGATTTAAATGAAATCTGTTGGGATAAGGAGAAATTTGGAGGCCCTTTGAAAGATAGTAGACTTCTTG
AAAATTTCAGGGAGGTTCTTGATGATTGTAAACTGGTAGATCTGGGTTTTGAGGGTTCTCCCTTTACATTGCATGGTGTGAGGAGGAATGCTGGAATTTGGGAGCGCCTT
GATCGTTTTGTTGGTACGATTGAGTTCATCCAATTATTTGAAAAGGGTCGAGTTAGTCACTTAGATTGGCTTTTTTCAAATCACAGGCCTATTGAATTCACGTTTCTCTT
TGATGATTCATTTGTGCCTAAGAGAAGAAGAAGACCTTTTAGATTTGAAGAATGTTGGATTAATCTTCCCGAATGTAGGAATATTATTGAAGGAGCTGGAGACTGGGAGT
TGTCTAATGGTCACCATTCCCTTCAAAATAGTTTGAAGATATGTTCGAACAGACTTCTTCCTTGGGGAAGATCCTTATTCAATAAAAGAAAAGTTGAAATAGCAAGATGC
AAACAGATGTTAAAAGAGGCATATGATAAGCTCCCTGAGGCTTGA
mRNA sequence
ATGATGGTTGAGGGAGCTTTCTTGAGTTTCAAGAATTCTTCAAGAATCCTCGGTTTACTGGAGGTCTTGTCAGAAGCTTTTTGCTGTCTGTTTGGTTTGTGTGGTCTACT
GAACTCTTTGGGAGTTGAAGTCATGGATGAAGAAGGGTTACTGAAGGATTGGGGTAAACTTCGTCTTACAGAAGGAGAAAAGAAGACGATATTAAGGAAAGAGGAAGTTA
TACTTTGCTCAATAAGTGATCATCTTGAGGGGACAAGAACTGGGTTATTAAAGCAGGCTCTTGGTTTTTTTGACAAATTCTTAATAGTTTTGGAAACTCCAGTTGAAGAT
TGGAAGCCTTCTGACTATAAGTTTAATTTTGCATCCTTTTGGGTGAGAATTATGGATCTTCCGTTGGGTTTTCAAACAAAGATGATGGCATTAAAGATAGGGGATGCTAT
AGGAGTATGTGAAGATTTTGATGGGGATCTAGAAGAGGTCTGTTGGGGTGAGAATCTGAGAATAAAGATTAGAATTGATATCTCACAACCTTTGCGCAAGGGTATTTTAA
TACAATCTGAAAATTATCCGAAAGGGAGGTGGTTCAATCTTTGTTATGAAAGATTATCAGATTTTTGCTTTGGTTGTGGACGGATTGGGCATTTGTTGAAGGAGTGTTTA
GAGTCGAAAGAAAGGGAGATTGGTGGAGAGGAAGAGAAGAAGTGGAGATTTACGGGTGTCTATGGATACCCAGAAGCCTACAACAAGCACTTGACTTGGAGTTTACTGAG
GAGAATTCAAGGGGAGATGAATGAGCCATGGCTTGTTGGGGGTGATTTAAATGAAATCTGTTGGGATAAGGAGAAATTTGGAGGCCCTTTGAAAGATAGTAGACTTCTTG
AAAATTTCAGGGAGGTTCTTGATGATTGTAAACTGGTAGATCTGGGTTTTGAGGGTTCTCCCTTTACATTGCATGGTGTGAGGAGGAATGCTGGAATTTGGGAGCGCCTT
GATCGTTTTGTTGGTACGATTGAGTTCATCCAATTATTTGAAAAGGGTCGAGTTAGTCACTTAGATTGGCTTTTTTCAAATCACAGGCCTATTGAATTCACGTTTCTCTT
TGATGATTCATTTGTGCCTAAGAGAAGAAGAAGACCTTTTAGATTTGAAGAATGTTGGATTAATCTTCCCGAATGTAGGAATATTATTGAAGGAGCTGGAGACTGGGAGT
TGTCTAATGGTCACCATTCCCTTCAAAATAGTTTGAAGATATGTTCGAACAGACTTCTTCCTTGGGGAAGATCCTTATTCAATAAAAGAAAAGTTGAAATAGCAAGATGC
AAACAGATGTTAAAAGAGGCATATGATAAGCTCCCTGAGGCTTGA
Protein sequence
MMVEGAFLSFKNSSRILGLLEVLSEAFCCLFGLCGLLNSLGVEVMDEEGLLKDWGKLRLTEGEKKTILRKEEVILCSISDHLEGTRTGLLKQALGFFDKFLIVLETPVED
WKPSDYKFNFASFWVRIMDLPLGFQTKMMALKIGDAIGVCEDFDGDLEEVCWGENLRIKIRIDISQPLRKGILIQSENYPKGRWFNLCYERLSDFCFGCGRIGHLLKECL
ESKEREIGGEEEKKWRFTGVYGYPEAYNKHLTWSLLRRIQGEMNEPWLVGGDLNEICWDKEKFGGPLKDSRLLENFREVLDDCKLVDLGFEGSPFTLHGVRRNAGIWERL
DRFVGTIEFIQLFEKGRVSHLDWLFSNHRPIEFTFLFDDSFVPKRRRRPFRFEECWINLPECRNIIEGAGDWELSNGHHSLQNSLKICSNRLLPWGRSLFNKRKVEIARC
KQMLKEAYDKLPEA
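
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS codon by codon. The sketch below is a minimal illustration that assumes the standard genetic code (NCBI translation table 1); the record does not say which table was used for the annotation. `translate` is a hypothetical helper, and only the first 21 and last 15 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGATGGTTGAGGGAGCTTTC"  # first 21 bases of the CDS above
cds_end = "CTCCCTGAGGCTTGA"          # last 15 bases of the CDS above

print(translate(cds_start))  # MMVEGAF
print(translate(cds_end))    # LPEA
```

Both fragments agree with the start (`MMVEGAF...`) and end (`...LPEA`) of the protein sequence listed above.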