; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g10470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g10470
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:7736220..7738131
RNA-Seq ExpressionMoc04g10470
SyntenyMoc04g10470
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa]1.0e-3331.6Show/hide
Query:  VGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGR
        +GF   + P R+L    G PLL+  L   DC PL++RI S +RSW+A+VLSF G LQL+R VL+S Q+YWASVF+LPA + ++V++ILRS+LW+G+E GR
Subjt:  VGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGR

Query:  SGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWRC-----VIGLL---HGIAYVVGILPFLFPG-------------------------------------
         G KVAW +VC+P  +GG       I   P W       ++ L+    G  +V  +  ++  G                                     
Subjt:  SGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWRC-----VIGLL---HGIAYVVGILPFLFPG-------------------------------------

Query:  -------------------------------------------------VFCAGFESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTELAWICHISA
                                                         ++  G ESR+HLFF C F  +VWS ++    S +R+ +W  EL+WICH   
Subjt:  -------------------------------------------------VFCAGFESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTELAWICHISA

Query:  GKSARRQ
        GK  RR+
Subjt:  GKSARRQ

XP_022157473.1 uncharacterized protein LOC111024165 [Momordica charantia]1.8e-4347.16Show/hide
Query:  RIGSRVGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKG
        R+ + +GF +     R+L V+    LLS  +SH DC+PLLERI+  VR+WSA++LSF   L LIRLV QSFQ+YWASVF+LPAR++HDVE+IL SFLWKG
Subjt:  RIGSRVGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKG

Query:  REIGRSGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWRCVIGLLHGIAYVVGILP--------------------------FLFPGVFCAGFESRNHLFF
         E G SG KVAW E+       GL     ++  L   R  +     + +  G +P                           +   VFC G ES ++LFF
Subjt:  REIGRSGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWRCVIGLLHGIAYVVGILP--------------------------FLFPGVFCAGFESRNHLFF

Query:  ECPFSWEVWSGMIAWAGSFHRVSYWSTEL
        ECPFSWEVWS MIAWAGS  R+SYWSTEL
Subjt:  ECPFSWEVWSGMIAWAGSFHRVSYWSTEL

XP_022158861.1 uncharacterized protein LOC111025324 [Momordica charantia]5.8e-3432.69Show/hide
Query:  VLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVA-------------------WYE----------VCIPLRKGGL
        +LSF G LQLI  VLQSFQ+YWASVF+LPAR+VH+VER+LRSFLWKG E G SGAKVA                   +YE          V +    GGL
Subjt:  VLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVA-------------------WYE----------VCIPLRKGGL

Query:  VGT---------FQSIF-------SLPGW-------------------------RCVIGLLHGIA--YVVGILPFLFPG---------------------
          T         F SIF         PG+                         RC   +++ +A   +  ++ FL P                      
Subjt:  VGT---------FQSIF-------SLPGW-------------------------RCVIGLLHGIA--YVVGILPFLFPG---------------------

Query:  ---------------------------------------------------------------------------------VFCAGFESRNHLFFECPFS
                                                                                         VFCAG ESR+HLF +CP+S
Subjt:  ---------------------------------------------------------------------------------VFCAGFESRNHLFFECPFS

Query:  WEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQ-WCLAWTSTVSFIWRDRNARVH
          VW+GMI+WAGS HRVSYWSTEL WICH++ G S+RR  W LAWT TVS +WR+ N R+H
Subjt:  WEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQ-WCLAWTSTVSFIWRDRNARVH

XP_031737043.1 uncharacterized protein LOC116402131 [Cucumis sativus]7.1e-4027.95Show/hide
Query:  YATNFPSEQVSLWESLASICSSWL-ILLSLVCLGHGLLGRIGSRVGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSL
        Y+ +F  E +  +  L+ + ++     + LV +      R+ + +GF + H P R+L    G PLL   L   DC PL++RI S +RSWSA+VLSF G L
Subjt:  YATNFPSEQVSLWESLASICSSWL-ILLSLVCLGHGLLGRIGSRVGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSL

Query:  QLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVAWYEVCIPLRKGGL----------VGTFQSIF-------------------
        QL+R VL+S Q+YWASVF+LP ++  DV++ILRS+LW+G+E GR GAKVAW EVC+P  +GGL            T + ++                   
Subjt:  QLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVAWYEVCIPLRKGGL----------VGTFQSIF-------------------

Query:  ---SLPGW--------RCVIG-----------------------------------------------------------------LLHGIAYVVGILP-
           S+ GW         CV G                                                                 L+    ++ G+ P 
Subjt:  ---SLPGW--------RCVIG-----------------------------------------------------------------LLHGIAYVVGILP-

Query:  -------FLFPG-----------------------------------------------------------------VFCAG-FESRNHLFFECPFSWEV
                  PG                                                                 + C G +ESR+HLFF CPF WE+
Subjt:  -------FLFPG-----------------------------------------------------------------VFCAG-FESRNHLFFECPFSWEV

Query:  WSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQ-WCLAWTSTVSFIWRDRNARVH
        WS ++ +  S HR+ YW  EL+WIC+   GK  RR+ W L W +T+ FIW++RN  +H
Subjt:  WSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQ-WCLAWTSTVSFIWRDRNARVH

XP_031740424.1 uncharacterized protein LOC116403425 [Cucumis sativus]2.8e-3640.89Show/hide
Query:  VGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGR
        +GF + H P R+L    G PLLS  L   DC PL++RI S +RSWSA+VLSF G LQL+R VL+S Q+YWASVF+LP ++  DV++ILR++LW+G E GR
Subjt:  VGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGR

Query:  SGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWRCVIGLL---HGIAYVVGILPFLFPGVFCAGFESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTE
         GAKVAW EVC+P  +GGL     S +++     ++ LL    G  +++G     +  +   G  S +  +    +   VW  +I  AG     S W   
Subjt:  SGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWRCVIGLL---HGIAYVVGILPFLFPGVFCAGFESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTE

Query:  LAWICH-------ISAGKSARRQ-WCLAWTSTVSFIWRDRNARVHEV
        L            I   KS RR+ W L W +T+ FIW++RN R+H V
Subjt:  LAWICH-------ISAGKSARRQ-WCLAWTSTVSFIWRDRNARVHEV

TrEMBL top hitse value%identityAlignment
A0A5A7TZS0 Reverse transcriptase domain-containing protein5.3e-3329.83Show/hide
Query:  DCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVAWYEVCIPLRKGGL----------
        DC PL++RI S +RSW+A+VLSF G LQL+R VL+S Q+YWASVF+LPA + ++V++ILRS+LW+G+E GR G KVAW +VC+P  +GGL          
Subjt:  DCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVAWYEVCIPLRKGGL----------

Query:  ----------VGT----FQSIFSLPG-------------W--RCVIGLLHGIAYVVG------------------ILP---FLFPGV-------------
                  +G+    +   + L G             W  R ++     + + VG                  I P   +L+P V             
Subjt:  ----------VGT----FQSIFSLPG-------------W--RCVIGLLHGIAYVVG------------------ILP---FLFPGV-------------

Query:  -----------------------------------------------------FCA------------------------------GFESRNHLFFECPF
                                                             FCA                              G ESR+HLFF CPF
Subjt:  -----------------------------------------------------FCA------------------------------GFESRNHLFFECPF

Query:  SWEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQ-WCLAWTSTVSFIWRDRNARVH
          +VWS +     S HR+ +W  EL+WICH   GK  RR+ W + W +T+ FIW +RN R+H
Subjt:  SWEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQ-WCLAWTSTVSFIWRDRNARVH

A0A5A7V2L9 Putative reverse transcriptase6.3e-3430.79Show/hide
Query:  IGSRVGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGR
        + + +GF L + P R+L    G PLLS  L   DC PL++RI S +RSW A+V SF G LQLIR VL+S Q++WASVF+LPA + + V++ILRS+LW+ +
Subjt:  IGSRVGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGR

Query:  EIGRSGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWR--------------------------------------------CVIGLL-------------
        E GR G KVAW EVC+P  +GGL     +I   P W                                             C+  +L             
Subjt:  EIGRSGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWR--------------------------------------------CVIGLL-------------

Query:  --HGIAYVVGILPFLFPGVF---------------------------------------------------CAGFESRN--HLFFECPF---SWEVWSGM
          HG    V + P+L  G                                                     C  F  R    L   C F     +VWS +
Subjt:  --HGIAYVVGILPFLFPGVF---------------------------------------------------CAGFESRN--HLFFECPF---SWEVWSGM

Query:  IAWAGSFHRVSYWSTELAWICHISAGKSARRQ-WCLAWTSTVSFIWRDRNARVH
        +    S HR+ +W  EL+WICH   GK  RR+ W + W +T+ +IW +RN R+H
Subjt:  IAWAGSFHRVSYWSTELAWICHISAGKSARRQ-WCLAWTSTVSFIWRDRNARVH

A0A5D3DXE4 Reverse transcriptase domain-containing protein4.8e-3431.6Show/hide
Query:  VGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGR
        +GF   + P R+L    G PLL+  L   DC PL++RI S +RSW+A+VLSF G LQL+R VL+S Q+YWASVF+LPA + ++V++ILRS+LW+G+E GR
Subjt:  VGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGR

Query:  SGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWRC-----VIGLL---HGIAYVVGILPFLFPG-------------------------------------
         G KVAW +VC+P  +GG       I   P W       ++ L+    G  +V  +  ++  G                                     
Subjt:  SGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWRC-----VIGLL---HGIAYVVGILPFLFPG-------------------------------------

Query:  -------------------------------------------------VFCAGFESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTELAWICHISA
                                                         ++  G ESR+HLFF C F  +VWS ++    S +R+ +W  EL+WICH   
Subjt:  -------------------------------------------------VFCAGFESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTELAWICHISA

Query:  GKSARRQ
        GK  RR+
Subjt:  GKSARRQ

A0A6J1DTG0 uncharacterized protein LOC1110241658.8e-4447.16Show/hide
Query:  RIGSRVGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKG
        R+ + +GF +     R+L V+    LLS  +SH DC+PLLERI+  VR+WSA++LSF   L LIRLV QSFQ+YWASVF+LPAR++HDVE+IL SFLWKG
Subjt:  RIGSRVGFPLTHEPERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKG

Query:  REIGRSGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWRCVIGLLHGIAYVVGILP--------------------------FLFPGVFCAGFESRNHLFF
         E G SG KVAW E+       GL     ++  L   R  +     + +  G +P                           +   VFC G ES ++LFF
Subjt:  REIGRSGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGWRCVIGLLHGIAYVVGILP--------------------------FLFPGVFCAGFESRNHLFF

Query:  ECPFSWEVWSGMIAWAGSFHRVSYWSTEL
        ECPFSWEVWS MIAWAGS  R+SYWSTEL
Subjt:  ECPFSWEVWSGMIAWAGSFHRVSYWSTEL

A0A6J1E271 uncharacterized protein LOC1110253242.8e-3432.69Show/hide
Query:  VLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVA-------------------WYE----------VCIPLRKGGL
        +LSF G LQLI  VLQSFQ+YWASVF+LPAR+VH+VER+LRSFLWKG E G SGAKVA                   +YE          V +    GGL
Subjt:  VLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVA-------------------WYE----------VCIPLRKGGL

Query:  VGT---------FQSIF-------SLPGW-------------------------RCVIGLLHGIA--YVVGILPFLFPG---------------------
          T         F SIF         PG+                         RC   +++ +A   +  ++ FL P                      
Subjt:  VGT---------FQSIF-------SLPGW-------------------------RCVIGLLHGIA--YVVGILPFLFPG---------------------

Query:  ---------------------------------------------------------------------------------VFCAGFESRNHLFFECPFS
                                                                                         VFCAG ESR+HLF +CP+S
Subjt:  ---------------------------------------------------------------------------------VFCAGFESRNHLFFECPFS

Query:  WEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQ-WCLAWTSTVSFIWRDRNARVH
          VW+GMI+WAGS HRVSYWSTEL WICH++ G S+RR  W LAWT TVS +WR+ N R+H
Subjt:  WEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQ-WCLAWTSTVSFIWRDRNARVH

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657508.0e-1031.09Show/hide
Query:  PLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVAWYEVCIPLRKGGL
        P+L   ++      +LER+ S +  W  + LSF G L L + VL S  ++  S  LLP  I++ ++++ R+FLW      +    V W +VC P ++GGL
Subjt:  PLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVAWYEVCIPLRKGGL

Query:  -----VGTFQSIFSLPGWR
                 +++ S  GWR
Subjt:  -----VGTFQSIFSLPGWR

Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.9e-0538.36Show/hide
Query:  ESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQWCLAWTSTVSFIWRDRNARVH
        ESR HLFFECPF   VW      A  F         L W+ + S  K+      LA+ + V  IWR+RN  +H
Subjt:  ESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQWCLAWTSTVSFIWRDRNARVH

AT1G45063.1 copper ion binding;electron carriers1.2e-0536.84Show/hide
Query:  ESRNHLFFECPFSWEVWSGMIAWAGSFHRVS---YWSTELAWICHISAGKSARRQWCLAWTSTVSFIWRDRNARVH
        E+R H+FF+CPFS EVWS   + A    RV+    +     W+ H    K       LA+ ++V  IWR+RN R++
Subjt:  ESRNHLFFECPFSWEVWSGMIAWAGSFHRVS---YWSTELAWICHISAGKSARRQWCLAWTSTVSFIWRDRNARVH

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.0e-0431.82Show/hide
Query:  FLFPG--VFC-AGFESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQWCLAWTSTVSFIWRDRNARVHE
        F+FP   +FC    E+R HLFF+C F+ EVW    +    F  + +    + W+ +    K+      L+  ++V  IW++RNAR+H+
Subjt:  FLFPG--VFC-AGFESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQWCLAWTSTVSFIWRDRNARVHE

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.3e-1842.34Show/hide
Query:  PERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVAWY
        P R+L    G PLL+  ++  D  PL+E+I   +  W+A+ LSF G LQLI  V+ S   +W S F LP+  + +++ I  SFLW G E+    AKVAW 
Subjt:  PERFLFVISGFPLLSLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVAWY

Query:  EVCIPLRKGGL
        +VC P  +GGL
Subjt:  EVCIPLRKGGL

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.8e-0438.36Show/hide
Query:  ESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQWCLAWTSTVSFIWRDRNARVH
        +SR HLFFEC FS  VW    A + + +  +     L W+   S  K+      LA+ S V  IWR+RN R+H
Subjt:  ESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQWCLAWTSTVSFIWRDRNARVH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGTGTGGGTTCTAAAGGTTCCTCAGGTGTCGACTGAGGCTCCGAAGGTTAGTGGACCTTCTTTGGATTCGCTAGTGTCGGTTCCTTTAGTTTTGGATAAGGAGCC
TGTGGTTCCAGATAGGATGCCTGTAGTTTCAAATCTAGGTCCAAATCGACCTGTTGTGGGTGGAAATCGTTTTGCTGCATTGGCGTCTTGTGAGGATGTTATTGAGGATG
CGGCTGACATAGGTCAAGTGTTGTGTCCAGACGTGCTAAACAGCGGCTGCGGGTTCTCTGAGTACGAGAGCGGCTTCCTCATCCCCTCCTGTTTACCATGTCTAGTTGGT
GTTCATGGAATAGCGGATTTGGGTGTTGTGGCTCCGGACCGTTATGCTTTTACTCCACTTGAGCTTTCTTCCCCATTTGTGCATGGGGTGGTCCTTGACAAGGCCTCTCA
GATTTCAATTCACGTTCTTTGTGTGTATGCAACGAATTTTCCTTCCGAGCAGGTTTCTCTTTGGGAATCTTTGGCGTCTATTTGTTCGTCGTGGCTGATCTTGTTGAGCC
TCGTGTGTCTGGGCCATGGTTTACTTGGACGAATAGGTTCGCGGGTGGGGTTTCCGCTCACCCACGAACCTGAACGGTTCCTTTTCGTTATCTCTGGGTTTCCACTCCTT
TCGCTTATGCTTTCTCATCGTGATTGTCAGCCTCTGCTTGAACGGATTGTCTCCCTTGTTCGGAGTTGGTCGGCTCAAGTACTTTCTTTTGTTGGCTCGTTGCAGCTTAT
TCGGTTGGTTTTACAGAGTTTTCAGATTTATTGGGCTAGTGTGTTTCTTCTTCCAGCTCGGATTGTCCATGATGTTGAGCGGATTCTGCGTTCTTTCTTGTGGAAGGGTC
GTGAGATCGGGCGGTCAGGGGCTAAGGTGGCGTGGTATGAGGTGTGTATTCCGTTGCGGAAAGGGGGTTTAGTGGGAACATTTCAAAGCATTTTTTCATTACCTGGTTGG
CGGTGCGTGATCGGCTTACTACACGGGATCGCTTATGTCGTTGGGATTCTTCCGTTCTTGTTTCCTGGTGTGTTTTGTGCTGGCTTTGAGTCTCGGAATCATCTTTTCTT
TGAGTGCCCTTTTAGTTGGGAGGTTTGGTCTGGAATGATTGCCTGGGCTGGTTCTTTTCACCGAGTTTCATATTGGTCTACTGAGCTTGCTTGGATCTGTCACATTAGTG
CTGGGAAGTCTGCTCGTCGTCAGTGGTGTTTGGCTTGGACTTCGACTGTCTCCTTTATTTGGAGGGACCGTAATGCTAGAGTTCATGAGGTGGGCTGGGCTGGTCGCCTT
CTACCTTCCTACGCACTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGGTGTGGGTTCTAAAGGTTCCTCAGGTGTCGACTGAGGCTCCGAAGGTTAGTGGACCTTCTTTGGATTCGCTAGTGTCGGTTCCTTTAGTTTTGGATAAGGAGCC
TGTGGTTCCAGATAGGATGCCTGTAGTTTCAAATCTAGGTCCAAATCGACCTGTTGTGGGTGGAAATCGTTTTGCTGCATTGGCGTCTTGTGAGGATGTTATTGAGGATG
CGGCTGACATAGGTCAAGTGTTGTGTCCAGACGTGCTAAACAGCGGCTGCGGGTTCTCTGAGTACGAGAGCGGCTTCCTCATCCCCTCCTGTTTACCATGTCTAGTTGGT
GTTCATGGAATAGCGGATTTGGGTGTTGTGGCTCCGGACCGTTATGCTTTTACTCCACTTGAGCTTTCTTCCCCATTTGTGCATGGGGTGGTCCTTGACAAGGCCTCTCA
GATTTCAATTCACGTTCTTTGTGTGTATGCAACGAATTTTCCTTCCGAGCAGGTTTCTCTTTGGGAATCTTTGGCGTCTATTTGTTCGTCGTGGCTGATCTTGTTGAGCC
TCGTGTGTCTGGGCCATGGTTTACTTGGACGAATAGGTTCGCGGGTGGGGTTTCCGCTCACCCACGAACCTGAACGGTTCCTTTTCGTTATCTCTGGGTTTCCACTCCTT
TCGCTTATGCTTTCTCATCGTGATTGTCAGCCTCTGCTTGAACGGATTGTCTCCCTTGTTCGGAGTTGGTCGGCTCAAGTACTTTCTTTTGTTGGCTCGTTGCAGCTTAT
TCGGTTGGTTTTACAGAGTTTTCAGATTTATTGGGCTAGTGTGTTTCTTCTTCCAGCTCGGATTGTCCATGATGTTGAGCGGATTCTGCGTTCTTTCTTGTGGAAGGGTC
GTGAGATCGGGCGGTCAGGGGCTAAGGTGGCGTGGTATGAGGTGTGTATTCCGTTGCGGAAAGGGGGTTTAGTGGGAACATTTCAAAGCATTTTTTCATTACCTGGTTGG
CGGTGCGTGATCGGCTTACTACACGGGATCGCTTATGTCGTTGGGATTCTTCCGTTCTTGTTTCCTGGTGTGTTTTGTGCTGGCTTTGAGTCTCGGAATCATCTTTTCTT
TGAGTGCCCTTTTAGTTGGGAGGTTTGGTCTGGAATGATTGCCTGGGCTGGTTCTTTTCACCGAGTTTCATATTGGTCTACTGAGCTTGCTTGGATCTGTCACATTAGTG
CTGGGAAGTCTGCTCGTCGTCAGTGGTGTTTGGCTTGGACTTCGACTGTCTCCTTTATTTGGAGGGACCGTAATGCTAGAGTTCATGAGGTGGGCTGGGCTGGTCGCCTT
CTACCTTCCTACGCACTTTGA
Protein sequenceShow/hide protein sequence
MQVWVLKVPQVSTEAPKVSGPSLDSLVSVPLVLDKEPVVPDRMPVVSNLGPNRPVVGGNRFAALASCEDVIEDAADIGQVLCPDVLNSGCGFSEYESGFLIPSCLPCLVG
VHGIADLGVVAPDRYAFTPLELSSPFVHGVVLDKASQISIHVLCVYATNFPSEQVSLWESLASICSSWLILLSLVCLGHGLLGRIGSRVGFPLTHEPERFLFVISGFPLL
SLMLSHRDCQPLLERIVSLVRSWSAQVLSFVGSLQLIRLVLQSFQIYWASVFLLPARIVHDVERILRSFLWKGREIGRSGAKVAWYEVCIPLRKGGLVGTFQSIFSLPGW
RCVIGLLHGIAYVVGILPFLFPGVFCAGFESRNHLFFECPFSWEVWSGMIAWAGSFHRVSYWSTELAWICHISAGKSARRQWCLAWTSTVSFIWRDRNARVHEVGWAGRL
LPSYAL