CuGenDBv2

Lag0007777 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007777
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr9:4569694..4571456
RNA-Seq Expression: Lag0007777
Synteny: Lag0007777
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
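
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the string can be split into its parts and the span computed as shown below. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval in base pairs.

    Assumption: coordinates are 1-based and inclusive, so both
    endpoints count toward the length.
    """
    return abs(end - start) + 1


chrom, start, end = parse_location("chr9:4569694..4571456")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans more bases than the CDS listed further down contains, so the two lengths should not be expected to match.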


Homology
GenBank top hits (e value, % identity, alignment)
EAZ08016.1 hypothetical protein OsI_30282 [Oryza sativa Indica Group]; e value 2.1e-31; identity 27.39%
Query:  VNEGKAKEICRELGVKRSDSLGHYLGLPA--------QFGRNKGVLVG-----DEFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNL
        VN    ++I  + G+++ D L  YL L              ++G + G     ++ HW+SW+ +C  K +GGIG+RDL LFN  ML++  WR++  P +L
Subjt:  VNEGKAKEICRELGVKRSDSLGHYLGLPA--------QFGRNKGVLVG-----DEFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNL

Query:  MAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQDPWLMNEGRWIPISVKEDLKGKRV---CDPQNGGSTEDFWCGAF
         A+VL+ KYF +G+ + V+++   S  WRS + G    + G  WRVG+G  + I  DPWL +     PI+ +      +V    DP  G   ++   G F
Subjt:  MAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQDPWLMNEGRWIPISVKEDLKGKRV---CDPQNGGSTEDFWCGAF

Query:  KLLD-KQELTLATQ----------------IIWAIWTAKAK-------------ETHPSH-----------------------SRWKPSDPNCWKINCDA
           D KQ LT+  +                ++W  W+A+ K             E H  +                       + WK   P   KIN D 
Subjt:  KLLD-KQELTLATQ----------------IIWAIWTAKAK-------------ETHPSH-----------------------SRWKPSDPNCWKINCDA

Query:  AWFESSRSGGIGWVVRDSAGSLVVASCKKIDRKGSIKIIESKAIIEGLLNLTGSDREVGRLSNPLIVEFDCVDLIKFLNDEVDDQSKFSVLIDEVKEI
        A+ E SR G  G+V+R   G  V+A   ++       + E++A +  L      D  + R    +I+E DC++L+  L  +  D+S   ++  E++ I
Subjt:  AWFESSRSGGIGWVVRDSAGSLVVASCKKIDRKGSIKIIESKAIIEGLLNLTGSDREVGRLSNPLIVEFDCVDLIKFLNDEVDDQSKFSVLIDEVKEI

KAF7138509.1 hypothetical protein RHSIM_Rhsim07G0140700 [Rhododendron simsii]; e value 1.3e-36; identity 31.15%
Query:  SKNVNEGKAKEICRELGVKRSDSLGHYLGLPAQFGRNK-------------------------GVLVGDEFHWLSWKKICVAKTRGGIGFRDLELFNKVM
        S N   G    IC +LG++R  +LG+YLGLPA  G++K                          V    + HW++W K+C  K+ GG+GFRDL  FN  +
Subjt:  SKNVNEGKAKEICRELGVKRSDSLGHYLGLPAQFGRNK-------------------------GVLVGDEFHWLSWKKICVAKTRGGIGFRDLELFNKVM

Query:  LSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQDPWLMNEGRWIPISVKE---DLKGKRVCD
        L +  WRIL     L+ ++LK +YF + +FL  K     S  WRS L G+ +   G RWRVGNG++V +  DPW+  +GR+ PI V E   ++    + D
Subjt:  LSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQDPWLMNEGRWIPISVKE---DLKGKRVCD

Query:  PQNGGSTEDFWCGAFKLLDKQELTLATQIIWAIWTAKAKETHPSHSRWKPSDPNCWKINCDAAWF--ESSRSGGIGWVVRDSAGSLVVASCKKIDRKGSI
         +      D     F   D +E   AT+    +   +A+    +  +W P      K+N DAA    +     G G V+R   G  V A    +    S+
Subjt:  PQNGGSTEDFWCGAFKLLDKQELTLATQIIWAIWTAKAKETHPSHSRWKPSDPNCWKINCDAAWF--ESSRSGGIGWVVRDSAGSLVVASCKKIDRKGSI

Query:  KIIESKAIIEGLL--NLTGSD
           E+ A  + L   +L G D
Subjt:  KIIESKAIIEGLL--NLTGSD

XP_022145148.1 uncharacterized protein LOC111014662 [Momordica charantia]; e value 2.3e-33; identity 34.5%
Query:  MCMISKNVNEGKAKEICRELGVKRSDSLGHYLGLPAQFGRNKGVLVGD----------------------------------------------------
        M M+S+N +   A  I  EL V R++ +G YLGLP+Q  RNK  +  +                                                    
Subjt:  MCMISKNVNEGKAKEICRELGVKRSDSLGHYLGLPAQFGRNKGVLVGD----------------------------------------------------

Query:  ---------------EFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFE
                       + HW SWK++CV+K +GG+GF+DL +FN+ ML+K SW+I+K P++L+ +VL+GKYF  GNF+T +     S +WRS LWG+ LF 
Subjt:  ---------------EFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFE

Query:  AGYRWRVGNGKRVYINQDPWLMNEGRWIP
         G RWR+GNG  V I  DPW++ EG  +P
Subjt:  AGYRWRVGNGKRVYINQDPWLMNEGRWIP

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]; e value 1.3e-31; identity 51.3%
Query:  EFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYI
        + HW SWK +C+ K  GG+GFRD+ +FN+ ML+K SWRIL+ P +L+AK L+GKYF +G+FL  K   + S  WRS LWG+ LF+ GYRW+VGNG  + +
Subjt:  EFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYI

Query:  NQDPWLMNEGRWIPI
        + DPWL  +G + P+
Subjt:  NQDPWLMNEGRWIPI

XP_024178198.1 uncharacterized protein LOC112184155 [Rosa chinensis]; e value 2.4e-30; identity 27.06%
Query:  EFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYI
        + HW++W+K+C +K  GG+GFRD+ LFN  +L+K  WR++  P +L+A+VLK +YF   +F+  +     S  WRS L G+ L   G R++VGNG+ + +
Subjt:  EFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYI

Query:  NQDPWLMNEGRWIPISV----KEDLKGKRVCDPQNGG----------------STEDFW-CGAFKLLDKQELTLATQIIWAIWTAK--------------
          DPW+    R+ P S+     E+ K     D + G                 +  D W     + LD ++L      +W +W  +              
Subjt:  NQDPWLMNEGRWIPISV----KEDLKGKRVCDPQNGG----------------STEDFW-CGAFKLLDKQELTLATQIIWAIWTAK--------------

Query:  -----------AKETHPSHSRWKPSDPNCW--------KINCDAAWFESSRSGGIGWVVRDSAGSLVVASCKKIDRKGSIKIIESKAIIEGLLNLTGSDR
                    K  H          P  W        KIN D ++   + SGG+G VVR+  G+ +    + +    S   +E++A+  GLL     D 
Subjt:  -----------AKETHPSHSRWKPSDPNCW--------KINCDAAWFESSRSGGIGWVVRDSAGSLVVASCKKIDRKGSIKIIESKAIIEGLLNLTGSDR

Query:  EVGRLSNPLIVEFDCVDLIKFLNDEVDDQSKFSVLIDEVK
        +       + VE DCV+L+  L+ + DD S    ++D+ +
Subjt:  EVGRLSNPLIVEFDCVDLIKFLNDEVDDQSKFSVLIDEVK

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CV63 uncharacterized protein LOC111014662; e value 1.1e-33; identity 34.5%
Query:  MCMISKNVNEGKAKEICRELGVKRSDSLGHYLGLPAQFGRNKGVLVGD----------------------------------------------------
        M M+S+N +   A  I  EL V R++ +G YLGLP+Q  RNK  +  +                                                    
Subjt:  MCMISKNVNEGKAKEICRELGVKRSDSLGHYLGLPAQFGRNKGVLVGD----------------------------------------------------

Query:  ---------------EFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFE
                       + HW SWK++CV+K +GG+GF+DL +FN+ ML+K SW+I+K P++L+ +VL+GKYF  GNF+T +     S +WRS LWG+ LF 
Subjt:  ---------------EFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFE

Query:  AGYRWRVGNGKRVYINQDPWLMNEGRWIP
         G RWR+GNG  V I  DPW++ EG  +P
Subjt:  AGYRWRVGNGKRVYINQDPWLMNEGRWIP

A0A6J1DRA0 uncharacterized protein LOC111022423; e value 6.1e-32; identity 51.3%
Query:  EFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYI
        + HW SWK +C+ K  GG+GFRD+ +FN+ ML+K SWRIL+ P +L+AK L+GKYF +G+FL  K   + S  WRS LWG+ LF+ GYRW+VGNG  + +
Subjt:  EFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYI

Query:  NQDPWLMNEGRWIPI
        + DPWL  +G + P+
Subjt:  NQDPWLMNEGRWIPI

A0A803PA86 Uncharacterized protein; e value 5.0e-34; identity 26.58%
Query:  HWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQ
        HW+SW+++C  KT GG+GFR+L  FN  ML K  WR+L  P++L+A++ K +YFS+G FLT +     S +WRS L  Q L  +G RW VG+GK + +  
Subjt:  HWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQ

Query:  DPWLMNEGRWIPISVKEDLKGKRVCD-----------------------------PQNGGSTEDF-----------------------------------
        DPWL ++     IS    L   +V +                             P N  +++D                                    
Subjt:  DPWLMNEGRWIPISVKEDLKGKRVCD-----------------------------PQNGGSTEDF-----------------------------------

Query:  WC-GAFKLLDKQELTLATQIIWAIWTAKA-------------------------KETHPS--------------HSRWKPSDPNCWKINCDAAWFESSRS
        WC   F+ ++K++  L   + WAIW A+                          K  H S                 W P   N  K+N DAA FE+SR 
Subjt:  WC-GAFKLLDKQELTLATQIIWAIWTAKA-------------------------KETHPS--------------HSRWKPSDPNCWKINCDAAWFESSRS

Query:  GGIGWVVRDSAGSLVVASCKKIDRKGSIKIIESKAIIEGLLNLTGSDREVGRLSNPLIVEFDCVDLIKFLNDEVDDQSKFSVLIDEVKEIACSFR
         G+GWV RD+ G L+    K  + + + ++ E+  I E L  +  S  +       +++E DC+ +++ L   +   S F  +++E K +    R
Subjt:  GGIGWVVRDSAGSLVVASCKKIDRKGSIKIIESKAIIEGLLNLTGSDREVGRLSNPLIVEFDCVDLIKFLNDEVDDQSKFSVLIDEVKEIACSFR

A0A803PM23 Uncharacterized protein; e value 3.3e-38; identity 26.51%
Query:  MCMISKNVNEGKAKEICRELGVKRSDSLGHYLGLPAQFG------------RNKGVLVG------------------------DEFHWLSWKKICVAKTR
        +C+ SK + + +   +  E+GVK  +    YLGLPA  G            +N+  L G                        ++ HW  W K+C  K +
Subjt:  MCMISKNVNEGKAKEICRELGVKRSDSLGHYLGLPAQFG------------RNKGVLVG------------------------DEFHWLSWKKICVAKTR

Query:  GGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQDPWL----------
        GG+GF++LE FN+ +L+K  W+I+  PH+L+A+VLK  Y+++ NFL  K     S +WRS LWG+ + + G RWRV +G+ V IN+D WL          
Subjt:  GGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQDPWL----------

Query:  ---MNEG--------RWIPISVKEDLKGKRV-------CDPQNGGSTEDFWCGAFKLLDKQELTLATQIIWAIWTAKAKETHPS-HSRWKPSDPNCWKIN
           +++G         + P  +     G RV        +      T  +W   +       L +  ++   IW     E +P    RW+P  P  + +N
Subjt:  ---MNEG--------RWIPISVKEDLKGKRV-------CDPQNGGSTEDFWCGAFKLLDKQELTLATQIIWAIWTAKAKETHPS-HSRWKPSDPNCWKIN

Query:  CDAAWFESSRSGGIGWVVRDSAGSLVVASCKKIDRKGSIKIIESKAIIEGLLNLTGSDREVGRLSNPLIVEFDCVDLIKFLNDEVDDQSKFSVLIDEVKE
         DA+  ++  +GG+G ++R+  G +V A  ++     S+++ E  A+  G+         +  ++ P I++ DC+ ++ +LN     ++ +S L+D+++E
Subjt:  CDAAWFESSRSGGIGWVVRDSAGSLVVASCKKIDRKGSIKIIESKAIIEGLLNLTGSDREVGRLSNPLIVEFDCVDLIKFLNDEVDDQSKFSVLIDEVKE

Query:  IACSFRGCLLFRFLS
            F  CL  + +S
Subjt:  IACSFRGCLLFRFLS

A0A803QNS7 Uncharacterized protein; e value 6.7e-39; identity 29.09%
Query:  DEFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVY
        ++ HW  W KIC  K +GG+GF++LE FN+ +L+K  W+I+  PH+L+A+VLK  Y+++ NFL  K     S +WRS LWG+ + + G RWRV +G+ V 
Subjt:  DEFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVY

Query:  INQDPWLMNEGRW---IPISVKEDLKGKRVCDPQN-GGSTEDFWCGAFKL--------LDKQELTLATQIIWAIWTAKAKETHPSH--------------
        IN+D WL     +    P+ V +     ++ D Q  GG+    W    KL          +  L L    +WA W     E   +H              
Subjt:  INQDPWLMNEGRW---IPISVKEDLKGKRVCDPQN-GGSTEDFWCGAFKL--------LDKQELTLATQIIWAIWTAKAKETHPSH--------------

Query:  -SRWKPSDPNCWKINCDAAWFESSRSGGIGWVVRDSAGSLVVASCKKIDRKGSIKIIESKAIIEGLLNLTGSDREVGRLSNPLIVEFDCVDLIKFLNDEV
          RW+P  P  + +N DA+  ++  +GG+G ++R+  G +V    ++     S+++ E  A+  G+         +   + P I++ DC+ ++ +LN   
Subjt:  -SRWKPSDPNCWKINCDAAWFESSRSGGIGWVVRDSAGSLVVASCKKIDRKGSIKIIESKAIIEGLLNLTGSDREVGRLSNPLIVEFDCVDLIKFLNDEV

Query:  DDQSKFSVLIDEVKEIACSFRGCLLFRFLS
          ++ ++ L+D+++E    F  CL  + +S
Subjt:  DDQSKFSVLIDEVKEIACSFRGCLLFRFLS

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750; e value 8.3e-10; identity 31.19%
Query:  HWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKY----FSSGNFLTVKQRWKDSIIWRSTLWG-QSLFEAGYRWRVGNGKR
        H + W K+C  K  GG+G R  +  N+ ++SK+ WR+L+  ++L   VL+ KY         +L  K  W  S  WRS   G + +   G  W  G+G++
Subjt:  HWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKY----FSSGNFLTVKQRWKDSIIWRSTLWG-QSLFEAGYRWRVGNGKR

Query:  VYINQDPWL
        +    D W+
Subjt:  VYINQDPWL

P93295 Uncharacterized mitochondrial protein AtMg00310; e value 1.6e-16; identity 34.21%
Query:  WLSWKKICVAK-TRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQ
        W++W+K+C +K   GG+GFRDL  FN+ +L+K S+RI+  PH L++++L+ +YF   + +      + S  WRS + G+ L   G    +G+G    +  
Subjt:  WLSWKKICVAK-TRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQ

Query:  DPWLMNEGRWIPIS
        D W+M+E    P++
Subjt:  DPWLMNEGRWIPIS

Arabidopsis top hits (e value, % identity, alignment)
AT4G29090.1 Ribonuclease H-like superfamily protein; e value 1.7e-18; identity 39.13%
Query:  RNKGVLVGDEFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWR
        RNK    G   HW +W  +   K  GGIGF+D+E FN  +L K  WR+L  P +LMAKV K +YF   + L      + S +W+S    Q +   G R  
Subjt:  RNKGVLVGDEFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWR

Query:  VGNGKRVYINQDPWL
        VGNG+ + I +  WL
Subjt:  VGNGKRVYINQDPWL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value 1.1e-17; identity 34.21%
Query:  WLSWKKICVAK-TRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQ
        W++W+K+C +K   GG+GFRDL  FN+ +L+K S+RI+  PH L++++L+ +YF   + +      + S  WRS + G+ L   G    +G+G    +  
Subjt:  WLSWKKICVAK-TRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFLTVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQ

Query:  DPWLMNEGRWIPIS
        D W+M+E    P++
Subjt:  DPWLMNEGRWIPIS
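
Each alignment block above carries an unlabeled middle line between the query and subject rows. Assuming it follows the usual BLAST convention (a letter marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap), the counts for one row can be tallied as in the sketch below. The function name `midline_stats` is hypothetical, and this per-row tally is not claimed to reproduce the % identity values listed for each hit, which cover the whole alignment.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, columns) for one alignment midline.

    Assumption: BLAST-style midline, where letters are identities and
    '+' marks conservative substitutions. 'positive' counts identities
    plus conservative substitutions. The midline must be passed at its
    full width, without stripping leading or trailing spaces.
    """
    identical = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identical, identical + conservative, len(midline)


# Midline of the last row of the XP_022155286.1 alignment
# (query row: NQDPWLMNEGRWIPI).
identical, positive, columns = midline_stats("+ DPWL  +G + P+")
print(f"{identical}/{columns} identical, {positive}/{columns} positive")
```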


Sequences
CDS sequence
ATGTGCATGATCAGTAAGAATGTCAATGAGGGCAAAGCCAAGGAAATTTGCAGGGAGCTAGGTGTGAAGCGATCCGACTCTTTGGGGCACTATCTAGGCCTTCCAGCGCA
ATTCGGGAGAAACAAAGGGGTTTTGGTGGGGGATGAGTTTCATTGGCTTAGTTGGAAGAAGATCTGCGTGGCTAAAACAAGAGGAGGGATAGGTTTTAGGGACCTCGAGC
TTTTCAACAAGGTGATGCTCTCTAAAATCAGCTGGCGAATTCTTAAATTTCCTCACAACCTCATGGCAAAAGTTCTTAAGGGGAAGTATTTTAGTAGTGGCAACTTTCTG
ACGGTGAAGCAAAGGTGGAAAGACTCTATAATATGGAGGAGCACTTTGTGGGGCCAATCCCTCTTTGAAGCTGGATACCGTTGGAGAGTGGGAAACGGTAAACGAGTTTA
CATCAACCAAGATCCGTGGCTTATGAATGAAGGTCGATGGATCCCGATTAGTGTCAAAGAGGACCTCAAGGGCAAAAGAGTTTGTGATCCTCAGAATGGGGGAAGTACTG
AGGACTTCTGGTGTGGGGCTTTCAAATTGCTCGACAAACAAGAGTTAACTCTTGCGACTCAAATTATTTGGGCAATTTGGACGGCCAAGGCGAAGGAGACCCACCCTAGT
CATAGCAGGTGGAAGCCGTCGGATCCCAATTGTTGGAAGATCAACTGTGATGCAGCGTGGTTTGAGTCTTCAAGGTCTGGTGGAATAGGATGGGTTGTTCGTGACTCAGC
TGGTTCTCTTGTTGTAGCAAGCTGCAAGAAAATCGATAGGAAAGGGAGCATCAAAATCATCGAATCAAAGGCAATTATTGAAGGGTTGTTGAATCTCACTGGCTCTGACC
GTGAAGTAGGTAGGCTCTCAAATCCTCTTATTGTGGAGTTTGATTGTGTTGATTTAATCAAGTTTCTGAACGACGAAGTGGATGATCAGTCGAAATTTTCCGTGCTGATT
GATGAGGTTAAAGAGATCGCATGCAGCTTTCGTGGGTGTCTCCTCTTTCGTTTTTTGTCCTACAGTCTGTAA
mRNA sequence
ATGTGCATGATCAGTAAGAATGTCAATGAGGGCAAAGCCAAGGAAATTTGCAGGGAGCTAGGTGTGAAGCGATCCGACTCTTTGGGGCACTATCTAGGCCTTCCAGCGCA
ATTCGGGAGAAACAAAGGGGTTTTGGTGGGGGATGAGTTTCATTGGCTTAGTTGGAAGAAGATCTGCGTGGCTAAAACAAGAGGAGGGATAGGTTTTAGGGACCTCGAGC
TTTTCAACAAGGTGATGCTCTCTAAAATCAGCTGGCGAATTCTTAAATTTCCTCACAACCTCATGGCAAAAGTTCTTAAGGGGAAGTATTTTAGTAGTGGCAACTTTCTG
ACGGTGAAGCAAAGGTGGAAAGACTCTATAATATGGAGGAGCACTTTGTGGGGCCAATCCCTCTTTGAAGCTGGATACCGTTGGAGAGTGGGAAACGGTAAACGAGTTTA
CATCAACCAAGATCCGTGGCTTATGAATGAAGGTCGATGGATCCCGATTAGTGTCAAAGAGGACCTCAAGGGCAAAAGAGTTTGTGATCCTCAGAATGGGGGAAGTACTG
AGGACTTCTGGTGTGGGGCTTTCAAATTGCTCGACAAACAAGAGTTAACTCTTGCGACTCAAATTATTTGGGCAATTTGGACGGCCAAGGCGAAGGAGACCCACCCTAGT
CATAGCAGGTGGAAGCCGTCGGATCCCAATTGTTGGAAGATCAACTGTGATGCAGCGTGGTTTGAGTCTTCAAGGTCTGGTGGAATAGGATGGGTTGTTCGTGACTCAGC
TGGTTCTCTTGTTGTAGCAAGCTGCAAGAAAATCGATAGGAAAGGGAGCATCAAAATCATCGAATCAAAGGCAATTATTGAAGGGTTGTTGAATCTCACTGGCTCTGACC
GTGAAGTAGGTAGGCTCTCAAATCCTCTTATTGTGGAGTTTGATTGTGTTGATTTAATCAAGTTTCTGAACGACGAAGTGGATGATCAGTCGAAATTTTCCGTGCTGATT
GATGAGGTTAAAGAGATCGCATGCAGCTTTCGTGGGTGTCTCCTCTTTCGTTTTTTGTCCTACAGTCTGTAA
Protein sequence
MCMISKNVNEGKAKEICRELGVKRSDSLGHYLGLPAQFGRNKGVLVGDEFHWLSWKKICVAKTRGGIGFRDLELFNKVMLSKISWRILKFPHNLMAKVLKGKYFSSGNFL
TVKQRWKDSIIWRSTLWGQSLFEAGYRWRVGNGKRVYINQDPWLMNEGRWIPISVKEDLKGKRVCDPQNGGSTEDFWCGAFKLLDKQELTLATQIIWAIWTAKAKETHPS
HSRWKPSDPNCWKINCDAAWFESSRSGGIGWVVRDSAGSLVVASCKKIDRKGSIKIIESKAIIEGLLNLTGSDREVGRLSNPLIVEFDCVDLIKFLNDEVDDQSKFSVLI
DEVKEIACSFRGCLLFRFLSYSL
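
The protein sequence can be checked against the CDS by translating codons with the standard genetic code. The following is a minimal sketch: the names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard nuclear code is an assumption, since the record does not name a translation table.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest). Assumption: the standard nuclear code
# applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGTGCATGATCAGTAAGAATGTCAATGAG"))
print(translate("TACAGTCTGTAA"))
```

Passing the full CDS, with its line breaks, should give the protein sequence followed by a trailing `*` for the stop codon.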