; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028655 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028655
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:27570997..27572001
RNA-Seq ExpressionLag0028655
SyntenyLag0028655
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]2.0e-3131.76Show/hide
Query:  SSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRD
        S  SS+S  S W   W   +  K+KIF WR   + LPT  NL +R +    +C  CG + E  +H    CK +K V  NAG      +     L+ +L +
Subjt:  SSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRD

Query:  MRDRLDWDRFEYLVVVLWALWNCRNR--VKLKGEGLSLEIPTWSAGFLESFRRANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEMMFSGVGL
        ++ +      E +VV+ W +W  +N    + K E   L I    A  ++SF+R        +          W  P   W+KINVDAA + +   +G+G+
Subjt:  MRDRLDWDRFEYLVVVLWALWNCRNR--VKLKGEGLSLEIPTWSAGFLESFRRANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEMMFSGVGL

Query:  VVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYREGN
        ++RN  GEV+A A +   F  SS   EA AV+ GIK A   GF P++IETDS+   +     K  + +  ++I+D  +S   S      +T RE N
Subjt:  VVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYREGN

XP_021847414.1 uncharacterized protein LOC110787151 [Spinacia oleracea]5.7e-3432.12Show/hide
Query:  SSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRDMRD
        SS +  S W+  WK+++L +VK+F WR C + LPT   L+KR      VC +C C+ ES +H    CK ++ + C + F  + S ++  SL+    D  D
Subjt:  SSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRDMRD

Query:  R----LDWDRFEYLVVVLWALWNCRNRVKLKGEGLSLEIPTWSAGFLESFRRANLHRVAEVTNGSGREGARWIA----PLVSWYKINVDAAFDSEMMFSG
        R    LD +  E  + + WA+W  RN+  +  EGL+ + PT +  +  +          + + GS   GA W      P   WYK+NVDA    E + SG
Subjt:  R----LDWDRFEYLVVVLWALWNCRNRVKLKGEGLSLEIPTWSAGFLESFRRANLHRVAEVTNGSGREGARWIA----PLVSWYKINVDAAFDSEMMFSG

Query:  VGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYREGNR
        +G V+R+  G V++ A R      ++++AEA AV+ G++L AELG   VV+E+D     +A+RR+    S+ + ++ D +        +  +F  R GN+
Subjt:  VGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYREGNR

Query:  LA
        +A
Subjt:  LA

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]5.1e-5138.89Show/hide
Query:  QSPSSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLL
        Q+PSSSSSE ++ WW G WKM I +K+K+F WRLCLDRLPT  NL+KRGV++ N C  CG  GE ++H+FW+CKF++ +  N+ FG L       S  L+
Subjt:  QSPSSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLL

Query:  LRDMRDRLDWDRFEYLVVVLWALWNCRNRVKLKGE-----GLSLEIPTWSAGFLESFRRANLHRV-AEVTNGSGREGARWIAPLVSWYKINVDAAFDSEM
        LR+  + L    FE L VV+W LWN RN             + +E+  W+  +   FR A  + +   VTN +      W  P    YKIN DA+F +  
Subjt:  LRDMRDRLDWDRFEYLVVVLWALWNCRNRVKLKGE-----GLSLEIPTWSAGFLESFRRANLHRV-AEVTNGSGREGARWIAPLVSWYKINVDAAFDSEM

Query:  MFSGVGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYR
          +G+G+++ N RG+VMA AT+    + S DMAEA A V+G++LA+E+G +P +                +DLS+   ++  A + W  S      F  R
Subjt:  MFSGVGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYR

Query:  EGNRLA
        EGN+ A
Subjt:  EGNRLA

XP_023908235.1 uncharacterized protein LOC112019924 [Quercus suber]3.1e-3232.03Show/hide
Query:  SSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRDM
        SSS       W   WK+ I +K+K+F WR C D LPT  NL K+ + V ++C LCG   ES +H  W C  ++++   +G             + LL  +
Subjt:  SSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRDM

Query:  RDRLDWDRFEYLVVVLWALWNCRNRVKLKGEGLSLE-IPTWSAGFLESFRRANLHRVAEVTNGSGR-----EGARWIAPLVSWYKINVDAAFDSEMMFSG
         DRL+    E  +V  W +WN RNRV   G+ +    +   + G +E F+++      +   G  R     +G RWI P  + YK+N DAA       SG
Subjt:  RDRLDWDRFEYLVVVLWALWNCRNRVKLKGEGLSLE-IPTWSAGFLESFRRANLHRVAEVTNGSGR-----EGARWIAPLVSWYKINVDAAFDSEMMFSG

Query:  VGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSW-PLSW-ALKCAFTYREG
         G VVRN  G+VMA  T     V  S++AE  A    ++ A + GF  +++E DS  A + +    D  S +  ++ D       L W +++C  T R G
Subjt:  VGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSW-PLSW-ALKCAFTYREG

Query:  NRLATI
        NR+A +
Subjt:  NRLATI

XP_030964861.1 uncharacterized protein LOC115986145 [Quercus lobata]3.8e-3032.73Show/hide
Query:  SSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRDMR
        SSS   +  W   WK+ + +K+K+F WR C + LPT DNL +R +   + C LC    E+ +H  W C  +K+V  +              +L L +++ 
Subjt:  SSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRDMR

Query:  DRLDWDRFEYLVVVLWALWNCRNRVKLKGEGLSLEIPTW----SAGFLESFRRANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEMMFSGVGL
        +RL   +FE  +V  W +WN RN V     G  L+ P+W    +  +LE F +A       VT+ SG +   W  P    YK+N DAA  S++  SGVG 
Subjt:  DRLDWDRFEYLVVVLWALWNCRNRVKLKGEGLSLEIPTW----SAGFLESFRRANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEMMFSGVGL

Query:  VVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISD
        V+RN   EVMA  +     V  +  AE  A    ++ A + GF  +VIE D++    ++     DLS L  +I D
Subjt:  VVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISD

TrEMBL top hitse value%identityAlignment
A0A2N9EZZ2 Uncharacterized protein1.4e-3031.77Show/hide
Query:  PSSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLR
        PSSS  +  QS WR  W + I  K ++F W+   + LPT  NL KR + V ++C +CG   E A+H  W CK  ++      F    +    D L  +L 
Subjt:  PSSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLR

Query:  DMRDRLDWDRFEYLVVVLWALWNCRNRVKLKGEGLSL-EIPTWSAGFLESFRRANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEMMFSGVGL
          R        E  +V+ WALW  RN++++  E   + ++ + +  +LE + R N H      N       RW+ P +  YKIN D A   E   +G+G+
Subjt:  DMRDRLDWDRFEYLVVVLWALWNCRNRVKLKGEGLSL-EIPTWSAGFLESFRRANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEMMFSGVGL

Query:  VVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYREGNRLA
        +VR+ +G VMA  T+   F  S    EAWAV   I+ A E+G      E DS+    A+      L+    L++DA +          +   REGNRLA
Subjt:  VVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYREGNRLA

A0A2N9GQG2 Uncharacterized protein2.1e-2933.85Show/hide
Query:  SPSSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLL
        SP SS+S  +Q +W+  W M +  K++ F WRLCL+ LPT DNLAKR +     C +CG   ES++HVF  C F+K V           +  T  L LL+
Subjt:  SPSSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLL

Query:  RDMRDRLDW----------DRFEYLVVVLWALWNCRNRVKLKGEGLSLEIPTWSAGFLESFRRANLHRVAEVTN-------GSGREGARWIAPLVSWYKI
           +  LDW             + + VV W LWN RN    +     ++IP  S  +L++ +   ++  A+V +        +GR+G  W  P   WYK+
Subjt:  RDMRDRLDW----------DRFEYLVVVLWALWNCRNRVKLKGEGLSLEIPTWSAGFLESFRRANLHRVAEVTN-------GSGREGARWIAPLVSWYKI

Query:  NVDAAFDSEMMFSGVGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGF
        N+D A  S++   GVG+V+RN  GE +   +    F   +  AEAWA +  IK  A + F
Subjt:  NVDAAFDSEMMFSGVGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGF

A0A5C7H0P0 Uncharacterized protein1.3e-2829.73Show/hide
Query:  SSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRDMR
        SSS  L  WWR  WK +I +K KIFFW+     LPT   LA+R VDV++ C +C    ES  H+ W C  +  V        +  +        ++  + 
Subjt:  SSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRDMR

Query:  DRLDWDRFEYLVVVLWALWNCRNRVKLKGEG-LSLEIPTWSAGFLESFRRANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEMMFSGVGLVVR
          +D   F  L++  W LW  RN V     G  + ++ TW   F   FR AN     EV +        W  P    +KIN DA+F      +GVG+++R
Subjt:  DRLDWDRFEYLVVVLWALWNCRNRVKLKGEG-LSLEIPTWSAGFLESFRRANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEMMFSGVGLVVR

Query:  NHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYREGNRLA
        +++G  +A  +       S +M EA A ++GI LA ++G   V+IE+D+    + +       ++L  +I  +L        L      RE N +A
Subjt:  NHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYREGNRLA

A0A6J1DAR4 uncharacterized protein LOC1110189542.5e-5138.89Show/hide
Query:  QSPSSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLL
        Q+PSSSSSE ++ WW G WKM I +K+K+F WRLCLDRLPT  NL+KRGV++ N C  CG  GE ++H+FW+CKF++ +  N+ FG L       S  L+
Subjt:  QSPSSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLL

Query:  LRDMRDRLDWDRFEYLVVVLWALWNCRNRVKLKGE-----GLSLEIPTWSAGFLESFRRANLHRV-AEVTNGSGREGARWIAPLVSWYKINVDAAFDSEM
        LR+  + L    FE L VV+W LWN RN             + +E+  W+  +   FR A  + +   VTN +      W  P    YKIN DA+F +  
Subjt:  LRDMRDRLDWDRFEYLVVVLWALWNCRNRVKLKGE-----GLSLEIPTWSAGFLESFRRANLHRV-AEVTNGSGREGARWIAPLVSWYKINVDAAFDSEM

Query:  MFSGVGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYR
          +G+G+++ N RG+VMA AT+    + S DMAEA A V+G++LA+E+G +P +                +DLS+   ++  A + W  S      F  R
Subjt:  MFSGVGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYR

Query:  EGNRLA
        EGN+ A
Subjt:  EGNRLA

A0A803PV25 Uncharacterized protein1.1e-3031.1Show/hide
Query:  SSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQG-ESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRDM
        S++E+   WWR  WK+ I  KVK F W++    +PT   LA R V +   C+ C     E+  H  W C+ + +V   +GF     +   + +L  L  M
Subjt:  SSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQG-ESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRDM

Query:  RDRLDWDRFEYLVVVLWALWNCRNRVKLKG-EGLSLEIPTWSAGFLESFRRANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEMMFSGVGLVV
           L  + FEY +V+ W LW  RN V   G + ++  I  W + FL  FR +N+ + A    G+ R  ARW+AP+   Y INVDA        + V  V+
Subjt:  RDRLDWDRFEYLVVVLWALWNCRNRVKLKG-EGLSLEIPTWSAGFLESFRRANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEMMFSGVGLVV

Query:  RNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYREGNRLATI
        R+H G V   A R      S   AE  A+ DGIK   +       +ETD  +A   + +D     D+  L++              +F YRE N+ A +
Subjt:  RNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYREGNRLATI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G33710.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.7e-0434.38Show/hide
Query:  SWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFS
        SW +  W      K     W   LDRLPT   LA  G+ +   C LC    E   H+F  C+F+
Subjt:  SWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFS

AT4G29090.1 Ribonuclease H-like superfamily protein4.7e-1825.64Show/hide
Query:  SPSSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLL
        SP   S  SL   ++  WK     K++ F W+   + LP    LA R +   + C  C    E+  H+ + C F++     +           DS+ + L
Subjt:  SPSSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLL

Query:  RDM----RDRLDWDRFEYLVV-VLWALWNCRNRVKLKG-EGLSLEIPTWSAGFLESFR-RANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEM
          +         W++   LV  +LW LW  RN +  +G E  + E+   +   LE +R R            +     RW  P   W K N DA ++ + 
Subjt:  RDM----RDRLDWDRFEYLVV-VLWALWNCRNRVKLKG-EGLSLEIPTWSAGFLESFR-RANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEM

Query:  MFSGVGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRD------KDDLSDLFFLISDALDSWPLSWALK
           G+G V+RN +GEV  +  R    + S   AE  A+   +   +   +  V+ E+DS+   E +  D      K  + DL  L+S   +       +K
Subjt:  MFSGVGLVVRNHRGEVMAVATRTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRD------KDDLSDLFFLISDALDSWPLSWALK

Query:  CAFTYREGNRLA
          F  REGN LA
Subjt:  CAFTYREGNRLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATATATTGGCACAAAGTCCATCATCATCCTCTTCAGAATCCTTGCAAAGTTGGTGGAGAGGGTGCTGGAAGATGTCAATCCTAAGTAAGGTTAAAATTTTCTTTTG
GCGTTTATGCTTGGATAGACTTCCAACTATAGATAATCTGGCCAAACGAGGGGTTGATGTTCTGAATGTTTGTTCCCTTTGTGGCTGCCAGGGGGAGTCTGCTATGCACG
TGTTTTGGCTTTGTAAGTTTTCTAAGAATGTCATTTGCAATGCGGGTTTTGGTTTCCTCTTTAGTCAGTTTCGGACAGATAGTTTGTTGCTGCTTCTGAGAGATATGAGG
GACCGATTGGATTGGGATCGATTTGAATATTTGGTAGTGGTGCTGTGGGCTTTGTGGAACTGTAGAAATCGAGTAAAGCTGAAGGGGGAGGGGTTGTCGTTGGAAATCCC
GACTTGGTCCGCCGGTTTTCTGGAATCATTCCGGCGAGCTAATTTGCATCGGGTGGCTGAGGTAACGAATGGGTCTGGCCGTGAAGGAGCGAGGTGGATAGCGCCGCTGG
TTAGCTGGTATAAGATTAATGTTGATGCCGCTTTCGACAGTGAGATGATGTTTTCTGGAGTTGGGCTGGTTGTGCGTAATCACAGAGGCGAGGTAATGGCGGTAGCGACA
AGAACCCATGGTTTTGTTGGAAGTTCGGACATGGCTGAAGCGTGGGCGGTGGTTGACGGTATAAAACTGGCCGCTGAGTTGGGTTTCTACCCGGTGGTGATCGAAACTGA
TTCGAAAAGGGCATTTGAGGCTATGCGCAGAGATAAAGATGACCTCTCTGATCTCTTTTTTCTGATTTCTGATGCCCTTGATTCCTGGCCTCTGTCATGGGCTTTGAAGT
GTGCTTTTACCTATCGTGAAGGGAACCGTTTGGCCACCATTTGGCTAAGCTGGCGATGTGTGGTCGTAGTGATTTCGTTTGGCTTGAGGAAGTTCCTGTTTGTGCTGAGA
GCTTTTTGTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATATATTGGCACAAAGTCCATCATCATCCTCTTCAGAATCCTTGCAAAGTTGGTGGAGAGGGTGCTGGAAGATGTCAATCCTAAGTAAGGTTAAAATTTTCTTTTG
GCGTTTATGCTTGGATAGACTTCCAACTATAGATAATCTGGCCAAACGAGGGGTTGATGTTCTGAATGTTTGTTCCCTTTGTGGCTGCCAGGGGGAGTCTGCTATGCACG
TGTTTTGGCTTTGTAAGTTTTCTAAGAATGTCATTTGCAATGCGGGTTTTGGTTTCCTCTTTAGTCAGTTTCGGACAGATAGTTTGTTGCTGCTTCTGAGAGATATGAGG
GACCGATTGGATTGGGATCGATTTGAATATTTGGTAGTGGTGCTGTGGGCTTTGTGGAACTGTAGAAATCGAGTAAAGCTGAAGGGGGAGGGGTTGTCGTTGGAAATCCC
GACTTGGTCCGCCGGTTTTCTGGAATCATTCCGGCGAGCTAATTTGCATCGGGTGGCTGAGGTAACGAATGGGTCTGGCCGTGAAGGAGCGAGGTGGATAGCGCCGCTGG
TTAGCTGGTATAAGATTAATGTTGATGCCGCTTTCGACAGTGAGATGATGTTTTCTGGAGTTGGGCTGGTTGTGCGTAATCACAGAGGCGAGGTAATGGCGGTAGCGACA
AGAACCCATGGTTTTGTTGGAAGTTCGGACATGGCTGAAGCGTGGGCGGTGGTTGACGGTATAAAACTGGCCGCTGAGTTGGGTTTCTACCCGGTGGTGATCGAAACTGA
TTCGAAAAGGGCATTTGAGGCTATGCGCAGAGATAAAGATGACCTCTCTGATCTCTTTTTTCTGATTTCTGATGCCCTTGATTCCTGGCCTCTGTCATGGGCTTTGAAGT
GTGCTTTTACCTATCGTGAAGGGAACCGTTTGGCCACCATTTGGCTAAGCTGGCGATGTGTGGTCGTAGTGATTTCGTTTGGCTTGAGGAAGTTCCTGTTTGTGCTGAGA
GCTTTTTGTTTTTAG
Protein sequenceShow/hide protein sequence
MHILAQSPSSSSSESLQSWWRGCWKMSILSKVKIFFWRLCLDRLPTIDNLAKRGVDVLNVCSLCGCQGESAMHVFWLCKFSKNVICNAGFGFLFSQFRTDSLLLLLRDMR
DRLDWDRFEYLVVVLWALWNCRNRVKLKGEGLSLEIPTWSAGFLESFRRANLHRVAEVTNGSGREGARWIAPLVSWYKINVDAAFDSEMMFSGVGLVVRNHRGEVMAVAT
RTHGFVGSSDMAEAWAVVDGIKLAAELGFYPVVIETDSKRAFEAMRRDKDDLSDLFFLISDALDSWPLSWALKCAFTYREGNRLATIWLSWRCVVVVISFGLRKFLFVLR
AFCF