; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g018640 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g018640
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr06:42826672..42828387
RNA-Seq ExpressionLcy06g018640
SyntenyLcy06g018640
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]9.3e-11140.07Show/hide
Query:  HLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIRGAIERKEKEINEL
        HL +  SDHR L   + K +++      +   RFE+ W++  EC++IV   W    Q      +    +C   L  W++ K+ GS+   I+    ++ +L
Subjt:  HLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIRGAIERKEKEINEL

Query:  SN-RSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHNLFTSSNPNID
         N R    T  E+ A E +L  LL  +EI+W+QR+R  W+ + ++NTK+FH RAN R KA  ++GL N E +   D+     I  +++ NLFT+S+P+ D
Subjt:  SN-RSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHNLFTSSNPNID

Query:  SIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGSLSLI-------------------
         I+ +L ++   ++   NR+L  +FT  EV   I +M P K        A+F+QQ+W+IVG  V    L+ LN +  L +I                   
Subjt:  SIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGSLSLI-------------------

Query:  -------------IAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWI
                     IAK L NRLK +L  II+ +QSAFVPG++ITDN +I +ECLH +++ RSGK   VA+KLDMSKAYDRVEW +I  +++K+GF+++W+
Subjt:  -------------IAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWI

Query:  DKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSIKRLL
         KIMKCV SV++   +N     +  P+RGLRQGDPLSPYLFLICAEG S++L  AE R +  GL+I    P ISHLF+ADDSLLF KA  +   SI+ + 
Subjt:  DKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSIKRLL

Query:  SCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV
        S Y + SGQ+INF+KS    SPNT +         L +Q  E++  YLGLP   G++KK +F +IK++V
Subjt:  SCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]9.3e-11143.07Show/hide
Query:  MLQWCTGFKVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIRGA
        ML  C   KV HL  L+SDHRP+LA W  E         +   RFE++W++ D CRDI+   W  +          K   C++ L  W++ +   S++GA
Subjt:  MLQWCTGFKVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIRGA

Query:  IERKEKEINELSNRSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHN
        I  KEKE+  L                R+L+                                                                     
Subjt:  IERKEKEINELSNRSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHN

Query:  LFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGSLSLIIAKTLANRL
                                + QN +L  +FTREE+ + +K MHPSK        A+F+Q++W                     +++IAK LANRL
Subjt:  LFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGSLSLIIAKTLANRL

Query:  KRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRT
        K V+D++ISP+QSAFVPG+ ITDN +IGFEC++AIKNKR GK G VA+KLDMSKAYDRVEW Y+RG+M KMGFA RW+D IM CVESV F VL+N +P  
Subjt:  KRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRT

Query:  EFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSIKRLLSCYEKA-SGQIINFDKSAFMVS
        EF PNRGLRQGDPLSPYLF++CAEGLS+++N  E++   + L+IN  CP ISHLFYADD LLFFKA   +CRSIK +L  YEKA SGQ IN DKS F+VS
Subjt:  EFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSIKRLLSCYEKA-SGQIINFDKSAFMVS

Query:  PNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV
         NT    V  I++ L+V H ESLGQYLGLPSQ GR+K++VFNNIKDRV
Subjt:  PNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]1.9e-10839.13Show/hide
Query:  GFKVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSME-CMNILLAWSRKKYEGSIRGAIERKE
        G KV HL     DH  LL   +      R    ++   FE  W K ++C+ I++  W         + + +++  C   L  WS   Y G I   I+ K 
Subjt:  GFKVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSME-CMNILLAWSRKKYEGSIRGAIERKE

Query:  KEINELSNRS-DQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHNLFTS
          +N L+ R  D+   +E+     E+ +LL+D+E YW QRA+  W+ + D+NTK+FH +A+ RRK N + G+ + +G+  +++E + + A  YF+N+++S
Subjt:  KEINELSNRS-DQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHNLFTS

Query:  SNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLN--------GDGSLSLI-----
        S+P+   IE + ++I   ++E+ N  L+ EFT+EEV + +K +HP+K        A+F+Q+YW IVG  V +  L VLN           ++SLI     
Subjt:  SNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLN--------GDGSLSLI-----

Query:  -------------------IAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMG
                           I+K LANRLK +L  IIS +QSAF   ++ITDN ++ FE +H + +K +GK+G +A+KLDMSKA+DRVEW +I  VM +MG
Subjt:  -------------------IAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMG

Query:  FAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCR
        F  RW D +M+C+ SVS+ +L+N +      P+RGLRQGDPLSP LFL+CAEGLS+++N A R +  +G+ IN  CP ++HLF+ADDS+LF KA  ++C 
Subjt:  FAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCR

Query:  SIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV
         ++ +L  YE+ASGQ IN DKS+   SPNT       I   L         +YLGLPS IGRSK +VF  +K++V
Subjt:  SIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]1.8e-10939.09Show/hide
Query:  KVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIRGAIERKEKEI
        KV HL    SDH  LL     + T  +K   +   +FE  W + ++C+DI+  +W    + +  + +   + C    L+   K   G+I   I+ K++ +
Subjt:  KVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIRGAIERKEKEI

Query:  NELSNRSDQRTMM--EMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHNLFTSSN
        N L + SD+   +  E+    +E+  LL+ +EI W+QR+R +W+   D+NTK+FH +A+ RR+ N + G+M+  G   +  EG+ ++A  YF  +++SS 
Subjt:  NELSNRSDQRTMM--EMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHNLFTSSN

Query:  PNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGS-------------------
        P    I  +LD+I  +++E+ N  L+ EFTREE+   +  MHP+K        AIF+Q+YW+IVG ++    L+VLN + S                   
Subjt:  PNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGS-------------------

Query:  -------LSL------IIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFA
               +SL      +I+K LANRLK +L  IIS +QSAF+ G++ITDN ++ FE +H +++K+ GK+G  A+KLDMSKAYDRVEW +I+ VM KMGF 
Subjt:  -------LSL------IIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFA

Query:  KRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSI
        ++WI  +M C+ SVS+ +L+N        P RGLRQGDP+SPY+FL+CA+G SS+LN   R+   SG+ I   CP I+HLF+ADDSLLF KA  ++C+++
Subjt:  KRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSI

Query:  KRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV
          +L  YE ASGQ IN DKS+   S NT       +   L         +YLGLPS IG+SK E+F  +K+RV
Subjt:  KRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]3.1e-10638.83Show/hide
Query:  QWCTGFK---VFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIW-KVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIR
        +W   FK   V HL    SDH  +LA       S+R    K    FE  W K ++C D+++  W           +V     C + L++W+ +   G+I 
Subjt:  QWCTGFK---VFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIW-KVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIR

Query:  GAIERKEKEINELSNRSDQRTM-MEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQY
          I  K + +N ++    Q      +    +EL  LL+ +EI WRQR++  W  + D+NTK+FH RA+ RRK N +  L N +G   +  E +   A  Y
Subjt:  GAIERKEKEINELSNRSDQRTM-MEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQY

Query:  FHNLFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGD--------GSLS
        F N++TSS+P    I  ++++I   ++++ N  L   FT EEV   +K +HP+K        A F+  YWDIVG  + N  L VLN +         ++S
Subjt:  FHNLFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGD--------GSLS

Query:  L------------------------IIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIR
        L                        II+K LANR K +L  IIS +QSAF P ++ITDN ++ FE +H + +K  GK+  +++KLDMSKA+DRVEW +I+
Subjt:  L------------------------IIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIR

Query:  GVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFK
        GVM K+GF ++WI  IM CV SVS+ VL+N        P+RG+RQGDPLSP LFL+CAEGLS++++ A R ++ +G+ I   CP I+HLF+ADDSLLF K
Subjt:  GVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFK

Query:  AVDKDCRSIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV
        A +++C ++  +L+ YE+ASGQ IN DKS+   SPNT     + I   L         +YLGLPS IG+SK +VF  +KDRV
Subjt:  AVDKDCRSIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV

TrEMBL top hitse value%identityAlignment
A0A2N9E147 Reverse transcriptase domain-containing protein2.8e-11340.55Show/hide
Query:  QWCTGF---KVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIW-KVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIR
        QW   F   ++ H+    SDH  L+    +   S  +   ++  RFE+AWVK   C +++ + W    + T   KV  K  EC   LL W+R +  G  +
Subjt:  QWCTGF---KVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIW-KVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIR

Query:  GAIERKEKEINELSNRSD-QRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQY
         +I+ K   + +     D  +         +EL +LLE++E YW+QR+R  W+ + DKNTK+FH  AN RR+ N +  L + +G++I  DEG+E+++T Y
Subjt:  GAIERKEKEINELSNRSD-QRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQY

Query:  FHNLFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLE-----------------
        F NLFT+SNP+  SI+ +++S++  +S D N +LL  +T+EEV + +  M PSK        A+F+QQYW ++G E+    L+                 
Subjt:  FHNLFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLE-----------------

Query:  ----VLNGD-----------GSLSLIIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIR
            V N D             L  +I+K  ANRLKR L  +IS SQSAFV G++ITDN ++ FE LH +KNKR G    +A KLDMSKAYDR+EW Y++
Subjt:  ----VLNGD-----------GSLSLIIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIR

Query:  GVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFK
         VM KMGF  RW+D +M+CV + SF +LLN  P+    P+RGLRQGDPLSPYLFLICAEG + ++  A  ++   G+ IN   P  SHLF+ADDS+LF+K
Subjt:  GVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFK

Query:  AVDKDCRSIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV
        A  ++C+ +K +L  YE ASGQ +N +K++   S NT  T  + IQ  L  +    L +YLGLP  IGRSK++ F +IK R+
Subjt:  AVDKDCRSIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV

A0A2N9GPZ7 Reverse transcriptase domain-containing protein2.5e-10940.14Show/hide
Query:  LQWCT---GFKVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIW--KVMKQTDGSKVVGKSMECMNILLAWSRKKYEGS
        + W T   G  V HL    SDH P+L         +RK   K   RFE  W+K ++CR+++D  W   V + +    VV K   C   L+ WSR+++ GS
Subjt:  LQWCT---GFKVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIW--KVMKQTDGSKVVGKSMECMNILLAWSRKKYEGS

Query:  IRGAIERKEKEINELSNRSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQ
        +  +I+RK +++  L N +       +   + +L  LLE +EI+WRQR+R  W+S+ DKNTK+FH + N RR+ N + GL + +G    +   +  IA  
Subjt:  IRGAIERKEKEINELSNRSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQ

Query:  YFHNLFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGSLS-------
        YF  +FTSSNP+ +SI  +L  +   ++   N  L  EFT++EV + +K M+P+K        AIFYQ YWDIVG EV    L +L+    L        
Subjt:  YFHNLFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGSLS-------

Query:  -------------------------LIIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYI
                                  I++K LANRLK+VL  +IS +QSAFVPG++ITDN ++ FE +H++  KR GK G +ALKLDMSKAYDRVEW ++
Subjt:  -------------------------LIIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYI

Query:  RGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFF
          +M  MGFAK WI  +M C+ SVS+ VL+N      F  +RG+RQGD LSPYLFLICAEGLS +L  A   +  +G+  +   P ++HLF+ADDSLLF 
Subjt:  RGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFF

Query:  KAVDKDCRSIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV
        +A   +C ++  +L  YE ASGQ +N  K++   + +T     + IQ+   V   +S  +YLGLPS +GRSK   F  IK RV
Subjt:  KAVDKDCRSIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV

A0A2N9IPS8 Reverse transcriptase domain-containing protein2.5e-10940.14Show/hide
Query:  LQWCT---GFKVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIW--KVMKQTDGSKVVGKSMECMNILLAWSRKKYEGS
        + W T   G  V HL    SDH P+L         +RK   K   RFE  W+K ++CR+++D  W   V + +    VV K   C   L+ WSR+++ GS
Subjt:  LQWCT---GFKVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIW--KVMKQTDGSKVVGKSMECMNILLAWSRKKYEGS

Query:  IRGAIERKEKEINELSNRSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQ
        +  +I+RK +++  L N +       +   + +L  LLE +EI+WRQR+R  W+S+ DKNTK+FH + N RR+ N + GL + +G    +   +  IA  
Subjt:  IRGAIERKEKEINELSNRSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQ

Query:  YFHNLFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGSLS-------
        YF  +FTSSNP+ +SI  +L  +   ++   N  L  EFT++EV + +K M+P+K        AIFYQ YWDIVG EV    L +L+    L        
Subjt:  YFHNLFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGSLS-------

Query:  -------------------------LIIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYI
                                  I++K LANRLK+VL  +IS +QSAFVPG++ITDN ++ FE +H++  KR GK G +ALKLDMSKAYDRVEW ++
Subjt:  -------------------------LIIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYI

Query:  RGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFF
          +M  MGFAK WI  +M C+ SVS+ VL+N      F  +RG+RQGD LSPYLFLICAEGLS +L  A   +  +G+  +   P ++HLF+ADDSLLF 
Subjt:  RGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFF

Query:  KAVDKDCRSIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV
        +A   +C ++  +L  YE ASGQ +N  K++   + +T     + IQ+   V   +S  +YLGLPS +GRSK   F  IK RV
Subjt:  KAVDKDCRSIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV

A0A6J1DUG8 uncharacterized protein LOC1110241354.5e-11143.07Show/hide
Query:  MLQWCTGFKVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIRGA
        ML  C   KV HL  L+SDHRP+LA W  E         +   RFE++W++ D CRDI+   W  +          K   C++ L  W++ +   S++GA
Subjt:  MLQWCTGFKVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIRGA

Query:  IERKEKEINELSNRSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHN
        I  KEKE+  L                R+L+                                                                     
Subjt:  IERKEKEINELSNRSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHN

Query:  LFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGSLSLIIAKTLANRL
                                + QN +L  +FTREE+ + +K MHPSK        A+F+Q++W                     +++IAK LANRL
Subjt:  LFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQYWDIVGEEVCNFCLEVLNGDGSLSLIIAKTLANRL

Query:  KRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRT
        K V+D++ISP+QSAFVPG+ ITDN +IGFEC++AIKNKR GK G VA+KLDMSKAYDRVEW Y+RG+M KMGFA RW+D IM CVESV F VL+N +P  
Subjt:  KRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRT

Query:  EFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSIKRLLSCYEKA-SGQIINFDKSAFMVS
        EF PNRGLRQGDPLSPYLF++CAEGLS+++N  E++   + L+IN  CP ISHLFYADD LLFFKA   +CRSIK +L  YEKA SGQ IN DKS F+VS
Subjt:  EFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSIKRLLSCYEKA-SGQIINFDKSAFMVS

Query:  PNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV
         NT    V  I++ L+V H ESLGQYLGLPSQ GR+K++VFNNIKDRV
Subjt:  PNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV

A0A7N2LIH6 Uncharacterized protein3.1e-11239.3Show/hide
Query:  KVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIRGAIERKEKEI
        KV H+   ASDH  LLA +  +  +QR+   ++   FE+ W + +EC++IV+  W   ++     V  +   C  +L  W++  + G++   I++K+  +
Subjt:  KVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIRGAIERKEKEI

Query:  NELSNRS-DQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHNLFTSSNP
         +L + +    T  E++  ++E+  L   +E+ W+QR+R  W+   DKN+K+FH  A+ RR+ NR+ GLM+  G   ED E  E++   YF ++++S+ P
Subjt:  NELSNRS-DQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHNLFTSSNP

Query:  NIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSKA--------IFYQQYWDIVGEEVCNFCLEVLNGD----------------------
           S ++ L+++   ++ + N  L  EF   EV+  ++ MHP+KA        IFYQ+YWDIVG  V N  L+ LN                        
Subjt:  NIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSKA--------IFYQQYWDIVGEEVCNFCLEVLNGD----------------------

Query:  ----------GSLSLIIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAK
                    +  II+K LANRLK+VL  +I  +QSAFVPG++ITDN ++ FE +H+I  +R GK+G++A+KLDMSKAYDRVEW Y+  +M KMGF  
Subjt:  ----------GSLSLIIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAK

Query:  RWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSIK
        RWI  IM CV SVSF VL+N  P+  F P+RGLRQGDP+SPYLFL+C EGLS+++   ER     G+      P ISHLF+ADDS++F +A   +C  + 
Subjt:  RWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSIK

Query:  RLLSCYEKASGQIINFDKSAFMVSPNTD---STHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV
        ++L  YE+ SGQ +N DK++   S NT        KGI    ++QH E   +YLGLP  IGR+KK+ FN IKD+V
Subjt:  RLLSCYEKASGQIINFDKSAFMVSPNTD---STHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein6.9e-1623.05Show/hide
Query:  ILLAWSRKKYE----GSIRGAIERKEKEINELSNRSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNG
        I L+ S+KK E     S+   ++  EK+      RS ++ ++++  +  ++E+      I   +    E I+K DK           +   N++R   N 
Subjt:  ILLAWSRKKYE----GSIRGAIERKEKEINELSNRSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNG

Query:  EGQKIEDDEGMERIATQYFHNLFTSSNPNIDSIELILDSISV-SISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQY-WDIVGEEVCNF
        +G    D E ++     ++  L+++   N+D ++  LD   V  +++DQ   L +  + +E+  VI ++   K        A FYQ +  D++   + + 
Subjt:  EGQKIEDDEGMERIATQYFHNLFTSSNPNIDSIELILDSISV-SISEDQNRMLLNEFTREEVYMVIKNMHPSK--------AIFYQQY-WDIVGEEVCNF

Query:  CLEVLNGDGSL----------------------------------SLIIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKD
            +  +G+L                                  + I+ K LANR++  +  II P Q  F+PG     N       +H I NK   K+
Subjt:  CLEVLNGDGSL----------------------------------SLIIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKD

Query:  GMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLR
         M+ + LD  KA+D+++  ++  V+ + G    +++ I          + +N           G RQG PLSPYLF I  E L+  +    +++E  G++
Subjt:  GMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLR

Query:  INNYCPFISHLFYADDSLLFFKAVDKDCRSIKRLLSCYEKASGQIINFDKS-AFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGL
        I      IS L  ADD +++        R +  L++ + +  G  IN +KS AF+ + N  +   K I+E        +  +YLG+
Subjt:  INNYCPFISHLFYADDSLLFFKAVDKDCRSIKRLLSCYEKASGQIINFDKS-AFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGL

P14381 Transposon TX1 uncharacterized 149 kDa protein1.1e-2125.23Show/hide
Query:  KKYEGSIRGAIERKEKEINELSNR----SDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIED
        K   G     IE    E+ +L  R     DQ    E   ++  L ++ +        R+R + +   D+ +++F+     +    ++  L   +G  +ED
Subjt:  KKYEGSIRGAIERKEKEINELSNR----SDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIED

Query:  DEGMERIATQYFHNLFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSKA--------IFYQQYWDIVGEEV------------
         E +   A  ++ NLF+    + D+ E + D + V +SE +   L    T +E+   ++ M  +K+         F+Q +WD +G +             
Subjt:  DEGMERIATQYFHNLFTSSNPNIDSIELILDSISVSISEDQNRMLLNEFTREEVYMVIKNMHPSKA--------IFYQQYWDIVGEEV------------

Query:  ----C-NFCLEVLNGDGSLSL---------------IIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSK
            C    L +L   G L L               I+AK ++ RLK VL  +I P QS  VPG+ I DN  +  + LH    +R+G   +  L LD  K
Subjt:  ----C-NFCLEVLNGDGSLSL---------------IIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSK

Query:  AYDRVEWCYIRGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHL
        A+DRV+  Y+ G +    F  +++  +     S    V +N          RG+RQG PLS  L+ +  E    +L     R+  +GL +      +   
Subjt:  AYDRVEWCYIRGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHL

Query:  FYADDSLLFFKAVDKDCRSIKRLLSC---YEKASGQIINFDKSA
         YADD +L    V +D   ++R   C   Y  AS   IN+ KS+
Subjt:  FYADDSLLFFKAVDKDCRSIKRLLSC---YEKASGQIINFDKSA

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM3.1e-0827.37Show/hide
Query:  IAKTLANRLKRVLDTIIS------PSQSAFVPGKVITDN-TVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMKC
        +   L  +L  +L T ++      P Q  F+P     DN T++     H+ K+ RS         LD+SKA+D +    I   +   G  K ++D +   
Subjt:  IAKTLANRLKRVLDTIIS------PSQSAFVPGKVITDN-TVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMKC

Query:  VESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLF
         E     +  +     EF P RG++QGDPLSP LF +  + L   L +        G ++ N     +   +ADD +LF
Subjt:  VESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDSLLF

P92555 Uncharacterized mitochondrial protein AtMg012504.2e-1347.06Show/hide
Query:  LLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDS
        ++N  P+    P+RGLRQGDPLSPYLF++C E LS +   A+ +    G+R++N  P I+HL +ADD+
Subjt:  LLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDS

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)4.1e-0827.19Show/hide
Query:  HAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNT
        H IK +R        + LD+ KA+D V    I   M   G      D IM  +      +++      +     G++QGDPLSP LF I  + L + LN 
Subjt:  HAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNT

Query:  AERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVV--QHKESLG-----QY
             E  G  +   C  I+ L +ADD LL  +  D D  +       Y +  G  +N +K A + +       V   +    +  ++ + LG     +Y
Subjt:  AERRREFSGLRINNYCPFISHLFYADDSLLFFKAVDKDCRSIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVV--QHKESLG-----QY

Query:  LGLP-SQIGRSKKEVFN
        LGL  S  G +K  V+N
Subjt:  LGLP-SQIGRSKKEVFN

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.3e-1438.64Show/hide
Query:  LANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMK
        +  RLK ++  +I P+Q++F+PG+V TDN V   E +H+++ K+ G  G + LKLD+ KAYDR+ W Y+   +   GF + W+ +I +
Subjt:  LANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNKRSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMK

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.0e-1447.06Show/hide
Query:  LLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDS
        ++N  P+    P+RGLRQGDPLSPYLF++C E LS +   A+ +    G+R++N  P I+HL +ADD+
Subjt:  LLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYCPFISHLFYADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACAGTGGTGTACTGGTTTTAAGGTTTTTCATCTTCCTTTCCTTGCATCTGACCACCGGCCTTTATTAGCAAAATGGTCAAAAGAAAAAACAAGCCAAAGGAAAAA
TGGAGCAAAATATCCGAGAAGATTTGAGGATGCTTGGGTTAAATACGACGAGTGCAGAGACATTGTGGATCAGATCTGGAAAGTAATGAAACAGACAGATGGAAGCAAGG
TTGTGGGAAAATCTATGGAGTGCATGAATATATTATTAGCTTGGAGTCGGAAAAAATATGAGGGCTCGATCCGAGGAGCGATTGAGAGAAAAGAAAAGGAGATAAATGAA
TTGAGCAATAGAAGTGATCAAAGGACTATGATGGAAATGGAGGCTAAAGAAAGGGAACTGGAGAGTTTGCTAGAGGATGATGAAATTTACTGGAGGCAACGTGCAAGAGA
AGAGTGGATTAGTAAAGAAGACAAGAATACGAAGTGGTTTCACATGAGAGCTAACACAAGGAGGAAAGCCAACCGAGTTCGAGGTTTGATGAATGGGGAGGGTCAGAAGA
TTGAGGATGATGAAGGAATGGAAAGAATTGCAACGCAATATTTCCATAATCTCTTCACATCCTCAAACCCAAACATAGATTCCATCGAGCTCATTCTAGATTCAATCTCG
GTTAGTATATCAGAGGACCAAAATCGTATGTTGTTGAACGAGTTCACCAGGGAAGAGGTCTACATGGTCATAAAGAATATGCATCCTTCAAAGGCAATTTTTTACCAACA
ATATTGGGACATAGTTGGTGAGGAGGTTTGTAATTTTTGTTTGGAAGTCCTGAATGGAGACGGCTCTTTAAGTCTGATTATAGCCAAAACTTTAGCTAACAGACTAAAAA
GAGTTTTGGATACAATTATTTCCCCGAGTCAGTCAGCTTTCGTTCCAGGTAAAGTCATTACTGATAACACAGTGATTGGTTTTGAATGTCTTCATGCGATTAAGAATAAG
AGAAGCGGGAAAGATGGGATGGTTGCGCTCAAATTGGATATGAGCAAAGCGTATGATCGTGTGGAATGGTGCTACATTAGAGGCGTAATGAGTAAGATGGGGTTTGCTAA
AAGATGGATTGACAAGATTATGAAATGTGTGGAATCTGTAAGCTTTCAAGTTCTGCTGAATGAGTTACCTCGCACAGAATTCAAACCAAATCGAGGTCTGAGGCAAGGAG
ATCCGCTATCTCCGTATCTTTTCTTAATTTGTGCAGAGGGCCTATCGAGTATCCTCAATACTGCTGAACGGAGGAGGGAGTTTTCAGGTTTGCGTATCAATAACTATTGC
CCTTTTATATCTCACCTCTTTTATGCAGATGATAGTCTCTTGTTCTTTAAAGCTGTGGATAAGGATTGCAGGTCCATCAAAAGGCTCCTCTCTTGCTATGAAAAGGCGTC
GGGACAAATCATTAATTTTGACAAGTCAGCCTTTATGGTGAGCCCGAACACAGATTCAACCCATGTCAAAGGAATCCAGGAGTGTCTAGTAGTTCAACACAAGGAAAGCT
TGGGTCAATACCTAGGCCTTCCATCTCAAATCGGTAGGAGCAAGAAGGAAGTGTTTAACAACATCAAGGATCGTGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTACAGTGGTGTACTGGTTTTAAGGTTTTTCATCTTCCTTTCCTTGCATCTGACCACCGGCCTTTATTAGCAAAATGGTCAAAAGAAAAAACAAGCCAAAGGAAAAA
TGGAGCAAAATATCCGAGAAGATTTGAGGATGCTTGGGTTAAATACGACGAGTGCAGAGACATTGTGGATCAGATCTGGAAAGTAATGAAACAGACAGATGGAAGCAAGG
TTGTGGGAAAATCTATGGAGTGCATGAATATATTATTAGCTTGGAGTCGGAAAAAATATGAGGGCTCGATCCGAGGAGCGATTGAGAGAAAAGAAAAGGAGATAAATGAA
TTGAGCAATAGAAGTGATCAAAGGACTATGATGGAAATGGAGGCTAAAGAAAGGGAACTGGAGAGTTTGCTAGAGGATGATGAAATTTACTGGAGGCAACGTGCAAGAGA
AGAGTGGATTAGTAAAGAAGACAAGAATACGAAGTGGTTTCACATGAGAGCTAACACAAGGAGGAAAGCCAACCGAGTTCGAGGTTTGATGAATGGGGAGGGTCAGAAGA
TTGAGGATGATGAAGGAATGGAAAGAATTGCAACGCAATATTTCCATAATCTCTTCACATCCTCAAACCCAAACATAGATTCCATCGAGCTCATTCTAGATTCAATCTCG
GTTAGTATATCAGAGGACCAAAATCGTATGTTGTTGAACGAGTTCACCAGGGAAGAGGTCTACATGGTCATAAAGAATATGCATCCTTCAAAGGCAATTTTTTACCAACA
ATATTGGGACATAGTTGGTGAGGAGGTTTGTAATTTTTGTTTGGAAGTCCTGAATGGAGACGGCTCTTTAAGTCTGATTATAGCCAAAACTTTAGCTAACAGACTAAAAA
GAGTTTTGGATACAATTATTTCCCCGAGTCAGTCAGCTTTCGTTCCAGGTAAAGTCATTACTGATAACACAGTGATTGGTTTTGAATGTCTTCATGCGATTAAGAATAAG
AGAAGCGGGAAAGATGGGATGGTTGCGCTCAAATTGGATATGAGCAAAGCGTATGATCGTGTGGAATGGTGCTACATTAGAGGCGTAATGAGTAAGATGGGGTTTGCTAA
AAGATGGATTGACAAGATTATGAAATGTGTGGAATCTGTAAGCTTTCAAGTTCTGCTGAATGAGTTACCTCGCACAGAATTCAAACCAAATCGAGGTCTGAGGCAAGGAG
ATCCGCTATCTCCGTATCTTTTCTTAATTTGTGCAGAGGGCCTATCGAGTATCCTCAATACTGCTGAACGGAGGAGGGAGTTTTCAGGTTTGCGTATCAATAACTATTGC
CCTTTTATATCTCACCTCTTTTATGCAGATGATAGTCTCTTGTTCTTTAAAGCTGTGGATAAGGATTGCAGGTCCATCAAAAGGCTCCTCTCTTGCTATGAAAAGGCGTC
GGGACAAATCATTAATTTTGACAAGTCAGCCTTTATGGTGAGCCCGAACACAGATTCAACCCATGTCAAAGGAATCCAGGAGTGTCTAGTAGTTCAACACAAGGAAAGCT
TGGGTCAATACCTAGGCCTTCCATCTCAAATCGGTAGGAGCAAGAAGGAAGTGTTTAACAACATCAAGGATCGTGTCTGA
Protein sequenceShow/hide protein sequence
MLQWCTGFKVFHLPFLASDHRPLLAKWSKEKTSQRKNGAKYPRRFEDAWVKYDECRDIVDQIWKVMKQTDGSKVVGKSMECMNILLAWSRKKYEGSIRGAIERKEKEINE
LSNRSDQRTMMEMEAKERELESLLEDDEIYWRQRAREEWISKEDKNTKWFHMRANTRRKANRVRGLMNGEGQKIEDDEGMERIATQYFHNLFTSSNPNIDSIELILDSIS
VSISEDQNRMLLNEFTREEVYMVIKNMHPSKAIFYQQYWDIVGEEVCNFCLEVLNGDGSLSLIIAKTLANRLKRVLDTIISPSQSAFVPGKVITDNTVIGFECLHAIKNK
RSGKDGMVALKLDMSKAYDRVEWCYIRGVMSKMGFAKRWIDKIMKCVESVSFQVLLNELPRTEFKPNRGLRQGDPLSPYLFLICAEGLSSILNTAERRREFSGLRINNYC
PFISHLFYADDSLLFFKAVDKDCRSIKRLLSCYEKASGQIINFDKSAFMVSPNTDSTHVKGIQECLVVQHKESLGQYLGLPSQIGRSKKEVFNNIKDRV