CuGenDBv2

Lcy05g018100 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy05g018100
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr05:34158272..34159543
RNA-Seq Expression: Lcy05g018100
Synteny: Lcy05g018100
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR000477 - Reverse transcriptase domain; IPR043502 - DNA/RNA polymerase superfamily
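The genome location field combines a chromosome name with a start..end coordinate pair. A minimal Python sketch of how it could be parsed, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr05:34158272..34159543")
length = span_length(start, end)  # 1272 under the inclusive assumption
```

Under that assumption the locus spans 1272 bp, which appears to equal the length of the CDS listed under Sequences.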


Homology
GenBank top hits
XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] (e value: 9.5e-127; % identity: 52.59)
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MN  L+  +TREEI TA+   HPTKAPG DG   +F+QKYW++VGN  V   L +LNS  S+ + N TNI L+PK + P  +SD+RPISLCNV YK+I+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII
        V+ANRLK +L +II E QSAF+ GR I+DN++++ E +H+L+ KK +GK G+AA+KLDMSKAYDRVEW +++Q+MEK+GFH+ WIKLVM CIT+ S+SI+
Subjt:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII

Query:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVD-ARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK
        +NG  +G I PTRG+RQGDP+SPY+FLLC+ G S+LL D AR   ++GVSI R CPKI+HLFFADDSL+F KA ++E      IL  YE ASGQ +N  K
Subjt:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVD-ARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI
        S VFFS N P + +  +  ++          YLGLPS   + K   F  + +RV   L GWK +  S GG+E+LIK++ QAIPTY + CF+IPK +  +I
Subjt:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI

Query:  TTLCSRFWWGSQGEKRSMHWKRWE
          +  RFWWG +G++  + W  W+
Subjt:  TTLCSRFWWGSQGEKRSMHWKRWE

XP_023909336.1 uncharacterized protein LOC112020997 [Quercus suber] (e value: 7.1e-122; % identity: 50.59)
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MNQ L   +T  E+  A++      APG DG P +F++ YW+ VG   +S  L++LNS     + NHT I LIPK + P    D+RPISLCNV YK+I+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII
         IANRLK  L ++I + QSAF+  R I+DN++++ ETLH L K KRKGK GY ALKLDMSKAYDRVEWT+L  +M+KLGF   WI L+  CI+T SFSI+
Subjt:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII

Query:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVD-ARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK
        +NG  +G I P RG+RQGDPLSPYLFLLC++GL AL+   A N +++GVS+ RE P+++HL FADDSL+  KA + E      +L  YE+ASGQ +N  K
Subjt:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVD-ARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI
        + +FFS N    T+  +   + + +S  L  YLGLPS   RGK + F +I +R+W  +QGWK +  SQ GKEVLIKSI+QA+PTY++ CF++P+ +   I
Subjt:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI

Query:  TTLCSRFWWGSQGEKRSMHWKRW
         +L  +FWWG +GE+R  HW  W
Subjt:  TTLCSRFWWGSQGEKRSMHWKRW

XP_030923017.1 uncharacterized protein LOC115949892 [Quercus lobata] (e value: 2.9e-123; % identity: 50.71)
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MN  LL P+  EE+  A++   P  APG DGFP LFY+ +W+ VG +     L++LNS       NHT I LIPK + PR V+++RPISLCNV YK+I K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII
        V+ANRLK +L  +I E QSAF+  R I+DN++++HETLHFL K KR GK+G+ ++KLDMSKAYDRVEWTYL +IME++GF+  WI L+  CI + ++SI+
Subjt:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII

Query:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNN-SMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK
        LNG+  G I PTRG+RQGDPLSPYLFLL ++GL AL  +A+++  + GVS+    P+ISHL FADDSLVF +A   E    +SIL  YE ASGQC+N  K
Subjt:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNN-SMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI
        + +FFS N   + +  +   + + ++     YLGLPS   R K + F  I +R+W  L+GWK +  SQ G+E+L+K++IQAIP Y + CFR+PKG+++ I
Subjt:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI

Query:  TTLCSRFWWGSQGEKRSMHWKRWE
         TL  +FWWG +GE++ +HW  WE
Subjt:  TTLCSRFWWGSQGEKRSMHWKRWE

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata] (e value: 5.4e-122; % identity: 52.12)
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        M + L   +T EE+  A+    PTKAPG DG   LFYQK+W +VG+  VS  L  LN+   + + NHTNIVLIPK + P  +S++RPISLCNV YKII+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII
        V+ANRLK VL +II   QSAF+PGR I+DN+++++ETLH +  +K KGK G  ALKLD+SKAYDRVEW +L+ IMEK+GF   WI+ VM C+TT SFSI+
Subjt:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII

Query:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNSM-AGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK
        +NG+ +  I+P+RGIRQGDP+SPYLFLLC++GL+ALL  A  N M  GVSI R  PKI++L FADDSL+F +A   E      IL  YE+ASGQ +N  K
Subjt:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNSM-AGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI
        S  +FS N     K  +  I+ +        YLGLP+   R K   F  + DRVW  LQGWK    S+ GKE+LIK++ QAIPTY +  F+IP  + +++
Subjt:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI

Query:  TTLCSRFWWGSQGEKRSMHWKRWE
          LC+RFWWG  G +R +HWK W+
Subjt:  TTLCSRFWWGSQGEKRSMHWKRWE

XP_030941688.1 uncharacterized protein LOC115966628 [Quercus lobata] (e value: 5.4e-122; % identity: 51.54)
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        M Q L   +T  EI  A+    PTKAPG DG   LFYQK+W +VG+  +S  L   NS     + NHTNIVLIPK   P  +SD+RPISLCNV YKII+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII
        V+ANRLK VL  II   QSAF+PG  I+DN++++ +TLH + + +RKGK G  ALKLD+SKAYDRVEW +L+ IM KLGF D WI  VM C++T +FS++
Subjt:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII

Query:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDAR-NNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK
        +NG+ FG I P+RG+RQGDPLSPYLFLLC++G S+LL  A     + GVSI +  P+ISHL FADDSL+F +A  +E      IL  Y  ASGQC+N  K
Subjt:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDAR-NNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI
        S + FS N P   K++    + +       SYLGLP+   R K + F FI DRVW  LQGWK +  S+ GKEVLIK++ Q+IPTY +G F++P  +  ++
Subjt:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI

Query:  TTLCSRFWWGSQGEKRSMHWKRW
        + +C+RFWWG    +R +HWK W
Subjt:  TTLCSRFWWGSQGEKRSMHWKRW

TrEMBL top hits
A0A2N9E9A1 Reverse transcriptase domain-containing protein (e value: 3.0e-126; % identity: 50.94)
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MN  L   +   E++ A++   P K+PG DGFP +FYQKYW ++G       L  LNS   ++  NHT+I LIPK + P  V D+RPISLCNV YKII+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII
        V+ NRLK +L +I+ E QSAF+PGR I+DN++++ ETLH +  ++R+GK G  ALKLDMSKAYDRVEW YL ++M+++GFH+ W+K++M+CI+T S+SI+
Subjt:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII

Query:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARN-NSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK
        +NGE  G+I+P+RG+RQGDPLSPYLFL C++GL +LL  A+N  +M GVSI+R  PK++HLFFADDSL+F KA   E    + IL  YE ASGQ +N  K
Subjt:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARN-NSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI
        + +FFS + P   + ++  ++ + +      YLGLPS   R K   F  I +RVW+ L+GWK +  SQ GKE+LIKS+ QAIPTYA+ CFR+P+ ++ +I
Subjt:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI

Query:  TTLCSRFWWGSQGEKRSMHWKRWE
          L  RFWWG +GE+  MHW  W+
Subjt:  TTLCSRFWWGSQGEKRSMHWKRWE

A0A2N9EPY2 Reverse transcriptase domain-containing protein (e value: 1.8e-126; % identity: 52.01)
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MN++L+  +T EE+  A++   P KAPG DG P LFYQKYW ++G       L  LNS   ++  NHT I LIPK + P  V ++RPI+LCNV YKI++K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII
        V+ANRLK +L +I+ E Q+AF+PGR I+DN++++ ETLH +  +K KGKVG  ALKLDMSKAYDRVEW+YL+++ME++GFH  W+ L+M+CITT S+SI+
Subjt:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII

Query:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARN-NSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK
        +NGE  GFI+P RG+RQGDPLSPYLFL C+KGL +L+  A++   + GV+I+R  PKI+HLFFADDSL+F KA   +    +SIL +YE+ASGQ  N  K
Subjt:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARN-NSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI
        + +FFS + P   +  +  ++ +        YLGLPS   R K   F  I DRVW+ L+GWK +  SQ G+EVLIKS+ QAIPTYA+ CFR+P  ++ +I
Subjt:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI

Query:  TTLCSRFWWGSQGEKRSMHWKRW
          L  RFWWG  G+K  MHW  W
Subjt:  TTLCSRFWWGSQGEKRSMHWKRW

A0A2N9FNH6 Reverse transcriptase domain-containing protein (e value: 2.8e-124; % identity: 51.3)
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MN +LLAP+T EEI +A+   HPTKAPG DG   +FYQK+W +VG+   +  L  L+S   ++  N T+I LIPK   P L++ +RPISLCNV YKII+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII
        V+ANRLK VL+ II + QSAF+PGR I+DN++++ E LH++ K KRKG+  + A+KLDMSKAYDRVEW +L  +M KLGF   W+ L+M+C+T+ S+S++
Subjt:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII

Query:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDA-RNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK
        LNGE  G+I+PTRGIRQGDPLSPYLFL+C++GL+ALL  A R+  + G+SI R  P+ISHLFFADDSL+F +A   E     +IL  YE+ASGQ +N  K
Subjt:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDA-RNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI
        + +FFS N   D +  +  ++  + +G LG YLGLP    RGK + F  I  ++   L GWK +  SQ G+E+LIKS+ QAIP Y + CFRIP  + ++I
Subjt:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI

Query:  TTLCSRFWWGSQGEKRSMHWKRW
         ++ S+FWWG + E++ +HW++W
Subjt:  TTLCSRFWWGSQGEKRSMHWKRW

A0A2N9J3U0 Reverse transcriptase domain-containing protein (e value: 2.8e-124; % identity: 51.3)
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK
        MN +LLAP+T EEI +A+   HPTKAPG DG   +FYQK+W +VG+   +  L  L+S   ++  N T+I LIPK   P L++ +RPISLCNV YKII+K
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII
        V+ANRLK VL+ II + QSAF+PGR I+DN++++ E LH++ K KRKG+  + A+KLDMSKAYDRVEW +L  +M KLGF   W+ L+M+C+T+ S+S++
Subjt:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII

Query:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDA-RNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK
        LNGE  G+I+PTRGIRQGDPLSPYLFL+C++GL+ALL  A R+  + G+SI R  P+ISHLFFADDSL+F +A   E     +IL  YE+ASGQ +N  K
Subjt:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDA-RNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAK

Query:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI
        + +FFS N   D +  +  ++  + +G LG YLGLP    RGK + F  I  ++   L GWK +  SQ G+E+LIKS+ QAIP Y + CFRIP  + ++I
Subjt:  SMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKI

Query:  TTLCSRFWWGSQGEKRSMHWKRW
         ++ S+FWWG + E++ +HW++W
Subjt:  TTLCSRFWWGSQGEKRSMHWKRW

A0A5B7BN08 Reverse transcriptase domain-containing protein (e value: 1.6e-127; % identity: 52)
Query:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILN-SEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIIT
        MN  LL  +  +E+  A+   HPTKAPG DG  TLF+QK+WDVVG       L  LN   GS+E  N+T I LIPK   PR +S++RPISLCNV YKII+
Subjt:  MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILN-SEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIIT

Query:  KVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSI
        K++ANRLK +L  II E QSAF+PGR I+DN++++ E +H L K KRKGK+G +ALKLDMSKAYDRVEW++L  +M ++GFH  W+ L+M C++T SFS+
Subjt:  KVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSI

Query:  ILNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARN-NSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFA
        ++NG+  G I+PTRG+RQGDPLSPYLF+LC++  SALL  + N N + G+S+AR  P++SHLFFADDSL+F  A   +      I+  Y  ASGQ VNF 
Subjt:  ILNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARN-NSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFA

Query:  KSMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTK
        KS + FS N+ +D ++ +  I+ +++      YLGLPST  R K + F  I DRVW  L+GWK +  S+ G+EVLIKS+ QAIPTY + CF+IP  I  +
Subjt:  KSMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTK

Query:  ITTLCSRFWWGSQGEKRSMHWKRWE
        I  + S FWWG  G +R +HW RW+
Subjt:  ITTLCSRFWWGSQGEKRSMHWKRWE

SwissProt top hits
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 9.8e-34; % identity: 25.84)
Query:  QMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNS---EGSI-EDWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKI
        + L  P T  EI+  + +    K+PG DGF   FYQ+Y +    + V   L +  S   EG +   +   +I+LIPK GR      ++RPISL N+  KI
Subjt:  QMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNS---EGSI-EDWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKI

Query:  ITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASF
        + K++ANR++  + ++I   Q  FIPG     N+  S   +  + + K K  V    + +D  KA+D+++  ++ + + KLG    ++K++       + 
Subjt:  ITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASF

Query:  SIILNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNF
        +IILNG+         G RQG PLSP LF +  + L+  +   +   + G+ + +E  K+S   FADD +V+ +           ++ ++ K SG  +N 
Subjt:  SIILNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNF

Query:  AKSMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGK--TRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGC--FRIPK
         KS  F   N      Q +   +   ++     YLG+  T         +++ +L  +      WK+   S  G+  ++K  I     Y       ++P 
Subjt:  AKSMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGK--TRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGC--FRIPK

Query:  GILTKITTLCSRFWWGSQ
           T++     +F W  +
Subjt:  GILTKITTLCSRFWWGSQ

P08548 LINE-1 reverse transcriptase homolog (e value: 8.3e-33; % identity: 29.97)
Query:  QMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIED-WNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKIITK
        +ML  P +  EI + ++N    K+PG DGF + FYQ + + +    + +    +  EG + + +   NI LIPK G+ P    +YRPISL N+  KI+ K
Subjt:  QMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIED-WNHTNIVLIPK-GRQPRLVSDYRPISLCNVSYKIITK

Query:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII
        ++ NR++  + +II   Q  FIPG     N+  S   +  + K K K    +  L +D  KA+D ++  ++ + ++K+G    ++KL+    +  + +II
Subjt:  VIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSII

Query:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAKS
        LNG          G RQG PLSP LF +  + L+  + + +  ++ G+ I  E  K+S   FADD +V+ +   +       ++ +Y   SG  +N  KS
Subjt:  LNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAKS

Query:  MVFFSGN
        + F   N
Subjt:  MVFFSGN

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 1.6e-31; % identity: 26.19)
Query:  LLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNS-------EGSI-EDWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSY
        L +P + +EI   + +    K+PG DGF   FYQ + +        D + IL+        EG++   +    I LIPK  + P  + ++RPISL N+  
Subjt:  LLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNS-------EGSI-EDWNHTNIVLIPK-GRQPRLVSDYRPISLCNVSY

Query:  KIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTA
        KI+ K++ANR++  +  II   Q  FIPG     N+  S   +H++ K K K    +  + LD  KA+D+++  ++ +++E+ G    ++ ++    +  
Subjt:  KIITKVIANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTA

Query:  SFSIILNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCV
          +I +NGE    I    G RQG PLSPYLF +  + L+  +   +   + G+ I +E  KIS L  ADD +V+            +++  + +  G  +
Subjt:  SFSIILNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCV

Query:  NFAKSMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRD--FQFILDRVWAVLQGWKSQFFSQGGKEVLIKSII--QAIPTYALGCFRI
        N  KSM F         K+         ++ ++  YLG+  T       D  F+ +   +   L+ WK    S  G+  ++K  I  +AI  +     +I
Subjt:  NFAKSMVFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGLPSTFHRGKTRD--FQFILDRVWAVLQGWKSQFFSQGGKEVLIKSII--QAIPTYALGCFRI

Query:  PKGILTKITTLCSRFWWGSQ
        P     ++     +F W ++
Subjt:  PKGILTKITTLCSRFWWGSQ

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 6.0e-31; % identity: 26.59)
Query:  QMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIE-DWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKV
        + L  P T +E+  A+R     K+PG DG    F+Q +WD +G       L     +G +        + L+PK    RL+ ++RP+SL +  YKI+ K 
Subjt:  QMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIE-DWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKV

Query:  IANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSIIL
        I+ RLK VL E+I   QS  +PGR+I DN+ L  + LHF     R+  +  A L LD  KA+DRV+  YL   ++   F   ++  +     +A   + +
Subjt:  IANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSIIL

Query:  NGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAKSM
        N      +   RG+RQG PLS  L+ L  +    LL       + G+ +     ++    +ADD ++       +    +     Y  AS   +N++KS 
Subjt:  NGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAKSM

Query:  VFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGL-PSTFHRGKTRDFQFILDRVWAVLQGWK--SQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTK
            G++  D          ++    +  YLG+  S      +++F  + + V   L  WK  ++  S  G+ ++I  ++ +   Y L C    +  + K
Subjt:  VFFSGNIPSDTKQYLSHIMSMNMSGSLGSYLGL-PSTFHRGKTRDFQFILDRVWAVLQGWK--SQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTK

Query:  ITTLCSRFWW
        I      F W
Subjt:  ITTLCSRFWW

P92555 Uncharacterized mitochondrial protein AtMg01250 (e value: 1.5e-13; % identity: 38.94)
Query:  ILNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNS-MAGVSIARECPKISHLFFADD--SLVFFKAKAEEFGFFRSILLDYEKASGQCVN
        I+NG   G + P+RG+RQGDPLSPYLF+LC++ LS L   A+    + G+ ++   P+I+HL FADD  S  +    A+ +  F    L      G  VN
Subjt:  ILNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNS-MAGVSIARECPKISHLFFADD--SLVFFKAKAEEFGFFRSILLDYEKASGQCVN

Query:  FAKSMVFFSGNIP
           S ++F G++P
Subjt:  FAKSMVFFSGNIP

Arabidopsis top hits
AT1G43760.1 DNAse I-like superfamily protein (e value: 1.1e-08; % identity: 36.36)
Query:  EEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIIT
        +EI  A+      KAPG D F   F+ + W VV + T++       +   ++ +N T I LIPK      +S +RP+S C V YKIIT
Subjt:  EEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIIT

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 1.5e-13; % identity: 39.29)
Query:  IANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWI
        +  RLK ++  +I   Q++FIPGR  +DN++   E +H +++K  KG  G+  LKLD+ KAYDR+ W YL   +   GF + W+
Subjt:  IANRLKGVLNEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWI

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 2.8e-07; % identity: 40.91)
Query:  AIPTYALGCFRIPKGILTKITTLCSRFWWGSQGEKRSMHWKRWE
        A+PTY + CF +PK +  +I ++ + FWW ++ E + MHWK W+
Subjt:  AIPTYALGCFRIPKGILTKITTLCSRFWWGSQGEKRSMHWKRWE

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 6.8e-06; % identity: 40.91)
Query:  AIPTYALGCFRIPKGILTKITTLCSRFWWGSQGEKRSMHWKRWE
        A+P YA+ CFR+ K +  K+T+  + FWW S   KR + W  W+
Subjt:  AIPTYALGCFRIPKGILTKITTLCSRFWWGSQGEKRSMHWKRWE

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 1.0e-14; % identity: 38.94)
Query:  ILNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNS-MAGVSIARECPKISHLFFADD--SLVFFKAKAEEFGFFRSILLDYEKASGQCVN
        I+NG   G + P+RG+RQGDPLSPYLF+LC++ LS L   A+    + G+ ++   P+I+HL FADD  S  +    A+ +  F    L      G  VN
Subjt:  ILNGETFGFIRPTRGIRQGDPLSPYLFLLCSKGLSALLVDARNNS-MAGVSIARECPKISHLFFADD--SLVFFKAKAEEFGFFRSILLDYEKASGQCVN

Query:  FAKSMVFFSGNIP
           S ++F G++P
Subjt:  FAKSMVFFSGNIP


Sequences
CDS sequence
ATGAATCAGATGTTGTTGGCACCTTATACCAGAGAGGAAATTATTACTGCGATGCGAAATTTTCATCCAACTAAGGCCCCGGGATCGGATGGATTTCCAACATTATTCTA
TCAGAAATATTGGGACGTTGTGGGAAACAAAACGGTGTCTGATTGCTTGGCAATTCTCAATTCGGAGGGATCGATAGAGGACTGGAATCATACCAATATTGTGCTCATCC
CCAAAGGCCGACAACCCAGGTTAGTATCAGATTATCGCCCAATTAGTCTATGCAATGTCTCTTATAAAATAATAACTAAGGTCATAGCTAATAGACTTAAGGGTGTGTTA
AATGAGATAATCGATGAATGTCAATCTGCGTTCATTCCTGGTAGATCGATATCTGATAACATGATCTTGAGTCATGAGACGCTTCATTTTCTTAAAAAAAAAAAGCGGAA
GGGAAAAGTTGGTTATGCTGCGCTAAAACTAGATATGAGCAAGGCCTATGATAGGGTGGAGTGGACATATTTGAGACAAATCATGGAAAAGTTGGGGTTTCATGATTGTT
GGATCAAGTTAGTTATGAAATGTATTACAACCGCCTCTTTTTCTATCATTTTAAATGGGGAAACCTTTGGGTTCATTAGACCAACTCGTGGAATTCGTCAAGGCGATCCT
TTATCACCTTACTTGTTTTTACTATGTTCCAAAGGCCTGTCGGCTTTGTTGGTGGATGCGAGAAATAATTCAATGGCCGGGGTGTCCATAGCGCGTGAGTGTCCTAAAAT
TTCGCATTTATTCTTTGCGGATGATAGTTTGGTTTTTTTTAAAGCCAAGGCGGAGGAGTTTGGATTTTTCAGATCTATTTTGTTAGATTATGAAAAGGCTTCTGGGCAAT
GTGTTAATTTTGCAAAATCAATGGTGTTCTTCTCGGGGAATATTCCAAGTGACACCAAACAATATCTTAGTCATATTATGTCTATGAATATGTCTGGGTCCTTGGGATCC
TACCTTGGATTGCCATCAACATTCCATAGAGGGAAAACTCGAGATTTCCAGTTCATTTTGGATAGGGTCTGGGCTGTGCTTCAAGGATGGAAGAGCCAATTTTTTTCACA
GGGAGGGAAAGAGGTTCTGATAAAATCTATTATTCAAGCGATTCCAACCTATGCCCTGGGGTGTTTCCGGATTCCGAAAGGTATTCTGACAAAGATTACGACTTTATGCT
CTAGATTCTGGTGGGGCTCTCAGGGTGAAAAGCGAAGTATGCATTGGAAGAGATGGGAATAG
mRNA sequence
ATGAATCAGATGTTGTTGGCACCTTATACCAGAGAGGAAATTATTACTGCGATGCGAAATTTTCATCCAACTAAGGCCCCGGGATCGGATGGATTTCCAACATTATTCTA
TCAGAAATATTGGGACGTTGTGGGAAACAAAACGGTGTCTGATTGCTTGGCAATTCTCAATTCGGAGGGATCGATAGAGGACTGGAATCATACCAATATTGTGCTCATCC
CCAAAGGCCGACAACCCAGGTTAGTATCAGATTATCGCCCAATTAGTCTATGCAATGTCTCTTATAAAATAATAACTAAGGTCATAGCTAATAGACTTAAGGGTGTGTTA
AATGAGATAATCGATGAATGTCAATCTGCGTTCATTCCTGGTAGATCGATATCTGATAACATGATCTTGAGTCATGAGACGCTTCATTTTCTTAAAAAAAAAAAGCGGAA
GGGAAAAGTTGGTTATGCTGCGCTAAAACTAGATATGAGCAAGGCCTATGATAGGGTGGAGTGGACATATTTGAGACAAATCATGGAAAAGTTGGGGTTTCATGATTGTT
GGATCAAGTTAGTTATGAAATGTATTACAACCGCCTCTTTTTCTATCATTTTAAATGGGGAAACCTTTGGGTTCATTAGACCAACTCGTGGAATTCGTCAAGGCGATCCT
TTATCACCTTACTTGTTTTTACTATGTTCCAAAGGCCTGTCGGCTTTGTTGGTGGATGCGAGAAATAATTCAATGGCCGGGGTGTCCATAGCGCGTGAGTGTCCTAAAAT
TTCGCATTTATTCTTTGCGGATGATAGTTTGGTTTTTTTTAAAGCCAAGGCGGAGGAGTTTGGATTTTTCAGATCTATTTTGTTAGATTATGAAAAGGCTTCTGGGCAAT
GTGTTAATTTTGCAAAATCAATGGTGTTCTTCTCGGGGAATATTCCAAGTGACACCAAACAATATCTTAGTCATATTATGTCTATGAATATGTCTGGGTCCTTGGGATCC
TACCTTGGATTGCCATCAACATTCCATAGAGGGAAAACTCGAGATTTCCAGTTCATTTTGGATAGGGTCTGGGCTGTGCTTCAAGGATGGAAGAGCCAATTTTTTTCACA
GGGAGGGAAAGAGGTTCTGATAAAATCTATTATTCAAGCGATTCCAACCTATGCCCTGGGGTGTTTCCGGATTCCGAAAGGTATTCTGACAAAGATTACGACTTTATGCT
CTAGATTCTGGTGGGGCTCTCAGGGTGAAAAGCGAAGTATGCATTGGAAGAGATGGGAATAG
Protein sequence
MNQMLLAPYTREEIITAMRNFHPTKAPGSDGFPTLFYQKYWDVVGNKTVSDCLAILNSEGSIEDWNHTNIVLIPKGRQPRLVSDYRPISLCNVSYKIITKVIANRLKGVL
NEIIDECQSAFIPGRSISDNMILSHETLHFLKKKKRKGKVGYAALKLDMSKAYDRVEWTYLRQIMEKLGFHDCWIKLVMKCITTASFSIILNGETFGFIRPTRGIRQGDP
LSPYLFLLCSKGLSALLVDARNNSMAGVSIARECPKISHLFFADDSLVFFKAKAEEFGFFRSILLDYEKASGQCVNFAKSMVFFSGNIPSDTKQYLSHIMSMNMSGSLGS
YLGLPSTFHRGKTRDFQFILDRVWAVLQGWKSQFFSQGGKEVLIKSIIQAIPTYALGCFRIPKGILTKITTLCSRFWWGSQGEKRSMHWKRWE
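The protein sequence should correspond to a translation of the CDS. A minimal Python sketch of that check, assuming the standard nuclear genetic code applies (the record does not name a translation table); `translate` is a hypothetical helper, and the two fragments are the first 24 and last 15 nucleotides of the CDS above:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


start_peptide = translate("ATGAATCAGATGTTGTTGGCACCT")  # "MNQMLLAP"
end_peptide = translate("AAGAGATGGGAATAG")             # "KRWE*"
```

Both fragments agree with the start (MNQMLLAP) and end (KRWE) of the listed protein, with the final TAG codon read as the stop.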