; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007982 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007982
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:9305513..9306904
RNA-Seq ExpressionLag0007982
SyntenyLag0007982
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023871998.1 uncharacterized protein LOC111984613 [Quercus suber]1.6e-11946.4Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR
        +K+ L  ++ + Q+ FV  RLI+DNV++  E +  ++ +RKGK G +A+K+DMSKAYDRVEW  + +IM KLG+ + W+  IM CV SV YAV++N +P+
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR

Query:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS
           TP RGLRQGDPLSPYLFL CAEGL+ + ++       RG+  ++  P LSH+FFADDSLIF +AT ++C  I+ IL  Y+++SGQ +N  K+S   S
Subjt:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS

Query:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK
        +N   E    ++ +   Q  K+   YLG+PS  GR+K   F ++K++V   L GWKEKL S  GK+VLIK VA+A+PTYTMSCFK+ NSIC E+  + ++
Subjt:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK

Query:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW
        FWWG  + + K  W+ W  LC  KD+GG+GFRD+K FN+ALLAK  WR+   PNSL  R  + +YF   +F +A LG NPS  WRSI+  +++ +KG +W
Subjt:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW

Query:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNL
        R+GNG  +++  D W+      +++SP +++
Subjt:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNL

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]3.2e-12047.06Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR
        +K  L  IIS+ Q+ F  +RLI+DNV++ FE +  L+ +  GKEG +AIK+DMSKA+DRVEW FI K+ME++G+   W D +M C+ SV Y++ +N +  
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR

Query:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS
            P RGLRQGDPLSP LFLLCAEGL+ L+N+        G+ IN+ CP ++H+FFADDS++FC+A  ++C ++++IL  Y+EASGQ IN DKSS   S
Subjt:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS

Query:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK
         N  +E   ++ +IL    +     YLG+PS  GR+K+ VF  +K++V + L GWK KL S+GGK++LIK VAQAIPTYTMSCF L   +C ++ ++   
Subjt:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK

Query:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW
        FWWG    + K  WI W  +C+SK  GGLGFR++K FN A+LAK +WRIL  PNSL+ R L+ RYF     L A LGS+PS +WRSI    ++ ++G RW
Subjt:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW

Query:  RIGNGRHVIIDQDPWVAEQGVSRLVSPE-DNLKGRYVAEILD
        R+GNG+ + I +D W+      +++SP+  N +   V+ ++D
Subjt:  RIGNGRHVIIDQDPWVAEQGVSRLVSPE-DNLKGRYVAEILD

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]2.5e-12049.3Show/hide
Query:  KKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPRE
        K  L +IIS+ Q+ F PNRLI+DNV++ FE +  LN + +GKE +++IK+DMSKA+DRVEW FI  +MEKLG+ + WI  IM CV SV Y+V +N     
Subjt:  KKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPRE

Query:  TFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPSK
          TP RG+RQGDPLSP LFLLCAEGL+ L++         G+ I + CP ++H+FFADDSL+FC+A E++C  + +IL+ Y+EASGQ IN DKSS   S 
Subjt:  TFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPSK

Query:  NVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAKF
        N  +E    + +IL          YLG+PS  G++K  VF  VKDRV   L GWK KL S+GG+++LIK VAQA+PTYTMSCF+L  ++C ++  +   F
Subjt:  NVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAKF

Query:  WWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRWR
        WWG    + K  W+ W  +C SK  GG+GFR+I+ FN A+LAK  WRIL  PNSL+AR  + +YF  +  L +  GSNPS  WRSI    D+ +KG RWR
Subjt:  WWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRWR

Query:  IGNGRHVIIDQDPWVAEQGVSRLVSP
        +GNGR + I  D W+      ++VSP
Subjt:  IGNGRHVIIDQDPWVAEQGVSRLVSP

XP_028062862.1 uncharacterized protein LOC114266180 [Camellia sinensis]9.1e-12349.55Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR
        ++K L +IIS  Q+ FV  RLISDNV+  FE    L ++R GKEGH A+K+DMSKAY+RVEW F+  +ME++G+++ ++D I+ C+ SV Y+V VN  P 
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR

Query:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS
        +TF PKRGLRQGDPLSPYLF+LCAEGL++L+ R E      G+ + +  P +SHIFFADDSL+F  A  ++  M+K IL  Y+ ASGQ IN +KS+   S
Subjt:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS

Query:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK
        KNV  +    LR+ + + +  + G YLG+P   GR+K  +F  VKDRVW  L GWKEK  S  G+++LIK+VAQ+IPTY MSCF+L + IC EI+ +   
Subjt:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK

Query:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW
        +WWG   SK K+HW  W  LC  K  GG+GFR +K FN+ALLAK  WR++  P+SLLAR L+ +Y+   SFL+A +GSNPS TWRSI   R L +   RW
Subjt:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW

Query:  RIGNGRHVIIDQDPWVAEQGVSRLVSP-EDNLKGRYVAEILD
        R+G+GR + I  D WV +    R+ SP    L+G  V+++++
Subjt:  RIGNGRHVIIDQDPWVAEQGVSRLVSP-EDNLKGRYVAEILD

XP_030930502.1 uncharacterized protein LOC115956198 [Quercus lobata]4.2e-12046.26Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR
        +K+ L ++I+  Q+ FVP RLISDNVV+ FE +  +  R+KGKEG +AIK+DMSKAYDRVEW ++  IM K+G+ + WI  +M CV +V + + +N  PR
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR

Query:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS
           TP RGLRQGDP+SPYLFLLC EGL+ L+ + E +   RG+ ++K  P +S++FFADDS+IFCRAT ++C+ +  +L TY++ SGQ IN DK+S   S
Subjt:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS

Query:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK
        KN + +    ++++   Q  ++   YLG+P   GR K   F R+KD+V   +  WK KL S  G++VLIK VAQA PTYTM+CFKL +++C+EIN +   
Subjt:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK

Query:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW
        FWWG   S  K  W+ W NLC  K  GG+GF D+K FN ALLAK  WRI + P+SL+ R L+ +YF   SFL A +G  PS  WRS++  + + + G RW
Subjt:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW

Query:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLK-GRYVAEIL
         +G+ + + I +D W+   G  R++SP   +  G  VA ++
Subjt:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLK-GRYVAEIL

TrEMBL top hitse value%identityAlignment
A0A2N9FDP4 Reverse transcriptase domain-containing protein1.2e-12348.19Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR
        +K+ L  IIS++Q+ FVP R I+DN+ + FE +  L +RRKGK  H+A+K+DMSKAYDRVEWIF+ ++M ++G+   WI  +MTCV++  Y+V +N  P 
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR

Query:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS
            P RG+RQGDPLSPYLFLLCAEGL+ LL   E      G+ I ++ P +SH+ FADDSL+FC+ATE++C  +  +L  Y+ AS Q +N +K++   S
Subjt:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS

Query:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK
        KN  E   R ++ +  +Q +     YLG+P+  G++K   F  +K+R+   L+GWKE+L S  G+ +LIKT+AQAIPTYTMSCFKL  + CA+IN + + 
Subjt:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK

Query:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW
        +WWG  R + K HWI W  LCS K+ GG+GFRDI  FN ALLAK  WR+L  P SL A+  + +YF G SFLKA LGSNPS  WRSI+  RDL +KG RW
Subjt:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW

Query:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLKGRYVAEILDS
        +IGNG+ V + +D W    G   L    +  + ++VA+++D+
Subjt:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLKGRYVAEILDS

A0A2N9FNH6 Reverse transcriptase domain-containing protein6.2e-12547.96Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR
        +K  L+ IIS  Q+ FVP RLI+DN+++ FE +  + ++RKG+  H+A+K+DMSKAYDRVEW F+  +M KLG+D+ W++ IM C+ SV Y+V +N  P 
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR

Query:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS
            P RG+RQGDPLSPYLFL+CAEGLT LL + E     +GL I +  P +SH+FFADDSL+FCRA   +C+ + AIL TY++ASGQ +N++K+S   S
Subjt:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS

Query:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK
         N   +    +  +L   S+ +LG YLG+P   GR K   F  +K ++   L GWK KL S  G+++LIK+VAQAIP YTMSCF++ +++C+EIN + +K
Subjt:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK

Query:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW
        FWWG    ++K HW +WSN+C  K  GG+GFRD+ LFNQALLAK  WR+L+ PN+LL R L+ +YF   SF++A +  + S  WRSI   R + +KG RW
Subjt:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW

Query:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLKGR-YVAEILD
        RIGNG  V I +D W++    S+ VS    L     V++++D
Subjt:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLKGR-YVAEILD

A0A2N9G497 Reverse transcriptase domain-containing protein8.9e-12447.51Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR
        +K  L  IIS++Q+ FVP R I+DN+ + FE +  L +RRKGK  H+A+K+DMSKAYDRVEW F+ ++ME++G+   WI  +MTCV++  Y+V +N  P 
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR

Query:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS
            P RG+RQGDPLSPYLFLLCAEGL+ LL R E      G+ + ++ P +SH+ FADDSL+FC+ATE +C  +  +L  Y+ ASGQ +N +K++   S
Subjt:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS

Query:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK
        KN  E     ++ +  +Q +     YLG+P+  G++K   F  +K+R+   L+GWKE+L S  G+ +LIKT+AQAIPTYTMSCFKL  + CA+IN + + 
Subjt:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK

Query:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW
        +WWG  R + K HWI W  LCS K+ GG+GFRDI  FN ALLAK  WR++  P SL A+  + +YF G SFLKA LGSNPS  WRSI+  R+L +KG RW
Subjt:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW

Query:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLKGRYVAEILDS
        +IGNG+   + +D W    G + L    + ++ ++VA+++D+
Subjt:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLKGRYVAEILDS

A0A2N9G8I6 Reverse transcriptase domain-containing protein1.5e-12347.51Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR
        +K  L  IIS++Q+ FVP R I+DN+ + FE +  L +RRKGK  H+A+K+DMSKAYDRVEW F+  +ME++G+   WI  +MTCV++  Y+V +N  P 
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR

Query:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS
            P RG+RQGDPLSPYLFLLCAEGL+ LL R E      G+ + ++ P +SH+ FADDSL+FC+ATE +C  +  +L  Y+ ASGQ +N +K++   S
Subjt:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS

Query:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK
        KN  E     ++ +  +Q +     YLG+P+  G++K   F  +K+R+   L+GWKE+L S  G+ +LIKT+AQAIPTYTMSCFKL  + CA+IN + + 
Subjt:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK

Query:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW
        +WWG  R + K HWI W  LCS K+ GG+GFRDI  FN ALLAK  WR++  P SL A+  + +YF G SFLKA LGSNPS  WRSI+  R+L +KG RW
Subjt:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW

Query:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLKGRYVAEILDS
        +IGNG+   + +D W    G + L    + ++ ++VA+++D+
Subjt:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLKGRYVAEILDS

A0A2N9J3U0 Reverse transcriptase domain-containing protein6.2e-12547.96Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR
        +K  L+ IIS  Q+ FVP RLI+DN+++ FE +  + ++RKG+  H+A+K+DMSKAYDRVEW F+  +M KLG+D+ W++ IM C+ SV Y+V +N  P 
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR

Query:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS
            P RG+RQGDPLSPYLFL+CAEGLT LL + E     +GL I +  P +SH+FFADDSL+FCRA   +C+ + AIL TY++ASGQ +N++K+S   S
Subjt:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPS

Query:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK
         N   +    +  +L   S+ +LG YLG+P   GR K   F  +K ++   L GWK KL S  G+++LIK+VAQAIP YTMSCF++ +++C+EIN + +K
Subjt:  KNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAK

Query:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW
        FWWG    ++K HW +WSN+C  K  GG+GFRD+ LFNQALLAK  WR+L+ PN+LL R L+ +YF   SF++A +  + S  WRSI   R + +KG RW
Subjt:  FWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRW

Query:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLKGR-YVAEILD
        RIGNG  V I +D W++    S+ VS    L     V++++D
Subjt:  RIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLKGR-YVAEILD

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein8.0e-2123.58Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR
        +++ ++ +I   Q  F+P      N+      I  +N  R   + H+ I +D  KA+D+++  F+ K + KLG D  ++  I    +     + +N    
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR

Query:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSS---F
        E F  K G RQG PLSP LF +  E L   + +E+ +   +G+++ K    LS   FADD +++        + +  ++  + + SG  IN  KS    +
Subjt:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSS---F

Query:  MPSKNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIK--TVAQAIPTYTMSCFKLSNSICAEIN
          ++  + + + +L   +  +  K LG  L    ++   +N  ++ +   +      WK    S  G+  ++K   + + I  +     KL  +   E+ 
Subjt:  MPSKNVKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIK--TVAQAIPTYTMSCFKLSNSICAEIN

Query:  KVCAKFWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSW
        K   KF W   R++     I  S L      GG+   D KL+ +A + K +W
Subjt:  KVCAKFWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSW

P0C2F6 Putative ribonuclease H protein At1g657506.1e-2935.78Show/hide
Query:  MPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAKFWWGSSRSKEKSHWIRWSNLCSSKDRGG
        MP    R     F  + +RV + + GW+EK  S  G+  L K V  ++P ++MS   L  SI   ++++   F WGS+  K+K H ++WS +CS K  GG
Subjt:  MPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAKFWWGSSRSKEKSHWIRWSNLCSSKDRGG

Query:  LGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKG---NSFLKAPLGSNPSLTWRSI-VWVRDLFQKGYRWRIGNGRHVIIDQDPWVAEQGVSRL
        LG R  K  N+AL++K+ WR+L+  NSL    L+ +Y  G   +S    P GS  S TWRSI + +RD+   G  W  G+G+ +    D WV+ + +  L
Subjt:  LGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKG---NSFLKAPLGSNPSLTWRSI-VWVRDLFQKGYRWRIGNGRHVIIDQDPWVAEQGVSRL

Query:  VSPE
         + E
Subjt:  VSPE

P11369 LINE-1 retrotransposable element ORF2 protein6.5e-2324.43Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR
        +++ ++ II   Q  F+P      N+      I  +N  +   + H+ I +D  KA+D+++  F+ K++E+ G    +++ I          ++VN    
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPR

Query:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKS-SFMP
        E    K G RQG PLSPYLF +  E L   + +++ +   +G++I K    +S    ADD +++    +     +  +++++ E  G  IN++KS +F+ 
Subjt:  ETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKS-SFMP

Query:  SKN--VKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIK--TVAQAIPTYTMSCFKLSNSICAEIN
        +KN   ++E        +   + K LG  L    ++  +KN  F+ +K  +   LR WK+   S  G+  ++K   + +AI  +     K+      E+ 
Subjt:  SKN--VKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIK--TVAQAIPTYTMSCFKLSNSICAEIN

Query:  KVCAKFWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSW
            KF W + + +     I  S L   +  GG+   D+KL+ +A++ K +W
Subjt:  KVCAKFWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSW

P92555 Uncharacterized mitochondrial protein AtMg012503.6e-1347.76Show/hide
Query:  VNDIPRETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDS
        +N  P+   TP RGLRQGDPLSPYLF+LC E L+ L  R +      G+R++ + P ++H+ FADD+
Subjt:  VNDIPRETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDS

P93295 Uncharacterized mitochondrial protein AtMg003109.7e-3545.89Show/hide
Query:  AIPTYTMSCFKLSNSICAEINKVCAKFWWGSSRSKEKSHWIRWSNLCSSK-DRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLK
        A+P Y MSCF+LS  +C ++     +FWW S  +K K  W+ W  LC SK D GGLGFRD+  FNQALLAK S+RI+  P++LL+R LR RYF  +S ++
Subjt:  AIPTYTMSCFKLSNSICAEINKVCAKFWWGSSRSKEKSHWIRWSNLCSSK-DRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLK

Query:  APLGSNPSLTWRSIVWVRDLFQKGYRWRIGNGRHVIIDQDPWVAEQ
          +G+ PS  WRSI+  R+L  +G    IG+G H  +  D W+ ++
Subjt:  APLGSNPSLTWRSIVWVRDLFQKGYRWRIGNGRHVIIDQDPWVAEQ

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.1e-1226.28Show/hide
Query:  VKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAKFW
        VK+     + H     S      YLG+P    +     +  + +++   +  W  +  S  G+  LI +V  ++  + MS F+L ++   EI+ +C+ F 
Subjt:  VKEEFVRKLRHILEIQSSKELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAKFW

Query:  WGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQ
        W       K   + WS++C+ KD GGLG R +K  N+
Subjt:  WGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQ

AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.5e-1031.71Show/hide
Query:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKI
        +K  + ++I   QA+F+P R+ +DN+V   E + ++  R+KG +G + +K+D+ KAYDR+ W ++   +   G+ + W+ +I
Subjt:  MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKI

AT4G29090.1 Ribonuclease H-like superfamily protein2.2e-3437.36Show/hide
Query:  AIPTYTMSCFKLSNSICAEINKVCAKFWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKA
        A+PTYTM+CF L  ++C +I  V A FWW + +  +  HW  W +L   K  GG+GF+DI+ FN ALL K  WR+L  P SL+A+  + RYF  +  L A
Subjt:  AIPTYTMSCFKLSNSICAEINKVCAKFWWGSSRSKEKSHWIRWSNLCSSKDRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKA

Query:  PLGSNPSLTWRSIVWVRDLFQKGYRWRIGNGRHVIIDQDPWVAEQGVS-----RLVSPEDNLKGRYVAEILDSAGLEGRHYK
        PLGS PS  W+SI   +++ ++G R  +GNG  +II +  W+  +  S     + V P++      + ++ D     GR ++
Subjt:  PLGSNPSLTWRSIVWVRDLFQKGYRWRIGNGRHVIIDQDPWVAEQGVS-----RLVSPEDNLKGRYVAEILDSAGLEGRHYK

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.9e-3645.89Show/hide
Query:  AIPTYTMSCFKLSNSICAEINKVCAKFWWGSSRSKEKSHWIRWSNLCSSK-DRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLK
        A+P Y MSCF+LS  +C ++     +FWW S  +K K  W+ W  LC SK D GGLGFRD+  FNQALLAK S+RI+  P++LL+R LR RYF  +S ++
Subjt:  AIPTYTMSCFKLSNSICAEINKVCAKFWWGSSRSKEKSHWIRWSNLCSSK-DRGGLGFRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLK

Query:  APLGSNPSLTWRSIVWVRDLFQKGYRWRIGNGRHVIIDQDPWVAEQ
          +G+ PS  WRSI+  R+L  +G    IG+G H  +  D W+ ++
Subjt:  APLGSNPSLTWRSIVWVRDLFQKGYRWRIGNGRHVIIDQDPWVAEQ

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.6e-1447.76Show/hide
Query:  VNDIPRETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDS
        +N  P+   TP RGLRQGDPLSPYLF+LC E L+ L  R +      G+R++ + P ++H+ FADD+
Subjt:  VNDIPRETFTPKRGLRQGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAAACTCTTGAAGATATTATCTCTCAAACCCAAGCGACGTTCGTTCCGAATAGGCTTATCTCAGACAACGTGGTGCTCGATTTCGAATGCATTGATGCCCTTAA
CAGCAGACGAAAAGGAAAAGAAGGGCACCTAGCTATAAAAGTTGACATGAGTAAGGCCTACGACAGGGTTGAGTGGATCTTCATTTGGAAGATTATGGAGAAGCTGGGGT
ACGATAAGGGTTGGATCGATAAAATTATGACCTGTGTGGAATCAGTGAAGTATGCAGTTCAGGTTAACGACATCCCTCGAGAGACCTTCACTCCAAAGAGGGGATTGAGA
CAGGGAGATCCTCTTTCGCCTTATCTCTTTCTTTTATGCGCTGAAGGATTGACGACCCTCCTCAATCGAGAGGAAAACCTTAATCACTTCAGGGGTCTTAGAATTAATAA
ACATTGCCCCTCACTATCTCATATATTTTTTGCAGATGACAGCCTCATTTTCTGCAGAGCGACAGAGAAAGACTGCGAGATGATAAAGGCCATCCTCCATACCTACAAAG
AGGCCTCGGGACAAACAATAAACAATGACAAGTCGTCTTTTATGCCTAGTAAAAATGTCAAGGAGGAGTTTGTTCGAAAGCTTCGCCATATCCTTGAGATCCAAAGCTCA
AAGGAGCTGGGCCACTATCTCGGAATGCCCTCTCAGAACGGTAGAAACAAAAACATGGTTTTTAGGAGGGTTAAAGATAGGGTTTGGAATGCCCTTCGGGGGTGGAAAGA
GAAGTTATTCTCGGTTGGGGGTAAGAAAGTCTTAATTAAAACGGTTGCCCAAGCTATTCCTACCTACACCATGTCTTGCTTCAAACTTTCTAACTCCATTTGTGCTGAGA
TTAATAAAGTTTGTGCAAAATTTTGGTGGGGCTCTTCTAGATCAAAGGAAAAATCCCACTGGATCAGGTGGTCAAATCTTTGCTCTAGTAAAGATCGAGGTGGATTGGGC
TTTAGGGACATAAAGCTTTTCAACCAAGCCCTCCTAGCTAAGATGAGCTGGCGCATTCTAAAGTTCCCCAATTCTCTGCTAGCTAGGACCCTTAGAGGTCGTTACTTCAA
GGGCAATTCCTTCCTAAAAGCCCCCCTTGGCTCCAACCCTTCCCTTACGTGGAGGAGCATAGTCTGGGTTCGTGACCTTTTCCAAAAAGGATACAGGTGGAGGATTGGCA
ATGGTCGTCATGTGATTATCGACCAAGACCCTTGGGTAGCTGAGCAAGGGGTTAGCCGCCTAGTTAGTCCAGAGGATAATCTCAAAGGGAGATATGTGGCAGAGATCTTG
GATAGTGCAGGATTGGAAGGAAGACACTATAAAGGAAAGTTTCATGCCTATCGATGCTCTGAATATCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAAACTCTTGAAGATATTATCTCTCAAACCCAAGCGACGTTCGTTCCGAATAGGCTTATCTCAGACAACGTGGTGCTCGATTTCGAATGCATTGATGCCCTTAA
CAGCAGACGAAAAGGAAAAGAAGGGCACCTAGCTATAAAAGTTGACATGAGTAAGGCCTACGACAGGGTTGAGTGGATCTTCATTTGGAAGATTATGGAGAAGCTGGGGT
ACGATAAGGGTTGGATCGATAAAATTATGACCTGTGTGGAATCAGTGAAGTATGCAGTTCAGGTTAACGACATCCCTCGAGAGACCTTCACTCCAAAGAGGGGATTGAGA
CAGGGAGATCCTCTTTCGCCTTATCTCTTTCTTTTATGCGCTGAAGGATTGACGACCCTCCTCAATCGAGAGGAAAACCTTAATCACTTCAGGGGTCTTAGAATTAATAA
ACATTGCCCCTCACTATCTCATATATTTTTTGCAGATGACAGCCTCATTTTCTGCAGAGCGACAGAGAAAGACTGCGAGATGATAAAGGCCATCCTCCATACCTACAAAG
AGGCCTCGGGACAAACAATAAACAATGACAAGTCGTCTTTTATGCCTAGTAAAAATGTCAAGGAGGAGTTTGTTCGAAAGCTTCGCCATATCCTTGAGATCCAAAGCTCA
AAGGAGCTGGGCCACTATCTCGGAATGCCCTCTCAGAACGGTAGAAACAAAAACATGGTTTTTAGGAGGGTTAAAGATAGGGTTTGGAATGCCCTTCGGGGGTGGAAAGA
GAAGTTATTCTCGGTTGGGGGTAAGAAAGTCTTAATTAAAACGGTTGCCCAAGCTATTCCTACCTACACCATGTCTTGCTTCAAACTTTCTAACTCCATTTGTGCTGAGA
TTAATAAAGTTTGTGCAAAATTTTGGTGGGGCTCTTCTAGATCAAAGGAAAAATCCCACTGGATCAGGTGGTCAAATCTTTGCTCTAGTAAAGATCGAGGTGGATTGGGC
TTTAGGGACATAAAGCTTTTCAACCAAGCCCTCCTAGCTAAGATGAGCTGGCGCATTCTAAAGTTCCCCAATTCTCTGCTAGCTAGGACCCTTAGAGGTCGTTACTTCAA
GGGCAATTCCTTCCTAAAAGCCCCCCTTGGCTCCAACCCTTCCCTTACGTGGAGGAGCATAGTCTGGGTTCGTGACCTTTTCCAAAAAGGATACAGGTGGAGGATTGGCA
ATGGTCGTCATGTGATTATCGACCAAGACCCTTGGGTAGCTGAGCAAGGGGTTAGCCGCCTAGTTAGTCCAGAGGATAATCTCAAAGGGAGATATGTGGCAGAGATCTTG
GATAGTGCAGGATTGGAAGGAAGACACTATAAAGGAAAGTTTCATGCCTATCGATGCTCTGAATATCCTTAA
Protein sequenceShow/hide protein sequence
MKKTLEDIISQTQATFVPNRLISDNVVLDFECIDALNSRRKGKEGHLAIKVDMSKAYDRVEWIFIWKIMEKLGYDKGWIDKIMTCVESVKYAVQVNDIPRETFTPKRGLR
QGDPLSPYLFLLCAEGLTTLLNREENLNHFRGLRINKHCPSLSHIFFADDSLIFCRATEKDCEMIKAILHTYKEASGQTINNDKSSFMPSKNVKEEFVRKLRHILEIQSS
KELGHYLGMPSQNGRNKNMVFRRVKDRVWNALRGWKEKLFSVGGKKVLIKTVAQAIPTYTMSCFKLSNSICAEINKVCAKFWWGSSRSKEKSHWIRWSNLCSSKDRGGLG
FRDIKLFNQALLAKMSWRILKFPNSLLARTLRGRYFKGNSFLKAPLGSNPSLTWRSIVWVRDLFQKGYRWRIGNGRHVIIDQDPWVAEQGVSRLVSPEDNLKGRYVAEIL
DSAGLEGRHYKGKFHAYRCSEYP