CuGenDBv2

Lag0023160 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0023160
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:45084472..45086848
RNA-Seq Expression: Lag0023160
Synteny: Lag0023160
Gene Ontology terms: NA
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR026960 - Reverse transcriptase zinc-binding domain
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]; e value: 2.8e-161; % identity: 38.78
Query:  EYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKW
        E+RPISLCNV+Y+++AK +ANRLK +LN+IIS +QSAF+P RLITDNV++G+EC+H +   +  + G+ ALKLD+SKAYDR+EW+++ + M ++GF  KW
Subjt:  EYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKW

Query:  VELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALL---KREESISNI-------------SDDNLVFFKASKEESSSIKKIL
        + LIM+CI +  + +LING P  + KP+RG+RQ  PLSPYLFI+CAE  S LL   +RE+ I  +             +DD+LVF KAS  +   +K I 
Subjt:  VELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALL---KREESISNI-------------SDDNLVFFKASKEESSSIKKIL

Query:  QTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPIY
          Y  +SGQ  N  KS    S      + S I  +  +K       YLGLP   GRNK   F  +K +V   + +W  KLFS GGK+ILIK +AQA+P Y
Subjt:  QTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPIY

Query:  TMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKDN
         MS F+LPKGLC ++ K  A FWWG+ KDK   HW+ W +M  +K +GG+GFRDL  FN+A++AK  WR+ R P+SLMAR ++ +YY  STF  AK   N
Subjt:  TMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKDN

Query:  ASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDLRGKRVAEIIKEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVNDEIIW
         S +W+SI+WG  +  +G +W++G+GK V + +D WI     ++PI  +       VA++I  +  W+ D ++  F+  D ++ILK+   +    DE++W
Subjt:  ASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDLRGKRVAEIIKEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVNDEIIW

Query:  GADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNCKVVR
          DKKG ++VKS Y LA+N+   F      + SS  +WK  W L    K KI +W+A+ N LPT +N+ K+     P+C  C+ + E V H+   CK  R
Subjt:  GADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNCKVVR

Query:  KIW--GNFFPSLSSVYNLCRGWWRFMDFWDYL----SKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETLP
        KIW         S  +N         DF+  +    S+S T  EA +     W IW   N+      ++D+  ++ + ++++   Q V    +       
Subjt:  KIW--GNFFPSLSSVYNLCRGWWRFMDFWDYL----SKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETLP

Query:  SVG--RWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKISRNLDIIMLAVMAIIEGF---NSYRSNSKLTQS
         +   +W PP +   K+NVDAA        G+G I+RD+EG ++  G K+      + +    AI  G    N   S+S + +S
Subjt:  SVG--RWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKISRNLDIIMLAVMAIIEGF---NSYRSNSKLTQS

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]; e value: 1.8e-160; % identity: 42.36
Query:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH
        M ++RPISLCNV+YK+I+KVLANRLK +L  IIS++QSAF+ GRLITDNV+V FE +H L +++ GK+G AA+KLDMSKAYDR+EW +I+++M  MGF  
Subjt:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH

Query:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLK---REESISNIS--------------DDNLVFFKASKEESSSIK
        KW++L+M+CI SVSY IL+NG       P RG+RQ DP+SPY+F++CA+G S+LL    R+  IS +S              DD+L+F KA+ +E  ++ 
Subjt:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLK---REESISNIS--------------DDNLVFFKASKEESSSIK

Query:  KILQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAI
         ILQ YE +SGQKIN+ KS    S N  + K  E+ ++LG  Q      YLGLP+  G++K  +F+ +KERV + L  WKEKL SVGG++ILIK +AQAI
Subjt:  KILQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAI

Query:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKA
        P YTMSCF++PK LC E+  M   FWWG    + K  W SW+ +C +K+ GGMGFR+L+ FN AMLAK  WR+  +P+SL+A+  + +YY      +AK 
Subjt:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKA

Query:  KDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWK---PIRVRDDLRGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLV
          + S  W+SI  G  +   G +W+VGNG+ + I ED W+ +   +K   P +  DD    RV+ +I +E   WK+DV++  FL  +A +IL +P  +  
Subjt:  KDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWK---PIRVRDDLRGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLV

Query:  VNDEIIWGADKKGVFTVKSAYHLAV----NKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENV
          D+IIW  ++KG F+VKSAY++AV    N E   S+SG    S   +W+ +W L   PK +I  WK   NALPT  N+ +KGV+   VC  C  + E+ 
Subjt:  VNDEIIWGADKKGVFTVKSAYHLAV----NKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENV

Query:  EHIFWNCKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSL---TVREAGIASQLIWAIWKEMNQIKHSS-SRTDTSKISLEIEAMVDRLQMVDSY
         HIF  C+V +++W  +  + + + N+       MD  D   K L   T  +  I   + WAIW   N+I   S S+         I+ +++      +Y
Subjt:  EHIFWNCKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSL---TVREAGIASQLIWAIWKEMNQIKHSS-SRTDTSKISLEIEAMVDRLQMVDSY

Query:  QDFTPETLPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGA
            P+   S G+W  P    +KINVD A  +      VG IIRD+ G +  A
Subjt:  QDFTPETLPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGA

XP_030496634.1 uncharacterized protein LOC115712492 [Cannabis sativa]; e value: 8.6e-158; % identity: 39.46
Query:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH
        M ++RPISLCNVIYK+++K LA R K VL  +IS++QSAF+  RLIT+N++V FE +H L ++  G KG +ALKLDMSKA+DR+EW YI+ +M  MGF  
Subjt:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH

Query:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREESISNI-----------------SDDNLVFFKASKEESSSIK
        KW+ +IM+C+ S S+  ++NG      KP RG+RQ DPLSPYLF++C+EGLS LL+ EES+ ++                 +DD+L+F +AS   + ++K
Subjt:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREESISNI-----------------SDDNLVFFKASKEESSSIK

Query:  KILQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAI
        ++L+TY  +SGQ +N +KS+   S N  ++      + LG+  S+    YLGLPA S R+K  MFS +KER+W+ L  W +KLFSVGGK++L+K + Q+I
Subjt:  KILQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAI

Query:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKA
        P Y MSCFRLP   C +L  M ANFWWGS+KD  K HW SW+ +C SK +GGMGFR    FNKA+LAK +WRI   P+SL++R L+ +Y+S + FL+A+ 
Subjt:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKA

Query:  KDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDLRGKRVAEIIKEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVNDE
          + S+ W+ I WGR L  EG ++K+GNG  V    D WI     +KP+   +      V+  I +   W   ++  +F + D D I+ +P      ND 
Subjt:  KDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDLRGKRVAEIIKEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVNDE

Query:  IIWGADKKGVFTVKSAYHLAVN--KEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWN
        +IW     G +TV S +HLA N  +  Q  AS S +    + WK+ W+L+   K KI  W+ M NALP    + ++ V  +  C LC    E+V H  +N
Subjt:  IIWGADKKGVFTVKSAYHLAVN--KEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWN

Query:  CKVVRKIWGN--FFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQ--MVDSYQDF---
        C   RK+W +  F    +  +N+  G     D+  +LS   + ++  +    +WAIW E N++ H      +  I+      + + Q     + Q F   
Subjt:  CKVVRKIWGN--FFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQ--MVDSYQDF---

Query:  ------------------TPETLPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKI
                          + +   S   W+PP     KINVDAA   +  I GVG IIRD  GS+I A  K +
Subjt:  ------------------TPETLPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKI

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]; e value: 4.4e-162; % identity: 41.55
Query:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH
        M E+RPISLCNVIYK+I+KVLANRLK+VL  IIS +QSAFVPGRLITDNV+V +E +H ++ R+ GKKG  ALKLD+SKAYDR+EW +++ IM  MGFP 
Subjt:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH

Query:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE---------------SISNI--SDDNLVFFKASKEESSSIK
         W+E +M+C+ + S+ IL+NG P E+ +P RGIRQ DP+SPYLF++CAEGL+ALL + E                I+N+  +DD+L+F +A++ E  +I 
Subjt:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE---------------SISNI--SDDNLVFFKASKEESSSIK

Query:  KILQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAI
        +ILQ YE +SGQ INL KS    S N  E +  +I ++LGVK+ +    YLGLP   GR K   FS LK+RVWK LQ WK  L S  GK+ILIK +AQAI
Subjt:  KILQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAI

Query:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKA
        P YTMS F++P  LC EL  +CA FWWG   ++RK HW SW  +   K++GGMGFRDL  FN AMLAK  WR+ +   SL+ R  + +Y+  S+FL+AK 
Subjt:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKA

Query:  KDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIR-VRDDLRGKRVAEIIK-EDGSWKEDVIKAFFLTSDADSILKMPKRNLVVN
          N S VW+S++  + +   G+ W+VGNG  +   +D W+ +    K +  V+ D     VAE+I  E   W  + I+A F   +A++I ++P     V 
Subjt:  KDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIR-VRDDLRGKRVAEIIK-EDGSWKEDVIKAFFLTSDADSILKMPKRNLVVN

Query:  DEIIWGADKKGVFTVKSAYHLAVNKEAQFSASG-SETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFW
        D I W    +G+F+VKSAYH+A       +  G S    + +IW +IW L+   K K+  W+A    LPT  N+  + +  +  C +C  + E+  H  W
Subjt:  DEIIWGADKKGVFTVKSAYHLAVNKEAQFSASG-SETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFW

Query:  NCKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDF--TPET
        +C  V+ IW     S   +     G    +   + L + L   E  +     W +W + N + H       S ++L  E  +   +   +  D   T ++
Subjt:  NCKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDF--TPET

Query:  LPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGA
        +  +  W PP   ++K+N DAA F D+G  G G IIR+ +G ++ A
Subjt:  LPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGA

XP_030931246.1 uncharacterized protein LOC115957168 [Quercus lobata]; e value: 6.6e-158; % identity: 39.08
Query:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH
        M +YRPISLCNVIYK+I+KVLAN+LK++L  IIS +QSAFVP RLITDN++V +EC+HA++ R+ GKKG  ALKLD+SKAYDR+EW +++ IM  MGFP 
Subjt:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH

Query:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE---------------SISNI--SDDNLVFFKASKEESSSIK
         W++ +M+C+ + S+ + ING P     P RGIRQ DPLSPYLF++CAEG ++LL + E                ISN+  +DD+LVF +A++ E   + 
Subjt:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE---------------SISNI--SDDNLVFFKASKEESSSIK

Query:  KILQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAI
        +IL+ Y  +SGQ INL KS    S N       EI  +LGV++     +YLGLP   GR+K   FS LK+R+WK LQ WK KL S  GK++LIK +AQ+I
Subjt:  KILQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAI

Query:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKA
        P YTM  F LP  LC ELN MCA FWWG   D+RK HW SW +M   K +GGMGFRD+  FN AMLAK  WR+ +D  SL+    + +Y+   +FL+AK 
Subjt:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKA

Query:  KDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDLRGK-RVAEIIKED-GSWKEDVIKAFFLTSDADSILKMPKRNLVVN
          N+S VWKSI+  + +   G  W+VG G  + +  + WI +      I    ++  + RV E+I  + G+W +++I+  F   DAD+IL++P     V 
Subjt:  KDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDLRGK-RVAEIIKED-GSWKEDVIKAFFLTSDADSILKMPKRNLVVN

Query:  DEIIWGADKKGVFTVKSAYHLA--VNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIF
        D I W  ++   ++VKS Y +A  V KEA      S    S  +W+ IW +    K K+ +W+A    LPT  N+ +K V  +  C LC+C  E   H+ 
Subjt:  DEIIWGADKKGVFTVKSAYHLA--VNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIF

Query:  WNCKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETL
        W C V + IW      L   ++       F+   + L   L+  E  +     W IW + N +       D S++       ++  +M       +  + 
Subjt:  WNCKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETL

Query:  PSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLI---GAGGKKISRNLDIIMLAVMAIIE-----GFNSY---RSNSKLTQSFARMKLWSS
         S  RW PP    +K+N DAA F D    G G +IR++ G ++    A G  IS + +  +LA    +E     GF        NS +    +  +++ S
Subjt:  PSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLI---GAGGKKISRNLDIIMLAVMAIIE-----GFNSY---RSNSKLTQSFARMKLWSS

Query:  RMRRIW
        R+  ++
Subjt:  RMRRIW

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EMZ0 Reverse transcriptase domain-containing protein; e value: 5.2e-161; % identity: 40.75
Query:  EYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKW
        E+RPISLCNVIYK+I+KVLANRLK +L  I+ +SQSAF+PGRLITDN++V FE +H + ++++GK G  ALKLDMSKAYDR+EW Y++ +M  MGF +KW
Subjt:  EYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKW

Query:  VELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE--------SISN---------ISDDNLVFFKASKEESSSIKKI
        V L+M CI +VSY IL+NG P    KP RG+RQ DPLSPYLF++CAEGL +L+++E+        SIS           +DD+L+F KA+ ++   I+ I
Subjt:  VELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE--------SISN---------ISDDNLVFFKASKEESSSIKKI

Query:  LQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPI
        L  YE +SGQ++N  K+    SK+       +I  +LGV        YLGLP+  GR K   F+++KERVW  L+ WKEKL S  G++ILIK +AQAIP 
Subjt:  LQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPI

Query:  YTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKD
        Y MSCFRLP  L  E+  +   FWWG   DK K HW  WR +C SK  GGMG RDL  FN+A+LAK  WR+  +PSSL ++  + KY+   + L+A+ + 
Subjt:  YTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKD

Query:  NASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIR-VRDDLRGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVNDE
          S  WKSI+  R L  +G  W+VG G H+ I  D W+        +      +    V  +I  E  SWK +++K  FL  +A  IL +P       D 
Subjt:  NASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIR-VRDDLRGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVNDE

Query:  IIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNCK
        ++W A K G +TV+S YHL +N+  Q   S S+T     +W +IWSL   PK +  LW+A  N+LPT  N+  + + A+P C  C  + E+  H  W CK
Subjt:  IIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNCK

Query:  VVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETLPSVG
         ++ +W +  P    +  +   +  F+D      ++L+  E  + S   W IW   N+++      + S++   I   +D L    + Q+  P+  P   
Subjt:  VVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETLPSVG

Query:  R-----WTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKI
              W PP+E ++K+N D A F +    GVG IIR+  G ++G+   +I
Subjt:  R-----WTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKI

A0A2N9GI95 Reverse transcriptase domain-containing protein; e value: 4.0e-161; % identity: 40.7
Query:  EYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKW
        ++RPISLCNVIYK+I+KVLANRLK +L  I+S+SQSAFVPGRLITDN++V FE +H + +++ G+ G  ALKLDMSKAYDR+EW Y++R+M  MGF  KW
Subjt:  EYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKW

Query:  VELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE--------SISN---------ISDDNLVFFKASKEESSSIKKI
        V ++M CI +VSY IL+NG P    KP RG+RQ DPLSPYLF++CAEG  +LL++E+        SIS           +DD+L+F KA+  +   I+ I
Subjt:  VELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE--------SISN---------ISDDNLVFFKASKEESSSIKKI

Query:  LQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPI
        L  YE +SGQ+IN  K+    SK+  ++  + +  +LGV        YLGLP+  GR K   F+++KERVW  L+ WKEKL S  G++ILIK +AQAIP 
Subjt:  LQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPI

Query:  YTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKD
        Y MSCFRLP  L  E+  +   FWWG S DK K HW SW  +C SK  GG+GFR+L  FN+A+LAK  WR+  +PSSL  +  + KY+   + L+A    
Subjt:  YTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKD

Query:  NASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVR--DDLRGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVND
         +S  WKSI+  R L  +G  W+VGNG  + I  D W+ S  C + I            VA +I  +  +WKE++I+  FL  DA +I+ +P  +   +D
Subjt:  NASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVR--DDLRGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVND

Query:  EIIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNC
         ++WG  + G + V+S YHL + ++AQ      +T    ++W SIWSL+  PK +  LW+A   +LPT  N+  + +  +P C  C  + E   H  WNC
Subjt:  EIIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNC

Query:  KVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETL---
        K ++ +W    P    +  +   +  FMD W   ++ L+  E  + S + W IW   N+++        ++I    + ++ +     + QD  P +L   
Subjt:  KVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETL---

Query:  --PSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGA
           +V +W PP E ++K+N D A F +    G+G IIR++ G+++G+
Subjt:  --PSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGA

A0A2N9H567 Reverse transcriptase domain-containing protein; e value: 2.5e-163; % identity: 39.61
Query:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH
        M ++RPISLCNV+YK+I+KVLANRLKKVLN++IS++QSAFVPGRLITDN++V FE +H L  +R GK    A+KLDMSKAYDR+EWD+IR +M+ MGF  
Subjt:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH

Query:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREESISNI-----------------SDDNLVFFKASKEESSSIK
        +WV LIM CI+SVSY I+ING P    KP RGIRQ DPLSPYLF++CAEGL+ALL+  E+   +                 +DD+L+F++A+  ES ++ 
Subjt:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREESISNI-----------------SDDNLVFFKASKEESSSIK

Query:  KILQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAI
         IL  YE +SGQK+N  K+    S N        I   L    +  +G YLGLP   GR K   F  +K +V K LQ WK KL S  G++ILIK +AQAI
Subjt:  KILQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAI

Query:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKA
        P++TMSCF+LP  LC E+N M   FWWG  + +RK HW  W ++C  K  GG+GFRDL  FN+A+LAK  WRI ++ ++L+ + L+ KY+ + +FL+A+ 
Subjt:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKA

Query:  KDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDL-RGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVN
          ++S  W+S+   R +   G +W++G G  V I  D W+ +   +K +  R  L     V+++I  E   WK  +I   F+  +A  I  +P  +L + 
Subjt:  KDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDL-RGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVN

Query:  DEIIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWN
        D ++W     G F+ +SAY + +  +   S S S      + WK IW +    K K  +W+A ++ALPT  N+ K+GV ++  C +C  + E V H  W 
Subjt:  DEIIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWN

Query:  CKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETLPS
        C+  R++W N         +    W    D  D +    +  E  I   + W IW   N      S  + S +  +    V+  + +D+ +       P 
Subjt:  CKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETLPS

Query:  VGR-WTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKISRNLDIIMLAVMAIIE
        V R WTPP E   K+NV    F +  + G+G ++RD  G+L+ A  ++ S+  D + +A  A+I+
Subjt:  VGR-WTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKISRNLDIIMLAVMAIIE

A0A2N9HYE3 Reverse transcriptase domain-containing protein; e value: 5.2e-161; % identity: 40.75
Query:  EYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKW
        E+RPISLCNVIYK+I+KVLANRLK +L  I+ +SQSAF+PGRLITDN++V FE +H + ++++GK G  ALKLDMSKAYDR+EW Y++ +M  MGF +KW
Subjt:  EYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKW

Query:  VELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE--------SISN---------ISDDNLVFFKASKEESSSIKKI
        V L+M CI +VSY IL+NG P    KP RG+RQ DPLSPYLF++CAEGL +L+++E+        SIS           +DD+L+F KA+ ++   I+ I
Subjt:  VELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE--------SISN---------ISDDNLVFFKASKEESSSIKKI

Query:  LQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPI
        L  YE +SGQ++N  K+    SK+       +I  +LGV        YLGLP+  GR K   F+++KERVW  L+ WKEKL S  G++ILIK +AQAIP 
Subjt:  LQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPI

Query:  YTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKD
        Y MSCFRLP  L  E+  +   FWWG   DK K HW  WR +C SK  GGMG RDL  FN+A+LAK  WR+  +PSSL ++  + KY+   + L+A+ + 
Subjt:  YTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKD

Query:  NASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIR-VRDDLRGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVNDE
          S  WKSI+  R L  +G  W+VG G H+ I  D W+        +      +    V  +I  E  SWK +++K  FL  +A  IL +P       D 
Subjt:  NASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIR-VRDDLRGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVNDE

Query:  IIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNCK
        ++W A K G +TV+S YHL +N+  Q   S S+T     +W +IWSL   PK +  LW+A  N+LPT  N+  + + A+P C  C  + E+  H  W CK
Subjt:  IIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNCK

Query:  VVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETLPSVG
         ++ +W +  P    +  +   +  F+D      ++L+  E  + S   W IW   N+++      + S++   I   +D L    + Q+  P+  P   
Subjt:  VVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETLPSVG

Query:  R-----WTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKI
              W PP+E ++K+N D A F +    GVG IIR+  G ++G+   +I
Subjt:  R-----WTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKI

A0A7N2LIH6 Uncharacterized protein; e value: 5.2e-161; % identity: 38.75
Query:  EYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKW
        E+RPISLCNVIYK+I+KVLANRLKKVL+ +I ++QSAFVPGR+ITDNV+V FE +H++N RR GK+G+ A+KLDMSKAYDR+EW Y+  +M  MGF  +W
Subjt:  EYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKW

Query:  VELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE---------------SISNI--SDDNLVFFKASKEESSSIKKI
        + LIM C+ SVS+ +LING P+  F P RG+RQ DP+SPYLF++C EGLSA++K++E                IS++  +DD+++F +A+ +E   + K+
Subjt:  VELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREE---------------SISNI--SDDNLVFFKASKEESSSIKKI

Query:  LQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPI
        L+ YE  SGQK+N  K+    S+N  +        + G +       YLGLP   GR K   F+R+K++V + +  WK KL S  G+++LIK +AQA P 
Subjt:  LQTYEMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPI

Query:  YTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKD
        YTM+ F+LP  LC ELN M  +FWWG    ++K  W SW+N+C  K  GGMGF+DL+ FN A+LAK  WR+ ++P+SL  R L+ KY++ S+F++A+   
Subjt:  YTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKD

Query:  NASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRD-DLRGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVNDE
          S +W+SI+  +++  EG +W VG+G+ + I +  W+ S    K +  R   ++G+RVA +I +E G WK  +++  F+  +A+ IL +P  ++ + D 
Subjt:  NASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRD-DLRGKRVAEII-KEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVNDE

Query:  IIWGADKKGVFTVKSAYHLAVN-----KEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHI
        ++W     G FTVKSAY  A       +E + +   S+     +IWK+IW L+   K K  LW+A    LPT   +  + +  +  C  C  + E   H 
Subjt:  IIWGADKKGVFTVKSAYHLAVN-----KEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHI

Query:  FWNCKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPET
         WNC V ++ W     ++ +  ++      F+D    L +S   ++    + + W++W   N ++H         I+ E     + ++     +   P+ 
Subjt:  FWNCKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPET

Query:  LPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKI
        +P   RW+PP +  +K+NVDAA F + G  G+G +IR+++G ++GA  KK+
Subjt:  LPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKI

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein; e value: 2.1e-26; % identity: 25.61
Query:  DEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHK
        + +RPISL N+  K++ K+LANR+++ +  +I   Q  F+PG     N+      I  +N  R+  K    + +D  KA+D+I+  ++ + +  +G    
Subjt:  DEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHK

Query:  WVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREESISNI------------SDDNLVFFKASKEESSSIKKILQTY
        ++++I    +  +  I++NG   E F  K G RQ  PLSP LF +  E L+  +++E+ I  I            +DD +V+ +     + ++ K++  +
Subjt:  WVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREESISNI------------SDDNLVFFKASKEESSSIKKILQTY

Query:  EMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSR----LKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPI
           SG KIN+ KS   +  N  +T++  + ++     S  I  YLG+  Q  R+   +F      L + + +    WK    S  G+  ++K+      I
Subjt:  EMSSGQKINLSKSLCMISKNIGETKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSR----LKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPI

Query:  YTMSC--FRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISW
        Y  +    +LP     EL K    F W   + +      S +N     + GG+   D +L+ KA + K +W
Subjt:  YTMSC--FRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISW

P08548 LINE-1 reverse transcriptase homolog; e value: 9.6e-27; % identity: 25.14
Query:  DEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHK
        + YRPISL N+  K++ K+L NR+++ +  II   Q  F+PG     N+      I  +N  ++  K    L +D  KA+D I+  ++ R +  +G    
Subjt:  DEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHK

Query:  WVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREESISNI------------SDDNLVFFKASKEESSSIKKILQTY
        +++LI       +  I++NG+  + F  + G RQ  PLSP LF +  E L+  ++ E++I  I            +DD +V+ + +++ ++ + ++++ Y
Subjt:  WVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREESISNI------------SDDNLVFFKASKEESSSIKKILQTY

Query:  EMSSGQKINLSKSLCMISKNIGE---TKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIK--VIAQAIP
           SG KIN  KS+  I  N  +   T    I   +  K+   +G YL    +    +   +  L++ + + +  WK    S  G+  ++K  ++ +AI 
Subjt:  EMSSGQKINLSKSLCMISKNIGE---TKASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIK--VIAQAIP

Query:  IYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISW
         +     + P     +L K+  +F W   K +      S +N     + GG+   DL L+ K+++ K +W
Subjt:  IYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISW

P0C2F6 Putative ribonuclease H protein At1g65750; e value: 4.9e-39; % identity: 24.27
Query:  LPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGG
        +P    R     F  + ERV   +  W+EK  S  G+  L K +  ++P+++MS   LP+ +   L+++   F WGS+ +K+K H   W  +C  K++GG
Subjt:  LPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGG

Query:  MGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYY----SESTFLKAKAKDNASIVWKSIVWG-RSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWK
        +G R  +  N+A+++K+ WR+ ++ +SL    L+ KY+     +S +L  K   + S  W+SI  G R + + G  W  G+G+ +    D W+      K
Subjt:  MGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYY----SESTFLKAKAKDNASIVWKSIVWG-RSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWK

Query:  PIRVRDDLRGKRVAE---IIKED-----GSWKEDVIKAFFLTSDADSILKMPKRNLV--VNDEIIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSS
        P+   D+  G+R  +   ++ +D       W    I   + T++    L+    +LV    D + W   + G F+V+SAY +    E           + 
Subjt:  PIRVRDDLRGKRVAE---IIKED-----GSWKEDVIKAFFLTSDADSILKMPKRNLV--VNDEIIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSS

Query:  ISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNCKVVRKIWGNFFPS--LSSVYNLCRGWWRFMDFWDYLSK
         S +  +W ++   + K  LW   + A+ T +   ++ + A+ VC +C+   E++ H+  +C     IW    P       ++     W + +  D  S 
Subjt:  ISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNCKVVRKIWGNFFPS--LSSVYNLCRGWWRFMDFWDYLSK

Query:  SLTVREAGIASQLIWAIWKEM--NQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETLPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRD
           +  + I + +IW  WK    N    ++   D  K   E    V R    +     T   +  +  W  P     K+N D A   + G+   G ++RD
Subjt:  SLTVREAGIASQLIWAIWKEM--NQIKHSSSRTDTSKISLEIEAMVDRLQMVDSYQDFTPETLPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRD

Query:  SEGSLIGAGGKKISR
          G+  G     I R
Subjt:  SEGSLIGAGGKKISR

P11369 LINE-1 retrotransposable element ORF2 protein; e value: 2.8e-26; % identity: 26.39
Query:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH
        ++ +RPISL N+  K++ K+LANR+++ +  II   Q  F+PG     N+      IH +N  +   K    + LD  KA+D+I+  ++ +++   G   
Subjt:  MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPH

Query:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALL------------KREESISNISDDNLVFFKASKEESSSIKKILQT
         ++ +I          I +NG   E    K G RQ  PLSPYLF +  E L+  +            K E  IS ++DD +V+    K  +  +  ++ +
Subjt:  KWVELIMNCIESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALL------------KREESISNISDDNLVFFKASKEESSSIKKILQT

Query:  YEMSSGQKINLSKSLCMI-SKNIGETKASEISKVLGVKQSNSIGNYLG--LPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIK--VIAQAI
        +    G KIN +KS+  + +KN    K    +    +  +N    YLG  L  +        F  LK+ + + L+ WK+   S  G+  ++K  ++ +AI
Subjt:  YEMSSGQKINLSKSLCMI-SKNIGETKASEISKVLGVKQSNSIGNYLG--LPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIK--VIAQAI

Query:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEK---GGMGFRDLELFNKAMLAKISWRICRD
          +     ++P     EL      F W + K +            L K+K   GG+   DL+L+ +A++ K +W   RD
Subjt:  PIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEK---GGMGFRDLELFNKAMLAKISWRICRD

P93295 Uncharacterized mitochondrial protein AtMg00310 | 2.6e-32 | 42.76
Query:  AIPIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKE-KGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLK
        A+P+Y MSCFRL K LC +L      FWW S ++KRK  W +W+ +C SKE  GG+GFRDL  FN+A+LAK S+RI   P +L++R LR +Y+  S+ ++
Subjt:  AIPIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKE-KGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLK

Query:  AKAKDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPI
               S  W+SI+ GR L + G    +G+G H  +  D WIM +    P+
Subjt:  AKAKDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPI

Arabidopsis top hits | e value | %identity | Alignment
AT3G09510.1 Ribonuclease H-like superfamily protein | 2.1e-29 | 25.38
Query:  LRGKYYSESTFLKAKAKDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDLRGKRVAEIIKEDGS---WKEDVIKAFFLT
        ++ +Y+ + + L AK +   S  W S++ G +L  +G +  +G+G+++ I  D  I+     +P+   +  +   +  + +  GS   W +  I  F   
Subjt:  LRGKYYSESTFLKAKAKDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDLRGKRVAEIIKEDGS---WKEDVIKAFFLT

Query:  SDADSILKMPKRNLVVNDEIIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPV
        SD   I ++        D+IIW  +  G +TV+S Y L  +  +    + +    SI +   IW+L   PK K  LW+A+S AL T + +  +G+  +P 
Subjt:  SDADSILKMPKRNLVVNDEIIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPV

Query:  CFLCRCKEENVEHIFWNCKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQ--------LIWAIWKEMNQIKHSSSRTDTSKISL
        C  C  + E++ H  + C      W        S  +L R      DF + +S  L   +    S         LIW IWK  N +  +  R   SK  L
Subjt:  CFLCRCKEENVEHIFWNCKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQ--------LIWAIWKEMNQIKHSSSRTDTSKISL

Query:  EIEAMV-DRLQMVDSYQDFTPETLPSVG----RWTPPKEKQWKINVDAAWFDDIGIRGV-GWIIRDSEGSLIGAGGKKISRNLDIIMLAVMAII
          +A   D L    S++  TP     +      W  P     K N DA  FD   +    GWIIR+  G+ I  G  K++   + +     A++
Subjt:  EIEAMV-DRLQMVDSYQDFTPETLPSVG----RWTPPKEKQWKINVDAAWFDDIGIRGV-GWIIRDSEGSLIGAGGKKISRNLDIIMLAVMAII

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.4e-17 | 23.14
Query:  YLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKE
        YLGLP  + +     +  L E++   +  W  +  S  G+  LI  +  ++  + MS FRLP     E++ +C++F W   +   K    +W ++C  K+
Subjt:  YLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKE

Query:  KGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPI
        +GG+G R L+  NK       W       S+   T  G +                 +WK I+  R+L +   K  + NG +       W  +   W  I
Subjt:  KGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPI

Query:  RVRDDLRGKR--VAEIIKEDGSWKEDVIKAFFLTSDADSILKMPKRNLVV--------NDEIIW---GADKKGVFTVKSAYHLAVNKEAQFSASGSETCS
            D+ G R  +   I    S  E V+         D++L++      V         D + W   G   K  F  K  +           A+  E   
Subjt:  RVRDDLRGKR--VAEIIKEDGSWKEDVIKAFFLTSDADSILKMPKRNLVV--------NDEIIW---GADKKGVFTVKSAYHLAVNKEAQFSASGSETCS

Query:  SISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNC
         ++ +K +W   + PK  +  W A+ N L T D +      A+  C LC    E  +H+F+ C
Subjt:  SISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCRCKEENVEHIFWNC

AT4G20520.1 RNA binding; RNA-directed DNA polymerases | 6.0e-16 | 27.2
Query:  LANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKWVELIMNCIESVSYQILIN
        +  RLK ++  +I  +Q++F+PGR+ TDN+V   E +H++  R+ G KG   LKLD+ KAYDRI WDY+   +I  GFP  W+  I              
Subjt:  LANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKWVELIMNCIESVSYQILIN

Query:  GIPQEVFKP-----KRGIRQEDPLSPYL--FIMCAEGLSALLKREESISNISDDNLVFFKASKEESSSIKKILQTYEMSSGQKINLSKSLCMISKNIGET
        G      +P     + G R +D  +P+    + CAE L   + R   I ++  D     K  K  + + +K ++   ++   ++      C+ S ++  T
Subjt:  GIPQEVFKP-----KRGIRQEDPLSPYL--FIMCAEGLSALLKREESISNISDDNLVFFKASKEESSSIKKILQTYEMSSGQKINLSKSLCMISKNIGET

Query:  KASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWK
         A +  + L V +   +   L   A+S  +   M  +  E   +AL   K
Subjt:  KASEISKVLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWK

AT4G29090.1 Ribonuclease H-like superfamily protein | 3.1e-57 | 26.29
Query:  AIPIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKA
        A+P YTM+CF LPK +C ++  + A+FWW + ++ +  HW +W ++   K +GG+GF+D+E FN A+L K  WR+   P SLMA+  + +Y+ +S  L A
Subjt:  AIPIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKA

Query:  KAKDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDLRGK--------RVAEIIKEDG-SWKEDVIKAFFLTSDADSILK
              S VWKSI   + +  +G +  VGNG+ + I    W+ SK     +R++     +        +V+++I E G  W++DVI+  F   +   I +
Subjt:  KAKDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDLRGK--------RVAEIIKEDG-SWKEDVIKAFFLTSDADSILK

Query:  MPKRNLVVNDEIIWGADKKGVFTVKSAYHL---AVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCR
        +      + D   W     G +TVKS Y +    +NK +  S       S   I++ IW  ++ PK +  LWK +SN+LP    +  + +     C  C 
Subjt:  MPKRNLVVNDEIIWGADKKGVFTVKSAYHL---AVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPTLDNIRKKGVDANPVCFLCR

Query:  CKEENVEHIFWNCKVVRKIWG--------------NFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKIS
          +E V H+ + C   R  W               + + +L  V+NL  G       W+  S+        +   L+W +WK  N++       +  ++ 
Subjt:  CKEENVEHIFWNCKVVRKIWG--------------NFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKIS

Query:  LEIEAMVDRLQM---VDSYQDFTPETLPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGK---KISRNLDIIMLAVMAIIEGFNS
           E  ++  ++    +S          S GRW PP  +  K N DA W  D    G+GW++R+ +G +   G +   K+   L+  + A+   +   + 
Subjt:  LEIEAMVDRLQM---VDSYQDFTPETLPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGK---KISRNLDIIMLAVMAIIEGFNS

Query:  YRSNSKLTQSFARMKLWSSRMRRIW
        ++ N  + +S +++ +       IW
Subjt:  YRSNSKLTQSFARMKLWSSRMRRIW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.8e-33 | 42.76
Query:  AIPIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKE-KGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLK
        A+P+Y MSCFRL K LC +L      FWW S ++KRK  W +W+ +C SKE  GG+GFRDL  FN+A+LAK S+RI   P +L++R LR +Y+  S+ ++
Subjt:  AIPIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLSKE-KGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLK

Query:  AKAKDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPI
               S  W+SI+ GR L + G    +G+G H  +  D WIM +    P+
Subjt:  AKAKDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPI


Sequences
CDS sequence
ATGGATGAGTACAGGCCAATCAGCCTATGCAATGTAATCTATAAGGTTATTGCAAAGGTGTTGGCTAACAGGTTGAAAAAGGTGCTCAACTACATTATCTCCCAATCTCA
GTCGGCTTTCGTTCCAGGGAGGCTGATTACTGATAATGTGGTGGTGGGCTTCGAATGCATCCATGCACTAAACAACAGAAGATCAGGCAAGAAAGGGATTGCAGCCTTAA
AGCTCGATATGAGCAAGGCGTATGATCGAATCGAATGGGACTACATCAGAAGAATTATGATACATATGGGCTTTCCTCACAAGTGGGTGGAGCTTATTATGAATTGTATT
GAATCTGTAAGCTATCAAATCCTTATTAATGGCATCCCTCAGGAGGTCTTCAAGCCTAAAAGGGGAATTCGGCAAGAAGACCCGCTCTCGCCATACTTATTCATTATGTG
TGCAGAGGGATTGTCAGCTCTTCTAAAAAGGGAGGAATCGATCTCTAACATCTCTGATGACAACCTTGTTTTCTTCAAAGCCTCTAAAGAAGAGAGCTCTAGTATCAAGA
AGATTCTGCAGACCTACGAGATGTCATCAGGCCAAAAGATCAACCTCTCTAAATCTCTGTGTATGATCAGTAAGAACATTGGCGAGACAAAAGCTTCAGAGATCAGCAAG
GTGTTGGGAGTTAAGCAATCGAACTCCATTGGCAACTATCTTGGTCTCCCAGCCCAATCAGGGAGGAACAAAGGAGTTATGTTCAGCAGACTGAAGGAGAGAGTCTGGAA
AGCGCTCCAAAATTGGAAGGAGAAGCTCTTCTCGGTGGGAGGGAAGAAGATTCTTATAAAAGTGATAGCTCAGGCCATCCCCATCTATACTATGAGCTGCTTTAGGTTAC
CTAAGGGGCTATGCGGGGAGTTAAACAAGATGTGTGCTAATTTCTGGTGGGGTTCGTCGAAAGATAAAAGGAAAGCCCACTGGTCAAGTTGGAGAAACATGTGTCTTAGT
AAAGAGAAAGGGGGGATGGGTTTTAGAGATCTTGAGTTGTTTAATAAAGCTATGTTGGCGAAGATTAGTTGGAGAATTTGCAGGGACCCAAGTAGTCTCATGGCCCGAAC
GCTCAGGGGGAAATACTATAGTGAGAGTACCTTCCTAAAAGCCAAAGCAAAAGATAACGCCTCTATAGTGTGGAAAAGCATTGTGTGGGGTAGATCTCTTTTTGCCGAGG
GGTTCAAGTGGAAAGTAGGAAATGGCAAGCACGTGTATATAGATGAAGATCCCTGGATTATGAGCAAGGGTTGTTGGAAACCAATCAGAGTCAGGGACGATTTAAGGGGC
AAAAGAGTGGCTGAGATTATTAAAGAAGACGGCTCCTGGAAAGAAGATGTCATTAAAGCTTTTTTCCTCACTAGTGACGCTGATTCAATCTTAAAGATGCCTAAGAGGAA
TCTGGTGGTTAACGATGAGATTATTTGGGGAGCGGATAAGAAAGGGGTTTTTACTGTAAAAAGCGCCTATCATCTTGCGGTTAATAAGGAGGCCCAATTCTCAGCGTCAG
GATCAGAGACTTGTTCGTCCATCAGTATTTGGAAATCTATTTGGAGTCTAAAGAGTAGACCGAAAGAGAAAATTCACCTTTGGAAAGCCATGAGCAATGCTCTTCCTACC
TTGGACAATATTAGAAAGAAAGGCGTGGATGCTAATCCTGTTTGCTTTTTGTGCAGGTGCAAGGAGGAGAACGTGGAGCATATTTTTTGGAATTGTAAAGTGGTAAGAAA
GATTTGGGGTAACTTTTTTCCTTCTCTAAGCAGTGTTTATAATCTTTGCAGAGGTTGGTGGAGATTCATGGATTTTTGGGATTACCTCTCAAAGAGTCTAACTGTTCGGG
AAGCTGGGATAGCAAGTCAACTTATTTGGGCTATTTGGAAAGAAATGAACCAGATAAAACATTCCAGCAGCAGGACAGACACTTCAAAAATCAGTTTAGAAATTGAAGCT
ATGGTCGATCGGTTGCAGATGGTAGATTCTTACCAGGATTTTACGCCGGAGACTCTTCCGAGTGTCGGGAGATGGACCCCGCCGAAGGAGAAGCAGTGGAAAATCAATGT
TGATGCTGCTTGGTTCGACGACATTGGGATCAGAGGTGTGGGGTGGATTATCCGAGACTCCGAAGGTTCTCTGATCGGAGCTGGTGGCAAGAAAATCTCAAGAAATTTAG
ACATAATCATGTTAGCAGTCATGGCGATTATTGAAGGCTTCAACAGTTATCGATCAAATTCAAAGCTTACCCAGAGCTTCGCTCGCATGAAGTTGTGGTCGAGTCGGATG
CGGCGGATTTGGTGA
mRNA sequence
ATGGATGAGTACAGGCCAATCAGCCTATGCAATGTAATCTATAAGGTTATTGCAAAGGTGTTGGCTAACAGGTTGAAAAAGGTGCTCAACTACATTATCTCCCAATCTCA
GTCGGCTTTCGTTCCAGGGAGGCTGATTACTGATAATGTGGTGGTGGGCTTCGAATGCATCCATGCACTAAACAACAGAAGATCAGGCAAGAAAGGGATTGCAGCCTTAA
AGCTCGATATGAGCAAGGCGTATGATCGAATCGAATGGGACTACATCAGAAGAATTATGATACATATGGGCTTTCCTCACAAGTGGGTGGAGCTTATTATGAATTGTATT
GAATCTGTAAGCTATCAAATCCTTATTAATGGCATCCCTCAGGAGGTCTTCAAGCCTAAAAGGGGAATTCGGCAAGAAGACCCGCTCTCGCCATACTTATTCATTATGTG
TGCAGAGGGATTGTCAGCTCTTCTAAAAAGGGAGGAATCGATCTCTAACATCTCTGATGACAACCTTGTTTTCTTCAAAGCCTCTAAAGAAGAGAGCTCTAGTATCAAGA
AGATTCTGCAGACCTACGAGATGTCATCAGGCCAAAAGATCAACCTCTCTAAATCTCTGTGTATGATCAGTAAGAACATTGGCGAGACAAAAGCTTCAGAGATCAGCAAG
GTGTTGGGAGTTAAGCAATCGAACTCCATTGGCAACTATCTTGGTCTCCCAGCCCAATCAGGGAGGAACAAAGGAGTTATGTTCAGCAGACTGAAGGAGAGAGTCTGGAA
AGCGCTCCAAAATTGGAAGGAGAAGCTCTTCTCGGTGGGAGGGAAGAAGATTCTTATAAAAGTGATAGCTCAGGCCATCCCCATCTATACTATGAGCTGCTTTAGGTTAC
CTAAGGGGCTATGCGGGGAGTTAAACAAGATGTGTGCTAATTTCTGGTGGGGTTCGTCGAAAGATAAAAGGAAAGCCCACTGGTCAAGTTGGAGAAACATGTGTCTTAGT
AAAGAGAAAGGGGGGATGGGTTTTAGAGATCTTGAGTTGTTTAATAAAGCTATGTTGGCGAAGATTAGTTGGAGAATTTGCAGGGACCCAAGTAGTCTCATGGCCCGAAC
GCTCAGGGGGAAATACTATAGTGAGAGTACCTTCCTAAAAGCCAAAGCAAAAGATAACGCCTCTATAGTGTGGAAAAGCATTGTGTGGGGTAGATCTCTTTTTGCCGAGG
GGTTCAAGTGGAAAGTAGGAAATGGCAAGCACGTGTATATAGATGAAGATCCCTGGATTATGAGCAAGGGTTGTTGGAAACCAATCAGAGTCAGGGACGATTTAAGGGGC
AAAAGAGTGGCTGAGATTATTAAAGAAGACGGCTCCTGGAAAGAAGATGTCATTAAAGCTTTTTTCCTCACTAGTGACGCTGATTCAATCTTAAAGATGCCTAAGAGGAA
TCTGGTGGTTAACGATGAGATTATTTGGGGAGCGGATAAGAAAGGGGTTTTTACTGTAAAAAGCGCCTATCATCTTGCGGTTAATAAGGAGGCCCAATTCTCAGCGTCAG
GATCAGAGACTTGTTCGTCCATCAGTATTTGGAAATCTATTTGGAGTCTAAAGAGTAGACCGAAAGAGAAAATTCACCTTTGGAAAGCCATGAGCAATGCTCTTCCTACC
TTGGACAATATTAGAAAGAAAGGCGTGGATGCTAATCCTGTTTGCTTTTTGTGCAGGTGCAAGGAGGAGAACGTGGAGCATATTTTTTGGAATTGTAAAGTGGTAAGAAA
GATTTGGGGTAACTTTTTTCCTTCTCTAAGCAGTGTTTATAATCTTTGCAGAGGTTGGTGGAGATTCATGGATTTTTGGGATTACCTCTCAAAGAGTCTAACTGTTCGGG
AAGCTGGGATAGCAAGTCAACTTATTTGGGCTATTTGGAAAGAAATGAACCAGATAAAACATTCCAGCAGCAGGACAGACACTTCAAAAATCAGTTTAGAAATTGAAGCT
ATGGTCGATCGGTTGCAGATGGTAGATTCTTACCAGGATTTTACGCCGGAGACTCTTCCGAGTGTCGGGAGATGGACCCCGCCGAAGGAGAAGCAGTGGAAAATCAATGT
TGATGCTGCTTGGTTCGACGACATTGGGATCAGAGGTGTGGGGTGGATTATCCGAGACTCCGAAGGTTCTCTGATCGGAGCTGGTGGCAAGAAAATCTCAAGAAATTTAG
ACATAATCATGTTAGCAGTCATGGCGATTATTGAAGGCTTCAACAGTTATCGATCAAATTCAAAGCTTACCCAGAGCTTCGCTCGCATGAAGTTGTGGTCGAGTCGGATG
CGGCGGATTTGGTGA
Protein sequence
MDEYRPISLCNVIYKVIAKVLANRLKKVLNYIISQSQSAFVPGRLITDNVVVGFECIHALNNRRSGKKGIAALKLDMSKAYDRIEWDYIRRIMIHMGFPHKWVELIMNCI
ESVSYQILINGIPQEVFKPKRGIRQEDPLSPYLFIMCAEGLSALLKREESISNISDDNLVFFKASKEESSSIKKILQTYEMSSGQKINLSKSLCMISKNIGETKASEISK
VLGVKQSNSIGNYLGLPAQSGRNKGVMFSRLKERVWKALQNWKEKLFSVGGKKILIKVIAQAIPIYTMSCFRLPKGLCGELNKMCANFWWGSSKDKRKAHWSSWRNMCLS
KEKGGMGFRDLELFNKAMLAKISWRICRDPSSLMARTLRGKYYSESTFLKAKAKDNASIVWKSIVWGRSLFAEGFKWKVGNGKHVYIDEDPWIMSKGCWKPIRVRDDLRG
KRVAEIIKEDGSWKEDVIKAFFLTSDADSILKMPKRNLVVNDEIIWGADKKGVFTVKSAYHLAVNKEAQFSASGSETCSSISIWKSIWSLKSRPKEKIHLWKAMSNALPT
LDNIRKKGVDANPVCFLCRCKEENVEHIFWNCKVVRKIWGNFFPSLSSVYNLCRGWWRFMDFWDYLSKSLTVREAGIASQLIWAIWKEMNQIKHSSSRTDTSKISLEIEA
MVDRLQMVDSYQDFTPETLPSVGRWTPPKEKQWKINVDAAWFDDIGIRGVGWIIRDSEGSLIGAGGKKISRNLDIIMLAVMAIIEGFNSYRSNSKLTQSFARMKLWSSRM
RRIW