CuGenDBv2

Lag0010801 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0010801
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:7028047..7030825
RNA-Seq Expression: Lag0010801
Synteny: Lag0010801
Gene Ontology terms: NA
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity
CAB4274760.1 unnamed protein product [Prunus armeniaca] | 8.5e-144 | 36.61
Query:  ILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIEDKEGVWREG
        ++ WR+    NS   I  L +Q++     +  +   +  LE+ LK++L  EE +WK KSR+QWL+ G+KNT+FFH+KV  RR ++ L G+ED  GVW + 
Subjt:  ILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIEDKEGVWREG

Query:  DEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVANFFRRGRML
        +  +  + + YF DLF +S+P  +++      + +  S++ +L+  V+   + + V +++P  +PG DG T  F+Q +W V+ ++V R+V +FF  G++ 
Subjt:  DEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVANFFRRGRML

Query:  RRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMVVKLDMAKAY
        +R+NHT+I LI KV  P  M Q RPI L NV YKIISK+L NR+K VLP L+S  Q+ FV  R I+D+ILV HEI+HS++R KR     + +KLDMAKA+
Subjt:  RRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMVVKLDMAKAY

Query:  DRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNNGPTVANLFF
        DR+EW FL  +MK +GF  ++  W+ EC+S+VSYS+++N  P G I   RGLRQGDPLSP+LF++C+E LT L+    +R  L G+++  +G ++++L F
Subjt:  DRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNNGPTVANLFF

Query:  ADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKREMFENLKCKVADNVRGWAE
        ADDS++FC+A   E + ++++L  Y   SGQ+VN  KS++ +S++  + LR Q+  ++ +      G+YLG+  +FG SKR++FE ++ K+   + GWAE
Subjt:  ADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKREMFENLKCKVADNVRGWAE

Query:  QFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEINV----------------------------RPRGVG-------------KVCWQV--------
        QFLS AGKEVL+K++A+AMP YTM+CFKLP+++CKEI                              +  G+G             K+ W++        
Subjt:  QFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEINV----------------------------RPRGVG-------------KVCWQV--------

Query:  ----------------------------------EVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCLNNAWTNVKDLMCNNGRSWDVSKLQQCLGA
                                           +L  G  WR+G  + V +  DPW+P P TF+ +    +    V +L+    ++W    + +C   
Subjt:  ----------------------------------EVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCLNNAWTNVKDLMCNNGRSWDVSKLQQCLGA

Query:  EDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSLGIQ
        E+ + ++ IPIS     DK +WH    G YTV+ GY+ +L +Q
Subjt:  EDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSLGIQ

XP_010462868.1 PREDICTED: uncharacterized protein LOC104743494 [Camelina sativa] | 4.2e-143 | 37.45
Query:  EKIKNTLMAILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIE
        +KIKN   +I  WRK    +    I  L+  ++E +   +    +I  +E+ LK++   EE YW+ KSR  WLR GDKNT+FF A  K RR R+ + G+ 
Subjt:  EKIKNTLMAILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIE

Query:  DKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVA
        D + VW E    + ++  +YF DLF++S    + ++      +I D+ N  L+ ++S   + KA+FAM+P+K PG DGMTA F+Q  W  ++ ++V +V 
Subjt:  DKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVA

Query:  NFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMV
         FFR GR    +N TNI LI KV  P +M + RPI LCNV+YKIISK+LC R+K+ LP LVSE Q+ FV GRLI+D+ILV  E+ H +    R K  F+ 
Subjt:  NFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMV

Query:  VKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNN
         K DM+KAYDR+EWAFLE VM  +GFD  W  W++ CVSSVSY +++N +P G I  +RGLRQGDPLSPYLF+LC+EVL   +  +     + G  I+ +
Subjt:  VKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNN

Query:  GPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKREMFENLKCKV
         PT+++L FADDS+ FC+A  +E Q +M ++  YG  SGQ VNL KS+++F    P E+RDQL S++GI +   +G YLG+      SK ++F  +K ++
Subjt:  GPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKREMFENLKCKV

Query:  ADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEI-----------NVRPRGVGKVCWQ------------------------------
         D V GW  + LS  GKE+++KS+ALA+P + M+C+KLP  +  ++           N +  G+  V W                               
Subjt:  ADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEI-----------NVRPRGVGKVCWQ------------------------------

Query:  ------------------------------------------VEVLRHGFWWRVGTNSRVNIWRDPWIP--RPLTFKPVQCLNNAWTNVKDLMCNN----
                                                   +++ +G  W VG+ S +++WRDPWIP  RP   +P       W  +  LM N+    
Subjt:  ------------------------------------------VEVLRHGFWWRVGTNSRVNIWRDPWIP--RPLTFKPVQCLNNAWTNVKDLMCNN----

Query:  -GRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQ
          + W +  L++ L   D+ ++  + +S  ++ D+ VWH T SG YTVK GY+
Subjt:  -GRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQ

XP_027166234.1 uncharacterized protein LOC113766221 [Coffea eugenioides] | 5.5e-143 | 37.52
Query:  GGHAYRLTEKIKNTLMAILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFG-KISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRR
        G   Y++  KIKN  +A+L W+     NS  +I+ +++Q+ + ++ R       ++ L+K+L K+   EE +W+ KSR+QWL+EGDKNT+FFHA V+GRR
Subjt:  GGHAYRLTEKIKNTLMAILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFG-KISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRR

Query:  SRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVI
         R+ L  ++ ++G W E +EE+++    Y+  L  S++  +L+++ +     I D  N  L+  V    I+  +F+MNP+KAPG DGM+  F+Q  W  I
Subjt:  SRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVI

Query:  RENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRK
        ++ V++ +  FF  G +L+ +NHT I LI KV +PT + Q RPI LC   YK+I+K+L NR+K VL   + + Q+ F+ GR I D+I+V HE MH +K K
Subjt:  RENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRK

Query:  KRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGC
        K+GK+ FM VKLDM+KAYDR+EW+FLE +M+ MGFD KWR WV+ECV SVSYS  +N +    +I +RG+RQGDPLSPYLF+LCSE L+ L+  +     
Subjt:  KRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGC

Query:  LMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKRE
        L G KIS  GP++ +L FADDS+IFC+A   +  EL  VL  YG  SGQ +NL KS+++FS N   +L D++   MG  Q    G+YLGL++    SK++
Subjt:  LMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKRE

Query:  MFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEINV--------RPRGVGKVCW-------------------------
        +F  +K  +   +  W  + LS AGKE +LKS+ALAMP YTM+CF+LP  +CKEI+            G  K+ W                         
Subjt:  MFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEINV--------RPRGVGKVCW-------------------------

Query:  ---------------------------------QVEVLRHGFW-WR----------------VGTNSRVNIWRDPWIPRPLTFKPVQCLN--NAWTNVKD
                                         + +V  +  W WR                +G   + NIW D WIP  L  +     N  NA   V +
Subjt:  ---------------------------------QVEVLRHGFW-WR----------------VGTNSRVNIWRDPWIPRPLTFKPVQCLN--NAWTNVKD

Query:  LMCNNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQ
        L+C   + W+ + + +    +D   ++ IP+S   + D   W     G Y+V  GY+
Subjt:  LMCNNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQ

XP_028948114.1 uncharacterized protein LOC114820933 [Malus domestica] | 1.6e-153 | 38.51
Query:  EKIKNTLMAILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIE
        +K+K   M++  W +    NS+  I QL+ +IR   +       +I   EK+L+ +   EEAYWK KSR+QWL+EGDKNT+FFHA+   RR  + + G+E
Subjt:  EKIKNTLMAILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIE

Query:  DKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVA
        D  GVW E ++E+  +   YF   F+SS+P  + ++    +  + + DN  L   ++ + I +A F + P +APG DG T  FY+D+W+ + ++V  +V 
Subjt:  DKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVA

Query:  NFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMV
         F+  G++LR++NHTN+VLI KV  P  M Q RPI LCNV YKII+K+L NR+K V+ K++ E Q+ FV G+ I D+ILV HEI+HS+  +K+G +  M 
Subjt:  NFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMV

Query:  VKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNN
        +KLDMAKAYDR+EW FL  +M  +GF P +   + EC+SSVS+S+++N  P G I  ERGLRQGDPLS +LF+LC+E  + L+  S+  G L G K++ +
Subjt:  VKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNN

Query:  GPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKREMFENLKCKV
        G  +++LFFADDS++F  AT  E Q ++ VL+ Y   SGQ +NL KS+ +F S +    + ++   +GI+     G+YLGL+ +FG+SK+ +F  ++ K+
Subjt:  GPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKREMFENLKCKV

Query:  ADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEI-----------NVRPRGV------------------------------GKVCWQ
           + GW+EQFLS+AGKEVL+K++A+A+P Y M+CFKLP+ +C+++           N + +GV                               K+ W+
Subjt:  ADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEI-----------NVRPRGV------------------------------GKVCWQ

Query:  V------------------------------------------EVLRHGFWWRVGTNSRVNIWRDPWIPRPLTF--KPVQCLNNAWTNVKDLMCNNGRSW
        +                                           VL+HG  WRVG  +++NI  DPW P+P TF  KP  CL    T V DL+  N RSW
Subjt:  V------------------------------------------EVLRHGFWWRVGTNSRVNIWRDPWIPRPLTF--KPVQCLNNAWTNVKDLMCNNGRSW

Query:  DVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSLGI
            + Q   +ED   ++ IP+S     D+ VWHHT  G Y+VK GY  ++ +
Subjt:  DVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSLGI

XP_028962235.1 uncharacterized protein LOC114826307 [Malus domestica] | 1.8e-157 | 38.83
Query:  GGHAYRLTEKIKNTLMAILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRS
        G HAYR  EKIK    ++  W K+T  NS+ +++ L+ +IR     +      + Q EKDL+ +   EE YWK KSR QWL EGDKNT+FFHA+   RR 
Subjt:  GGHAYRLTEKIKNTLMAILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRS

Query:  RSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIR
         + + GIED +GVW E D E+A   + YF DLF+SS+P ++DD+    +  I   DN AL   V+   I  AV  + P +APG DG +  FYQD+W  + 
Subjt:  RSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIR

Query:  ENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKK
        E+VV+++  F+  G +LR++NHTN+VLI KV  P  M Q RPI LCNV YKI++KVL NR+K V+PK++ + Q+ FV G+ + D+ILV HE++HS+  ++
Subjt:  ENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKK

Query:  RGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCL
        R  +  M +KLDMAKAYDR+EW FL  +M  +GF P +  W+  C+S+VS+S+VVN  P G I+ +RGLRQGDPLSP+LF+LC+E L+ +L   + +G L
Subjt:  RGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCL

Query:  MGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKREM
         G+K +  G  + +LFFADDS++F  AT  + + +   L  Y G SGQ +NL+KS+V FS  +PN  + ++   +GI      G+YLGL+ +FG SK+ +
Subjt:  MGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKREM

Query:  FENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEI-----------NVRPRGVGKVCWQ----------------------
        F  ++ KV   + GW EQFLS AGKEVL+KS+A+A+P Y M+CFKLP+ +C++I           N +  G+  V W+                      
Subjt:  FENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEI-----------NVRPRGVGKVCWQ----------------------

Query:  --------------------------------------------------VEVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCLNNAWTNVKDLMC
                                                           +VL  G  WRVG    +NI  DPW P+P +F+     N   T V DL+ 
Subjt:  --------------------------------------------------VEVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCLNNAWTNVKDLMC

Query:  NNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSLGIQCVEGPSRGNQ
            SW  + +      ED   ++ IP+S     D+ +W H+ +G+Y+VK GY   + ++ +E  + G +
Subjt:  NNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSLGIQCVEGPSRGNQ

TrEMBL top hits | e value | % identity
A0A2N9E147 Reverse transcriptase domain-containing protein | 3.5e-143 | 39.48
Query:  GGHAYRLTEKIKNTLMAILSW-------RKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHA
        G   Y++ +KIK   + +L W       R K+  +    ++Q EE +  +Q++ T +      L K+L   L  EEAYW+ +SRV WL+EGDKNT+FFHA
Subjt:  GGHAYRLTEKIKNTLMAILSW-------RKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHA

Query:  KVKGRRSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQ
            RR  + +  + D++G    GDE +  V   YF +LF +S P  +D +    ++++    N  L+   +   ++ A+F M P KAPG DGM+A F+Q
Subjt:  KVKGRRSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQ

Query:  DNWQVIRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIM
          W ++   + R V +     RMLR +N+T+IVLI KV  P  M+Q RPI LCNV YK+ISKV  NR+K+ LP ++S+ Q+ FV GRLI+D++L+  E +
Subjt:  DNWQVIRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIM

Query:  HSMKRKKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLND
        H MK K++G    +  KLDM+KAYDRIEW +L+ VM  MGFD +W   V+ECVS+ S+S+++N  P G I   RGLRQGDPLSPYLF++C+E  T L+ +
Subjt:  HSMKRKKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLND

Query:  SISRGCLMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEF
        ++++  + G  I+  GP  ++LFFADDSI+F +A+  E Q L  VL  Y   SGQ +N+ K+++ FSSN+ + +++++ S +G   +++L +YLGL    
Subjt:  SISRGCLMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEF

Query:  GLSKREMFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKE----INVRPRGVGKVCWQV-----EVLRHGFWWRVGTNSR
        G SKR+ FE++K ++   V GW E+ LS AGKE+L+K++A A+P Y M+ F+LP S+C E    IN +        W+      E++  G  WRVG  + 
Subjt:  GLSKREMFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKE----INVRPRGVGKVCWQV-----EVLRHGFWWRVGTNSR

Query:  VNIWRDPWIPRPLTFK---PVQCLNNAWTNVKDLMCNNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGY
        + IW+D WIP   TFK   P+  L    T V  L+  + R W+   +       +   +  IP+S R   D  +W  T  G+Y+V+  Y
Subjt:  VNIWRDPWIPRPLTFK---PVQCLNNAWTNVKDLMCNNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGY

A0A2N9H680 Reverse transcriptase domain-containing protein | 3.2e-144 | 38.1
Query:  GGHAYRLTEKIKNTLMAILSWRK-KTNVNSEVEIKQLEEQIREEQDKRTPDF--GKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKG
        G   +R+ +KIKN  M +L W + + ++N  + I++ + ++ + +     ++   +++ L +++   +  EE +W+ +SRV WL+EGD+NT+FFHA    
Subjt:  GGHAYRLTEKIKNTLMAILSWRK-KTNVNSEVEIKQLEEQIREEQDKRTPDF--GKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKG

Query:  RRSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQ
        R+  + ++G+ D  GVW+     ++++ + YF  LF SS P  + ++  + D ++  + N AL+ ++S   I  A+F M P KAPG DGMTA F+Q  W 
Subjt:  RRSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQ

Query:  VIRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMK
        ++ E+V   + +FF  GRML  +N TNIVLI KV  P  M+Q RPI LCNV YKI SKVL NR+K +LPK++S+ Q  FV GRLISD++++  E++H +K
Subjt:  VIRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMK

Query:  RKKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISR
            G+   M VKLDM+KAYDR+EW FL+ ++  +GF  +W   ++ CV++ SY+++VN  PHG I   RGLRQGDPLSPYLF+LC+E L+ L+  +   
Subjt:  RKKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISR

Query:  GCLMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSK
          + G  I   GP +++LFFADDSIIFCRA+  +G+ + S+L+ Y   SGQ +N+ K+   FS N+PN +R ++ S+   + S +  +YLGL    G SK
Subjt:  GCLMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSK

Query:  REMFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEINV--------RPRGVGKVCWQV---------------------
        +  F  +K ++   ++GW E+ LS AG+E+L+K++  A+P Y M+CFK    +C +I          +  G  K+ W                       
Subjt:  REMFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEINV--------RPRGVGKVCWQV---------------------

Query:  ------EVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCL--NNAWTNVKDLMCNNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTN
               VL  G  WRVG  + + IW+D W+P P TF+ +  +  +N+   V  L+  N R WDV  L+Q     DV ++ +IP+S R   DK +W  T 
Subjt:  ------EVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCL--NNAWTNVKDLMCNNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTN

Query:  SGMYTV
        SG++TV
Subjt:  SGMYTV

A0A2N9I946 Uncharacterized protein | 4.6e-143 | 36.27
Query:  GGHAYRLTEKIKNTLMAILSW-RKKTNVNSE-VEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGR
        G   + + +KIK   M +L W + +  +N   +E K+      E          +++ L +++   +  EE +W+ +SRV WL+EGD+NT+++HA    R
Subjt:  GGHAYRLTEKIKNTLMAILSW-RKKTNVNSE-VEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGR

Query:  RSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQV
        +  + + G+ D +G+W+     ++++ +EYF  LF SS P  + ++  + D ++  + N AL+ + S   I++A+F M P KAPG DGMTA F+Q  W +
Subjt:  RSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQV

Query:  IRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKR
        + E+V   + +FF  GRML  +N+TNIVLI KV  P  M+Q RPI LCNV YKI SKVL NR+K +LP ++S+ Q+ FV GRLISD+I++  E +H +K 
Subjt:  IRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKR

Query:  KKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRG
         + G    M  KLDM+KAYDR+EW FL+ ++  +GF  +W   ++ CV+S SYS++VN  PHG I   RGLRQGDPLSPYLF+LC+E L+ L+  +    
Subjt:  KKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRG

Query:  CLMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKR
         + G  I   GP +++LFFADDS+IFCRA+  +G  L ++L  Y   SGQ +N  K+ + FS N+PN +R  + S+ G + S +  +YLGL    G SK+
Subjt:  CLMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKR

Query:  EMFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEI-------------------------NVRPR---GVG--------
          F  +K ++   ++GW E+ LS AG+E+L+K++  A+PIY M+CFKLP  +C EI                          +RP+   G+G        
Subjt:  EMFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEI-------------------------NVRPR---GVG--------

Query:  -----------------------------------------------KVCWQVEVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCLN--NAWTNVK
                                                        +C    VLR G  WRVG    + IW+D W+P P TF+ +  L+  N+   V 
Subjt:  -----------------------------------------------KVCWQVEVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCLN--NAWTNVK

Query:  DLMCNNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSL
         L+      WD  KL+Q     DV ++ +IP+S R   DK +W  T SG +TV+  Y   L
Subjt:  DLMCNNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSL

A0A2N9J936 Uncharacterized protein | 4.6e-143 | 36.27
Query:  GGHAYRLTEKIKNTLMAILSW-RKKTNVNSE-VEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGR
        G   + + +KIK   M +L W + +  +N   +E K+      E          +++ L +++   +  EE +W+ +SRV WL+EGD+NT+++HA    R
Subjt:  GGHAYRLTEKIKNTLMAILSW-RKKTNVNSE-VEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGR

Query:  RSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQV
        +  + + G+ D +G+W+     ++++ +EYF  LF SS P  + ++  + D ++  + N AL+ + S   I++A+F M P KAPG DGMTA F+Q  W +
Subjt:  RSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQV

Query:  IRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKR
        + E+V   + +FF  GRML  +N+TNIVLI KV  P  M+Q RPI LCNV YKI SKVL NR+K +LP ++S+ Q+ FV GRLISD+I++  E +H +K 
Subjt:  IRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKR

Query:  KKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRG
         + G    M  KLDM+KAYDR+EW FL+ ++  +GF  +W   ++ CV+S SYS++VN  PHG I   RGLRQGDPLSPYLF+LC+E L+ L+  +    
Subjt:  KKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRG

Query:  CLMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKR
         + G  I   GP +++LFFADDS+IFCRA+  +G  L ++L  Y   SGQ +N  K+ + FS N+PN +R  + S+ G + S +  +YLGL    G SK+
Subjt:  CLMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKR

Query:  EMFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEI-------------------------NVRPR---GVG--------
          F  +K ++   ++GW E+ LS AG+E+L+K++  A+PIY M+CFKLP  +C EI                          +RP+   G+G        
Subjt:  EMFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEI-------------------------NVRPR---GVG--------

Query:  -----------------------------------------------KVCWQVEVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCLN--NAWTNVK
                                                        +C    VLR G  WRVG    + IW+D W+P P TF+ +  L+  N+   V 
Subjt:  -----------------------------------------------KVCWQVEVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCLN--NAWTNVK

Query:  DLMCNNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSL
         L+      WD  KL+Q     DV ++ +IP+S R   DK +W  T SG +TV+  Y   L
Subjt:  DLMCNNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSL

A0A6J5UE59 Reverse transcriptase domain-containing protein | 4.1e-144 | 36.61
Query:  ILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIEDKEGVWREG
        ++ WR+    NS   I  L +Q++     +  +   +  LE+ LK++L  EE +WK KSR+QWL+ G+KNT+FFH+KV  RR ++ L G+ED  GVW + 
Subjt:  ILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIEDKEGVWREG

Query:  DEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVANFFRRGRML
        +  +  + + YF DLF +S+P  +++      + +  S++ +L+  V+   + + V +++P  +PG DG T  F+Q +W V+ ++V R+V +FF  G++ 
Subjt:  DEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVANFFRRGRML

Query:  RRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMVVKLDMAKAY
        +R+NHT+I LI KV  P  M Q RPI L NV YKIISK+L NR+K VLP L+S  Q+ FV  R I+D+ILV HEI+HS++R KR     + +KLDMAKA+
Subjt:  RRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMVVKLDMAKAY

Query:  DRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNNGPTVANLFF
        DR+EW FL  +MK +GF  ++  W+ EC+S+VSYS+++N  P G I   RGLRQGDPLSP+LF++C+E LT L+    +R  L G+++  +G ++++L F
Subjt:  DRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNNGPTVANLFF

Query:  ADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKREMFENLKCKVADNVRGWAE
        ADDS++FC+A   E + ++++L  Y   SGQ+VN  KS++ +S++  + LR Q+  ++ +      G+YLG+  +FG SKR++FE ++ K+   + GWAE
Subjt:  ADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKREMFENLKCKVADNVRGWAE

Query:  QFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEINV----------------------------RPRGVG-------------KVCWQV--------
        QFLS AGKEVL+K++A+AMP YTM+CFKLP+++CKEI                              +  G+G             K+ W++        
Subjt:  QFLSNAGKEVLLKSMALAMPIYTMNCFKLPLSICKEINV----------------------------RPRGVG-------------KVCWQV--------

Query:  ----------------------------------EVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCLNNAWTNVKDLMCNNGRSWDVSKLQQCLGA
                                           +L  G  WR+G  + V +  DPW+P P TF+ +    +    V +L+    ++W    + +C   
Subjt:  ----------------------------------EVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCLNNAWTNVKDLMCNNGRSWDVSKLQQCLGA

Query:  EDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSLGIQ
        E+ + ++ IPIS     DK +WH    G YTV+ GY+ +L +Q
Subjt:  EDVSLVMEIPISHREEGDKAVWHHTNSGMYTVKLGYQTSLGIQ

SwissProt top hits | e value | % identity
O00370 LINE-1 retrotransposable element ORF2 protein | 9.2e-32 | 23.08
Query:  EIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREG-DKNTRFFHAKVKGRRSRSGLSGIEDKEGVWREGDEEVASVGIEYFG
        ++K+LE+Q  E+   +     +I+++  +LK+ +  ++   K      W  E  +K  R     +K +R ++ +  I++ +G       E+ +   EY+ 
Subjt:  EIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREG-DKNTRFFHAKVKGRRSRSGLSGIEDKEGVWREGDEEVASVGIEYFG

Query:  DLF--KSSQPPELDDLFYRWD-KMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVANFFRRGRMLRRMNHTNIVL
         L+  K     E+D     +    ++  +  +L   ++ + I   + ++   K+PG DG TA FYQ   + +   ++++  +  + G +       +I+L
Subjt:  DLF--KSSQPPELDDLFYRWD-KMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVANFFRRGRMLRRMNHTNIVL

Query:  ISKVGL-PTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMVVKLDMAKAYDRIEWAFLE
        I K G   TK    RPI L N+  KI++K+L NR+++ + KL+   Q  F+ G     +I     ++  + R K   KN +++ +D  KA+D+I+  F+ 
Subjt:  ISKVGL-PTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMVVKLDMAKAYDRIEWAFLE

Query:  RVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLT-FLLNDSISRGCLMGYKISNNGPTVANLFFADDSIIFC
        + +  +G D  +   +       + ++++N +      L+ G RQG PLSP LF +  EVL   +  +   +G  +G +       V    FADD I++ 
Subjt:  RVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLT-FLLNDSISRGCLMGYKISNNGPTVANLFFADDSIIFC

Query:  RATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSK--REMFENLKCKVADNVRGWAEQFLSNA
               Q L+ ++  +  +SG  +N+ KS   F  N+  +   Q+   +    + +  +YLG+++   +    +E ++ L  ++ ++   W     S  
Subjt:  RATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSK--REMFENLKCKVADNVRGWAEQFLSNA

Query:  GKEVLLKSMALAMPIYTMNC--FKLPLSICKEI
        G+  ++K   L   IY  N    KLP++   E+
Subjt:  GKEVLLKSMALAMPIYTMNC--FKLPLSICKEI

P08548 LINE-1 reverse transcriptase homolog | 2.8e-33 | 24.86
Query:  LMAILSWRKKTNVNSEV-EIKQLEEQIREEQDKRTPDFGK-ISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIEDKEG
        L A L   ++  VN+ +  +KQLE   +EE     P   K I+++  +L +           KS+  +  + +K  +      + +R +S +S I +   
Subjt:  LMAILSWRKKTNVNSEV-EIKQLEEQIREEQDKRTPDFGK-ISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIEDKEG

Query:  VWREGDEEVASVGIEYFGDLF--KSSQPPELDDLFYRWD-KMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVAN
               E+  +  EY+  L+  K     E+D          +   +   L   +S + I   +  +   K+PG DG T+ FYQ   + +   ++ +  N
Subjt:  VWREGDEEVASVGIEYFGDLF--KSSQPPELDDLFYRWD-KMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVAN

Query:  FFRRGRMLRRMNHTNIVLISKVGL-PTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMV
          + G +       NI LI K G  PT+    RPI L N+  KI++K+L NR+++ + K++   Q  F+ G     +I     ++  + + K   K+ M+
Subjt:  FFRRGRMLRRMNHTNIVLISKVGL-PTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMV

Query:  VKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNN
        + +D  KA+D I+  F+ R +K +G +  +   +    S  + ++++N        L  G RQG PLSP LF +  EVL   + +      + G  I + 
Subjt:  VKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNN

Query:  GPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKS-NVVFSSNSPNE--LRDQLASLMGINQSDRLGRYLGLEVEFGLSKREMFENLK
           +    FADD I++   T     +L+ V+ +Y  +SG  +N  KS   ++++N+  E  ++D +   +   +   LG YL  +V+     +E +E L+
Subjt:  GPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKS-NVVFSSNSPNE--LRDQLASLMGINQSDRLGRYLGLEVEFGLSKREMFENLK

Query:  CKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNC--FKLPLSICKEI
         ++A++V  W     S  G+  ++K   L   IY  N    K PLS  K++
Subjt:  CKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNC--FKLPLSICKEI

P11369 LINE-1 retrotransposable element ORF2 protein | 6.3e-33 | 26.06
Query:  KGRRSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLF--KSSQPPELDDLFYRWDKMIDDSDNV-ALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFY
        KG R +  ++ I +++G      EE+ +    ++  L+  K     E+D    R+     + D V  L + +S   IE  + ++   K+PG DG +A FY
Subjt:  KGRRSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLF--KSSQPPELDDLFYRWDKMIDDSDNV-ALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFY

Query:  QDNWQVIRENVVRMVANFFRR----GRMLRRMNHTNIVLISKVGL-PTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSIL
            Q  +E+++ ++   F +    G +        I LI K    PTK+   RPI L N+  KI++K+L NR+++ +  ++   Q  F+ G     +I 
Subjt:  QDNWQVIRENVVRMVANFFRR----GRMLRRMNHTNIVLISKVGL-PTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSIL

Query:  VDHEIMHSMKRKKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVL
            ++H + + K   KN M++ LD  KA+D+I+  F+ +V++  G    +   +    S    ++ VN +    I L+ G RQG PLSPYLF +  EVL
Subjt:  VDHEIMHSMKRKKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVL

Query:  TFLLNDSISRGCLMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVF---SSNSPNELRDQLASLMGINQSDRLG
           +        + G +I      ++ L  ADD I++     +  +EL+++++ +G + G  +N +KS       +  +  E+R+     +  N    LG
Subjt:  TFLLNDSISRGCLMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVF---SSNSPNELRDQLASLMGINQSDRLG

Query:  RYLGLEVEFGLSKREMFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNC--FKLPLSICKEI
          L  EV+    K   F++LK ++ +++R W +   S  G+  ++K   L   IY  N    K+P     E+
Subjt:  RYLGLEVEFGLSKREMFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNC--FKLPLSICKEI

P14381 Transposon TX1 uncharacterized 149 kDa protein | 7.3e-29 | 24.43
Query:  KSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWD--KMIDDSDNVALMTEVSCTVIEKA
        +SR+Q L + D+ +RFF+A  K + +R  ++ +  ++G   E  E +      ++ +LF S  P   D     WD   ++ +     L T ++   + +A
Subjt:  KSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWD--KMIDDSDNVALMTEVSCTVIEKA

Query:  VFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSEC
        +  M  +K+PG DG+T  F+Q  W  +  +  R++   F++G +        + L+ K G    +   RP+ L +  YKI++K +  R+K VL +++   
Subjt:  VFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSEC

Query:  QTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQG
        Q+  V GR I D++ +  +++H  +R      +   + LD  KA+DR++  +L   ++   F P++  ++    +S    + +N      +   RG+RQG
Subjt:  QTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQG

Query:  DPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNNGPTVANLFFADDSIIFCR--ATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSN-----SPN
         PLS  L+ L  E    LL     R  L G  +      V    +ADD I+  +        QE   V   Y   S   +N SKS+ +   +      P 
Subjt:  DPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNNGPTVANLFFADDSIIFCR--ATHSEGQELMSVLDKYGGLSGQAVNLSKSNVVFSSN-----SPN

Query:  ELRDQLASLMGINQSDRLGRYLGLEV---EFGLSKREMFENLKCKVADNVRGWA--EQFLSNAGKEVLLKSMALAMPIYTMNC
          RD       I+   ++ +YLG+ +   E+ +S+   F  L+  V   +  W    + LS  G+ +++  +  +   Y + C
Subjt:  ELRDQLASLMGINQSDRLGRYLGLEV---EFGLSKREMFENLKCKVADNVRGWA--EQFLSNAGKEVLLKSMALAMPIYTMNC

P92555 Uncharacterized mitochondrial protein AtMg01250    e value: 2.3e-14    %identity: 51.47
Query:  VVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNNGPTVANLFFADDS
        ++N  P G++   RGLRQGDPLSPYLF+LC+EVL+ L   +  +G L G ++SNN P + +L FADD+
Subjt:  VVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNNGPTVANLFFADDS

Arabidopsis top hits    e value    %identity    Alignment
AT1G43760.1 DNAse I-like superfamily protein    e value: 6.1e-23    %identity: 28.41
Query:  MGGHAYRLTEKIKNTLMAILSWRKKTNVNSEVEIKQLEEQIREEQDK-RTPDFGKISQLEKDLKKS----LGAEEAYWKAKSRVQWLREGDKNTRFFHAK
        +G H + L E +K          ++   N + + K+  + +   Q +  T     + ++E   +K       A E++++ KSR++WL++GD NTRFFH  
Subjt:  MGGHAYRLTEKIKNTLMAILSWRKKTNVNSEVEIKQLEEQIREEQDK-RTPDFGKISQLEKDLKKS----LGAEEAYWKAKSRVQWLREGDKNTRFFHAK

Query:  VKGRRSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEK----AVFAMNPDKAPGADGMTAG
        +   ++++ +  +   + V  E   +V  + + Y+  L  S       D   R   +     N  L + +S    +K    AVFAM  +KAPG D  TA 
Subjt:  VKGRRSRSGLSGIEDKEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEK----AVFAMNPDKAPGADGMTAG

Query:  FYQDNWQVIRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIIS
        F+ ++W V++++ +  V  FFR G +L+R N T I LI KV    +++  RP+  C V YKII+
Subjt:  FYQDNWQVIRENVVRMVANFFRRGRMLRRMNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases    e value: 3.7e-12    %identity: 41.46
Query:  LCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKW
        +  R+K ++  L+   Q +F+ GR+ +D+I+   E +HSM+RKK G K +M++KLD+ KAYDRI W +LE  +   GF   W
Subjt:  LCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMVVKLDMAKAYDRIEWAFLERVMKVMGFDPKW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)    e value: 1.6e-15    %identity: 51.47
Query:  VVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNNGPTVANLFFADDS
        ++N  P G++   RGLRQGDPLSPYLF+LC+EVL+ L   +  +G L G ++SNN P + +L FADD+
Subjt:  VVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNNGPTVANLFFADDS


Sequences
CDS sequence
ATGGGAGGTCATGCGTACAGGTTGACGGAAAAAATTAAAAATACGCTTATGGCGATATTAAGCTGGCGGAAGAAGACAAATGTAAACAGTGAGGTGGAAATTAAACAATT
AGAAGAGCAGATAAGGGAAGAGCAGGACAAAAGAACCCCGGATTTTGGGAAGATTAGCCAGCTTGAAAAAGATCTAAAGAAGTCATTGGGGGCCGAAGAAGCGTATTGGA
AGGCGAAATCTAGAGTCCAATGGCTGAGGGAAGGGGACAAAAACACGAGATTCTTCCATGCAAAGGTGAAGGGGAGACGAAGCAGGAGTGGGTTGTCAGGTATTGAGGAT
AAGGAGGGTGTGTGGAGGGAAGGAGACGAAGAAGTGGCAAGTGTGGGGATTGAGTACTTTGGGGATTTATTTAAATCTTCTCAACCACCAGAGCTAGATGATCTGTTTTA
CCGTTGGGACAAAATGATTGACGATAGCGACAATGTGGCGTTGATGACTGAGGTATCATGCACAGTGATTGAAAAAGCAGTGTTTGCAATGAATCCTGATAAAGCCCCGG
GGGCTGATGGCATGACAGCGGGGTTTTACCAAGACAACTGGCAGGTGATTCGGGAAAATGTTGTACGGATGGTTGCCAATTTCTTTCGTAGAGGCAGAATGCTTAGGAGG
ATGAACCACACTAATATTGTTCTGATTTCGAAAGTAGGGCTGCCAACAAAAATGACTCAGTTAAGGCCGATAGAGTTATGTAACGTTGCATATAAGATAATATCGAAAGT
TTTGTGCAATCGGGTTAAGAAGGTTCTGCCCAAACTTGTCAGTGAATGCCAAACGACTTTTGTTCATGGTCGCTTGATTTCGGATAGTATCCTAGTGGACCACGAAATCA
TGCATTCGATGAAAAGGAAAAAGCGAGGAAAAAAGAACTTTATGGTTGTGAAACTGGATATGGCTAAAGCGTACGATAGAATAGAGTGGGCTTTCTTGGAGAGAGTGATG
AAGGTGATGGGCTTTGATCCTAAATGGCGGATGTGGGTTTTGGAATGTGTGTCATCAGTTTCGTACAGTCTGGTGGTCAACAATAAGCCACATGGAATGATCATTCTAGA
GAGAGGTTTGAGGCAGGGGGACCCATTGTCCCCGTATCTATTTGTGCTGTGCTCGGAGGTACTTACATTTTTGTTAAATGACTCTATTAGTCGTGGATGCCTCATGGGGT
ATAAAATCTCCAATAATGGGCCGACTGTTGCAAATCTATTTTTTGCAGATGATTCTATTATCTTTTGCAGAGCAACGCATAGTGAGGGGCAAGAGCTCATGAGTGTCCTA
GATAAATATGGAGGACTATCGGGGCAAGCGGTAAACCTCAGCAAAAGCAATGTGGTTTTTAGCAGTAATAGCCCAAATGAGCTGAGAGACCAATTAGCAAGTCTGATGGG
CATCAACCAGTCGGACAGGCTGGGAAGATACTTGGGATTGGAGGTGGAATTTGGGCTATCCAAGAGAGAAATGTTTGAGAACCTGAAATGCAAGGTGGCTGATAATGTGA
GAGGGTGGGCGGAGCAATTCCTCTCTAATGCAGGGAAGGAAGTGTTGCTGAAGTCGATGGCTTTGGCTATGCCGATATATACTATGAACTGTTTTAAGTTGCCGTTGTCT
ATATGTAAAGAAATTAATGTGCGGCCTCGTGGGGTTGGCAAAGTGTGCTGGCAAGTAGAGGTCCTGAGACATGGTTTCTGGTGGAGGGTTGGGACTAATTCTCGAGTAAA
TATTTGGAGGGATCCGTGGATCCCTAGGCCGTTGACCTTTAAACCGGTGCAATGCTTGAATAATGCGTGGACGAACGTGAAGGATTTGATGTGTAATAATGGGAGGAGCT
GGGATGTAAGCAAGCTACAACAGTGTTTGGGGGCTGAGGATGTGAGCTTGGTAATGGAAATTCCGATTAGCCACAGGGAGGAGGGGGATAAGGCAGTGTGGCACCACACG
AATTCGGGTATGTACACAGTTAAATTGGGCTATCAGACATCACTGGGGATTCAATGTGTGGAGGGTCCAAGCAGGGGGAATCAGATGGGAATAAGGCAAACCCTTGAATA
TGGAGCAAGGTCTGCTGGAGGTGAGAAAGGAGGTGCGGGTGTTTCAAGAAGCCATGCAGAATGGGGCTAG
mRNA sequence
ATGGGAGGTCATGCGTACAGGTTGACGGAAAAAATTAAAAATACGCTTATGGCGATATTAAGCTGGCGGAAGAAGACAAATGTAAACAGTGAGGTGGAAATTAAACAATT
AGAAGAGCAGATAAGGGAAGAGCAGGACAAAAGAACCCCGGATTTTGGGAAGATTAGCCAGCTTGAAAAAGATCTAAAGAAGTCATTGGGGGCCGAAGAAGCGTATTGGA
AGGCGAAATCTAGAGTCCAATGGCTGAGGGAAGGGGACAAAAACACGAGATTCTTCCATGCAAAGGTGAAGGGGAGACGAAGCAGGAGTGGGTTGTCAGGTATTGAGGAT
AAGGAGGGTGTGTGGAGGGAAGGAGACGAAGAAGTGGCAAGTGTGGGGATTGAGTACTTTGGGGATTTATTTAAATCTTCTCAACCACCAGAGCTAGATGATCTGTTTTA
CCGTTGGGACAAAATGATTGACGATAGCGACAATGTGGCGTTGATGACTGAGGTATCATGCACAGTGATTGAAAAAGCAGTGTTTGCAATGAATCCTGATAAAGCCCCGG
GGGCTGATGGCATGACAGCGGGGTTTTACCAAGACAACTGGCAGGTGATTCGGGAAAATGTTGTACGGATGGTTGCCAATTTCTTTCGTAGAGGCAGAATGCTTAGGAGG
ATGAACCACACTAATATTGTTCTGATTTCGAAAGTAGGGCTGCCAACAAAAATGACTCAGTTAAGGCCGATAGAGTTATGTAACGTTGCATATAAGATAATATCGAAAGT
TTTGTGCAATCGGGTTAAGAAGGTTCTGCCCAAACTTGTCAGTGAATGCCAAACGACTTTTGTTCATGGTCGCTTGATTTCGGATAGTATCCTAGTGGACCACGAAATCA
TGCATTCGATGAAAAGGAAAAAGCGAGGAAAAAAGAACTTTATGGTTGTGAAACTGGATATGGCTAAAGCGTACGATAGAATAGAGTGGGCTTTCTTGGAGAGAGTGATG
AAGGTGATGGGCTTTGATCCTAAATGGCGGATGTGGGTTTTGGAATGTGTGTCATCAGTTTCGTACAGTCTGGTGGTCAACAATAAGCCACATGGAATGATCATTCTAGA
GAGAGGTTTGAGGCAGGGGGACCCATTGTCCCCGTATCTATTTGTGCTGTGCTCGGAGGTACTTACATTTTTGTTAAATGACTCTATTAGTCGTGGATGCCTCATGGGGT
ATAAAATCTCCAATAATGGGCCGACTGTTGCAAATCTATTTTTTGCAGATGATTCTATTATCTTTTGCAGAGCAACGCATAGTGAGGGGCAAGAGCTCATGAGTGTCCTA
GATAAATATGGAGGACTATCGGGGCAAGCGGTAAACCTCAGCAAAAGCAATGTGGTTTTTAGCAGTAATAGCCCAAATGAGCTGAGAGACCAATTAGCAAGTCTGATGGG
CATCAACCAGTCGGACAGGCTGGGAAGATACTTGGGATTGGAGGTGGAATTTGGGCTATCCAAGAGAGAAATGTTTGAGAACCTGAAATGCAAGGTGGCTGATAATGTGA
GAGGGTGGGCGGAGCAATTCCTCTCTAATGCAGGGAAGGAAGTGTTGCTGAAGTCGATGGCTTTGGCTATGCCGATATATACTATGAACTGTTTTAAGTTGCCGTTGTCT
ATATGTAAAGAAATTAATGTGCGGCCTCGTGGGGTTGGCAAAGTGTGCTGGCAAGTAGAGGTCCTGAGACATGGTTTCTGGTGGAGGGTTGGGACTAATTCTCGAGTAAA
TATTTGGAGGGATCCGTGGATCCCTAGGCCGTTGACCTTTAAACCGGTGCAATGCTTGAATAATGCGTGGACGAACGTGAAGGATTTGATGTGTAATAATGGGAGGAGCT
GGGATGTAAGCAAGCTACAACAGTGTTTGGGGGCTGAGGATGTGAGCTTGGTAATGGAAATTCCGATTAGCCACAGGGAGGAGGGGGATAAGGCAGTGTGGCACCACACG
AATTCGGGTATGTACACAGTTAAATTGGGCTATCAGACATCACTGGGGATTCAATGTGTGGAGGGTCCAAGCAGGGGGAATCAGATGGGAATAAGGCAAACCCTTGAATA
TGGAGCAAGGTCTGCTGGAGGTGAGAAAGGAGGTGCGGGTGTTTCAAGAAGCCATGCAGAATGGGGCTAG
Protein sequence
MGGHAYRLTEKIKNTLMAILSWRKKTNVNSEVEIKQLEEQIREEQDKRTPDFGKISQLEKDLKKSLGAEEAYWKAKSRVQWLREGDKNTRFFHAKVKGRRSRSGLSGIED
KEGVWREGDEEVASVGIEYFGDLFKSSQPPELDDLFYRWDKMIDDSDNVALMTEVSCTVIEKAVFAMNPDKAPGADGMTAGFYQDNWQVIRENVVRMVANFFRRGRMLRR
MNHTNIVLISKVGLPTKMTQLRPIELCNVAYKIISKVLCNRVKKVLPKLVSECQTTFVHGRLISDSILVDHEIMHSMKRKKRGKKNFMVVKLDMAKAYDRIEWAFLERVM
KVMGFDPKWRMWVLECVSSVSYSLVVNNKPHGMIILERGLRQGDPLSPYLFVLCSEVLTFLLNDSISRGCLMGYKISNNGPTVANLFFADDSIIFCRATHSEGQELMSVL
DKYGGLSGQAVNLSKSNVVFSSNSPNELRDQLASLMGINQSDRLGRYLGLEVEFGLSKREMFENLKCKVADNVRGWAEQFLSNAGKEVLLKSMALAMPIYTMNCFKLPLS
ICKEINVRPRGVGKVCWQVEVLRHGFWWRVGTNSRVNIWRDPWIPRPLTFKPVQCLNNAWTNVKDLMCNNGRSWDVSKLQQCLGAEDVSLVMEIPISHREEGDKAVWHHT
NSGMYTVKLGYQTSLGIQCVEGPSRGNQMGIRQTLEYGARSAGGEKGGAGVSRSHAEWG