; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035208 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035208
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:16664363..16675367
RNA-Seq ExpressionLag0035208
SyntenyLag0035208
Gene Ontology termsGO:0006265 - DNA topological change (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003918 - DNA topoisomerase type II (ATP-hydrolyzing) activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR013758 - DNA topoisomerase, type IIA, subunit A/ C-terminal, alpha-beta
IPR013760 - DNA topoisomerase, type IIA-like domain superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa]4.0e-16040.98Show/hide
Query:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI
        + ESE ++  E+A+ TI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+
Subjt:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI

Query:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------
        ILLNSLPE+Y EVK+AI+Y  DSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+                                
Subjt:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------

Query:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------
                                       EVL V   +    WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Subjt:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------

Query:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-
                                     +N + K+ KG++VK +G L +GLYVL   T+ G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL 
Subjt:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-

Query:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL
                                                                   ++D                          VE QT RK+K L
Subjt:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL

Query:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS
        RTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAER NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Subjt:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS

Query:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-EKNQSETETNSEKNKSFEMELELASIQTPT---EN
        +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+ + +  K Q + +T          E+ +AS   P+   +N
Subjt:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-EKNQSETETNSEKNKSFEMELELASIQTPT---EN

Query:  QPAETDVRVEEGAETQ---AETQAENI----------PPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKW
        QP      +E+  +++    ++Q E I              + LQNY LTRDR +RE   P RY  AD+V YAL    +SI+ EPLT+ EAI S +  +W
Subjt:  QPAETDVRVEEGAETQ---AETQAENI----------PPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKW

Query:  KEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLV
        K+AM+EE+ SL KN TW LV +P N+ L+  KWIYK+K     +   RYKARL+
Subjt:  KEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLV

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]8.6e-17942.73Show/hide
Query:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI
        + ESE ++  E+A+STI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+
Subjt:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI

Query:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------
        ILLNSLPE+Y EVK+AI+Y RDSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+                                
Subjt:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------

Query:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------
                                       EVL V   +    WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Subjt:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------

Query:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-
                                     +N + K+ KG++VK +G L +GLYVL   T+ G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL 
Subjt:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-

Query:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL
                                                                   ++D                          VE QT RK+K L
Subjt:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL

Query:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS
        RTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAER NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Subjt:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS

Query:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-----EKNQ------SETETNSEKNKSFEMELELAS
        +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+ + +     +K Q      +E    SE   S +++ +   
Subjt:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-----EKNQ------SETETNSEKNKSFEMELELAS

Query:  IQTPTENQPAETD--------VRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSD
        +    + Q +E D        + ++EGA  + E+ + N       LQNY LTRDR +RE   P RY  AD+V YAL    +SI+ EPLT+ EAI S +  
Subjt:  IQTPTENQPAETD--------VRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSD

Query:  KWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL
        +WK+AM+EE+ SL KN TW LV +P N+ L+  KWIYK+K     +   RYKARLVAKGYTQKEGVD+ EIFSPVVRHSSIR +LS+
Subjt:  KWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL

PKU72844.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]2.2e-15038.39Show/hide
Query:  LPESEIKET---------KEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDE
        LPESE+  T         ++ AFS+IIL LAD VLR+V    T  E+WK+L+++Y  K+L N++Y+KE+FFG+KMD  K ++ NLDE+N+++LDL N++ 
Subjt:  LPESEIKET---------KEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDE

Query:  KMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELK-KGKSKESEALFTRGRTEKKSS-----RNNSRTEVLT-------------
        K+ DE++AIILLNSLP+S    K  ++Y R+++++D V +AL S+ L++K   K+   E L  RGR++K+ +     ++ SR++  +             
Subjt:  KMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELK-KGKSKESEALFTRGRTEKKSS-----RNNSRTEVLT-------------

Query:  -------------------------VLEGNFDSEWIL---------DSGCSF----------------------HMTPNKHWFLNFEEIDGGKVLLGNHQ
                                 ++  N+DS  +L         +  C                        H+   K   ++   +D    +  + +
Subjt:  -------------------------VLEGNFDSEWIL---------DSGCSF----------------------HMTPNKHWFLNFEEIDGGKVLLGNHQ

Query:  NNIKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLLVED-----------------------------
          ++I KGA+V  KGI  NGLYVL   T+VG T V ++++  +TKLWH RLGH+S+RGL EL KQGL   D                             
Subjt:  NNIKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLLVED-----------------------------

Query:  --------------------------------------------------------LVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRG
                                                                +VE Q  RKLK LRTDNGLEF +  F +FC    GI+RH TV  
Subjt:  --------------------------------------------------------LVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRG

Query:  TPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQ
        TPQQNGLAERMNRTLL+++RCL+ ++ L K FWGEAL TA YLVNR+PS+AI+FKTP E W   PP L++LR FGC+AY H  +GKL+ R+ KC+FLGY 
Subjt:  TPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQ

Query:  SGIKGYRLWCIEKGEEKCIISRDVTFDESVIAWEKNQSETET-----NSEKNKS-FEMELELASIQTPTENQPAETDVRVEEGAETQAETQAENIP-PEP
        +G+KGYRLW +     K IISRDV F+E+ +   +++++  T     NS+ NK  +E E+E  S     EN               Q  T+ EN P P  
Subjt:  SGIKGYRLWCIEKGEEKCIISRDVTFDESVIAWEKNQSETET-----NSEKNKS-FEMELELASIQTPTENQPAETDVRVEEGAETQAETQAENIP-PEP

Query:  DSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSV
        ++  +Y L+RDR+RR I+ P++Y  A+++ YAL + +   D EP++Y EA++  +S+ W +AMQ+E +SL+KNNTW LV++  N+ +V CKWIYK+K+  
Subjt:  DSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSV

Query:  DPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL
          ++P RYKARLVA+GYTQKEG+DY EIFSPVV+H+SIR L+ L
Subjt:  DPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL

RVW99173.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.9e-14940.17Show/hide
Query:  AKFEVERFDGR-----------------GLPE---------SEIKETKEI-----AFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYI
        AKF+VERF G+                 GL +         S ++E K+I     A S IIL L D VLR+V +A +  EVW +L+ +Y+TKSL N+L+ 
Subjt:  AKFEVERFDGR-----------------GLPE---------SEIKETKEI-----AFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYI

Query:  KERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGK----SKE----SE
        K + + FKM P   +E +LD +N+I+LDL NID  +SDE++AI+LL SL   Y  +K AI Y RDSL+ D   ++      + KK K     KE     +
Subjt:  KERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGK----SKE----SE

Query:  ALFTRGRTEKKSSR---------NNSRTEVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNHQ------------------------
            R  T KK+                EVL V E +   EWILDSGCSFHM P K WF +F+E DGG VLLGN++                        
Subjt:  ALFTRGRTEKKSSR---------NNSRTEVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNHQ------------------------

Query:  ----------------------------NNIKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL----
                                    N++++ +G++   K  + NGLY L   T++   +   + D   TKLWH RLGHMS +GL+EL KQG+L    
Subjt:  ----------------------------NNIKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL----

Query:  --------------------VEDLVEKQTE---------------------RKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMN
                             + + E Q +                     RK+K LRTDNGLEFLSN+F  FC+  EGI  H TVR TPQQNGLAERMN
Subjt:  --------------------VEDLVEKQTE---------------------RKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMN

Query:  RTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIE
        RT+LE++RC++S++ L K FW EA  T  +L+NRSPS+A+ FKTP EKW+    +  +L+ FGC AY H+K  KL+ RA KC+FLGY  G+KGY+LW   
Subjt:  RTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIE

Query:  KGEEKCIISRDVTFDESVIAWEKNQSETETNSEKNKSFEMELELASIQTPTENQPAETDVRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRR
        +G+  CIISRDVTF+E  ++ +    + E + +    FE+E E          QP ++     + A+ +   + +N P +   L++YNL RDRQ+R+++ 
Subjt:  KGEEKCIISRDVTFDESVIAWEKNQSETETNSEKNKSFEMELELASIQTPTENQPAETDVRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRR

Query:  PARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQ
        P RY   ++  +AL      +D EP TY EAINS   D+W +A++EEM+SL KN TWELV +P ++ +VG KW++K KQ    ++  RYKARLVAKG++Q
Subjt:  PARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQ

Query:  KEGVDYGEIFSPVVRHSSIRTLLS
        KEGVDY EIFSPVV+HSSIR LL+
Subjt:  KEGVDYGEIFSPVVRHSSIRTLLS

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]8.6e-17942.73Show/hide
Query:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI
        + ESE ++  E+A+STI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+
Subjt:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI

Query:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------
        ILLNSLPE+Y EVK+AI+Y RDSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+                                
Subjt:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------

Query:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------
                                       EVL V   +    WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Subjt:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------

Query:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-
                                     +N + K+ KG++VK +G L +GLYVL   T+ G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL 
Subjt:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-

Query:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL
                                                                   ++D                          VE QT RK+K L
Subjt:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL

Query:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS
        RTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAER NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Subjt:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS

Query:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-----EKNQ------SETETNSEKNKSFEMELELAS
        +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+ + +     +K Q      +E    SE   S +++ +   
Subjt:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-----EKNQ------SETETNSEKNKSFEMELELAS

Query:  IQTPTENQPAETD--------VRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSD
        +    + Q +E D        + ++EGA  + E+ + N       LQNY LTRDR +RE   P RY  AD+V YAL    +SI+ EPLT+ EAI S +  
Subjt:  IQTPTENQPAETD--------VRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSD

Query:  KWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL
        +WK+AM+EE+ SL KN TW LV +P N+ L+  KWIYK+K     +   RYKARLVAKGYTQKEGVD+ EIFSPVVRHSSIR +LS+
Subjt:  KWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL

TrEMBL top hitse value%identityAlignment
A0A2I0WB13 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-15038.39Show/hide
Query:  LPESEIKET---------KEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDE
        LPESE+  T         ++ AFS+IIL LAD VLR+V    T  E+WK+L+++Y  K+L N++Y+KE+FFG+KMD  K ++ NLDE+N+++LDL N++ 
Subjt:  LPESEIKET---------KEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDE

Query:  KMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELK-KGKSKESEALFTRGRTEKKSS-----RNNSRTEVLT-------------
        K+ DE++AIILLNSLP+S    K  ++Y R+++++D V +AL S+ L++K   K+   E L  RGR++K+ +     ++ SR++  +             
Subjt:  KMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELK-KGKSKESEALFTRGRTEKKSS-----RNNSRTEVLT-------------

Query:  -------------------------VLEGNFDSEWIL---------DSGCSF----------------------HMTPNKHWFLNFEEIDGGKVLLGNHQ
                                 ++  N+DS  +L         +  C                        H+   K   ++   +D    +  + +
Subjt:  -------------------------VLEGNFDSEWIL---------DSGCSF----------------------HMTPNKHWFLNFEEIDGGKVLLGNHQ

Query:  NNIKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLLVED-----------------------------
          ++I KGA+V  KGI  NGLYVL   T+VG T V ++++  +TKLWH RLGH+S+RGL EL KQGL   D                             
Subjt:  NNIKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLLVED-----------------------------

Query:  --------------------------------------------------------LVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRG
                                                                +VE Q  RKLK LRTDNGLEF +  F +FC    GI+RH TV  
Subjt:  --------------------------------------------------------LVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRG

Query:  TPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQ
        TPQQNGLAERMNRTLL+++RCL+ ++ L K FWGEAL TA YLVNR+PS+AI+FKTP E W   PP L++LR FGC+AY H  +GKL+ R+ KC+FLGY 
Subjt:  TPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQ

Query:  SGIKGYRLWCIEKGEEKCIISRDVTFDESVIAWEKNQSETET-----NSEKNKS-FEMELELASIQTPTENQPAETDVRVEEGAETQAETQAENIP-PEP
        +G+KGYRLW +     K IISRDV F+E+ +   +++++  T     NS+ NK  +E E+E  S     EN               Q  T+ EN P P  
Subjt:  SGIKGYRLWCIEKGEEKCIISRDVTFDESVIAWEKNQSETET-----NSEKNKS-FEMELELASIQTPTENQPAETDVRVEEGAETQAETQAENIP-PEP

Query:  DSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSV
        ++  +Y L+RDR+RR I+ P++Y  A+++ YAL + +   D EP++Y EA++  +S+ W +AMQ+E +SL+KNNTW LV++  N+ +V CKWIYK+K+  
Subjt:  DSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSV

Query:  DPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL
          ++P RYKARLVA+GYTQKEG+DY EIFSPVV+H+SIR L+ L
Subjt:  DPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL

A0A438IR25 Retrovirus-related Pol polyprotein from transposon TNT 1-949.1e-15040.17Show/hide
Query:  AKFEVERFDGR-----------------GLPE---------SEIKETKEI-----AFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYI
        AKF+VERF G+                 GL +         S ++E K+I     A S IIL L D VLR+V +A +  EVW +L+ +Y+TKSL N+L+ 
Subjt:  AKFEVERFDGR-----------------GLPE---------SEIKETKEI-----AFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYI

Query:  KERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGK----SKE----SE
        K + + FKM P   +E +LD +N+I+LDL NID  +SDE++AI+LL SL   Y  +K AI Y RDSL+ D   ++      + KK K     KE     +
Subjt:  KERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGK----SKE----SE

Query:  ALFTRGRTEKKSSR---------NNSRTEVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNHQ------------------------
            R  T KK+                EVL V E +   EWILDSGCSFHM P K WF +F+E DGG VLLGN++                        
Subjt:  ALFTRGRTEKKSSR---------NNSRTEVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNHQ------------------------

Query:  ----------------------------NNIKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL----
                                    N++++ +G++   K  + NGLY L   T++   +   + D   TKLWH RLGHMS +GL+EL KQG+L    
Subjt:  ----------------------------NNIKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL----

Query:  --------------------VEDLVEKQTE---------------------RKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMN
                             + + E Q +                     RK+K LRTDNGLEFLSN+F  FC+  EGI  H TVR TPQQNGLAERMN
Subjt:  --------------------VEDLVEKQTE---------------------RKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMN

Query:  RTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIE
        RT+LE++RC++S++ L K FW EA  T  +L+NRSPS+A+ FKTP EKW+    +  +L+ FGC AY H+K  KL+ RA KC+FLGY  G+KGY+LW   
Subjt:  RTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIE

Query:  KGEEKCIISRDVTFDESVIAWEKNQSETETNSEKNKSFEMELELASIQTPTENQPAETDVRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRR
        +G+  CIISRDVTF+E  ++ +    + E + +    FE+E E          QP ++     + A+ +   + +N P +   L++YNL RDRQ+R+++ 
Subjt:  KGEEKCIISRDVTFDESVIAWEKNQSETETNSEKNKSFEMELELASIQTPTENQPAETDVRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRR

Query:  PARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQ
        P RY   ++  +AL      +D EP TY EAINS   D+W +A++EEM+SL KN TWELV +P ++ +VG KW++K KQ    ++  RYKARLVAKG++Q
Subjt:  PARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQ

Query:  KEGVDYGEIFSPVVRHSSIRTLLS
        KEGVDY EIFSPVV+HSSIR LL+
Subjt:  KEGVDYGEIFSPVVRHSSIRTLLS

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class1.9e-16040.98Show/hide
Query:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI
        + ESE ++  E+A+ TI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+
Subjt:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI

Query:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------
        ILLNSLPE+Y EVK+AI+Y  DSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+                                
Subjt:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------

Query:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------
                                       EVL V   +    WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Subjt:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------

Query:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-
                                     +N + K+ KG++VK +G L +GLYVL   T+ G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL 
Subjt:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-

Query:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL
                                                                   ++D                          VE QT RK+K L
Subjt:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL

Query:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS
        RTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAER NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Subjt:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS

Query:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-EKNQSETETNSEKNKSFEMELELASIQTPT---EN
        +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+ + +  K Q + +T          E+ +AS   P+   +N
Subjt:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-EKNQSETETNSEKNKSFEMELELASIQTPT---EN

Query:  QPAETDVRVEEGAETQ---AETQAENI----------PPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKW
        QP      +E+  +++    ++Q E I              + LQNY LTRDR +RE   P RY  AD+V YAL    +SI+ EPLT+ EAI S +  +W
Subjt:  QPAETDVRVEEGAETQ---AETQAENI----------PPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKW

Query:  KEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLV
        K+AM+EE+ SL KN TW LV +P N+ L+  KWIYK+K     +   RYKARL+
Subjt:  KEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLV

A0A5A7UB25 Putative gag-pol polyprotein4.2e-17942.73Show/hide
Query:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI
        + ESE ++  E+A+STI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+
Subjt:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI

Query:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------
        ILLNSLPE+Y EVK+AI+Y RDSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+                                
Subjt:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------

Query:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------
                                       EVL V   +    WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Subjt:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------

Query:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-
                                     +N + K+ KG++VK +G L +GLYVL   T+ G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL 
Subjt:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-

Query:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL
                                                                   ++D                          VE QT RK+K L
Subjt:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL

Query:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS
        RTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAER NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Subjt:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS

Query:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-----EKNQ------SETETNSEKNKSFEMELELAS
        +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+ + +     +K Q      +E    SE   S +++ +   
Subjt:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-----EKNQ------SETETNSEKNKSFEMELELAS

Query:  IQTPTENQPAETD--------VRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSD
        +    + Q +E D        + ++EGA  + E+ + N       LQNY LTRDR +RE   P RY  AD+V YAL    +SI+ EPLT+ EAI S +  
Subjt:  IQTPTENQPAETD--------VRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSD

Query:  KWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL
        +WK+AM+EE+ SL KN TW LV +P N+ L+  KWIYK+K     +   RYKARLVAKGYTQKEGVD+ EIFSPVVRHSSIR +LS+
Subjt:  KWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL

A0A5D3DNU1 Putative gag-pol polyprotein4.2e-17942.73Show/hide
Query:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI
        + ESE ++  E+A+STI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+
Subjt:  LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAI

Query:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------
        ILLNSLPE+Y EVK+AI+Y RDSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+                                
Subjt:  ILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT--------------------------------

Query:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------
                                       EVL V   +    WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Subjt:  -------------------------------EVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH----------------------

Query:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-
                                     +N + K+ KG++VK +G L +GLYVL   T+ G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL 
Subjt:  -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-

Query:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL
                                                                   ++D                          VE QT RK+K L
Subjt:  -----------------------------------------------------------VEDL-------------------------VEKQTERKLKCL

Query:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS
        RTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAER NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Subjt:  RTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS

Query:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-----EKNQ------SETETNSEKNKSFEMELELAS
        +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+ + +     +K Q      +E    SE   S +++ +   
Subjt:  NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAW-----EKNQ------SETETNSEKNKSFEMELELAS

Query:  IQTPTENQPAETD--------VRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSD
        +    + Q +E D        + ++EGA  + E+ + N       LQNY LTRDR +RE   P RY  AD+V YAL    +SI+ EPLT+ EAI S +  
Subjt:  IQTPTENQPAETD--------VRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSD

Query:  KWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL
        +WK+AM+EE+ SL KN TW LV +P N+ L+  KWIYK+K     +   RYKARLVAKGYTQKEGVD+ EIFSPVVRHSSIR +LS+
Subjt:  KWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-5433.05Show/hide
Query:  LVEDLVEKQTER---KLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPS
        + +D V K       K+  L  DNG E+LSNE ++FC + +GI  HLTV  TPQ NG++ERM RT+ EK R ++S A L K FWGEA++TATYL+NR PS
Subjt:  LVEDLVEKQTER---KLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPS

Query:  TAI--DFKTPMEKWSNHPPDLSNLRTFGCIAYAH--SKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDE-SVIAWEKNQSET----
         A+    KTP E W N  P L +LR FG   Y H  +K+GK D+++ K +F+GY+    G++LW  +   EK I++RDV  DE +++     + ET    
Subjt:  TAI--DFKTPMEKWSNHPPDLSNLRTFGCIAYAH--SKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDE-SVIAWEKNQSET----

Query:  ETNSEKNKSFEME-LELASIQTPTENQPAETDVRVEEGAETQ-----------AETQAENIPPEPDSLQ-------------------------------
        ++   +NK+F  +  ++   + P E++  +    +++  E++            +T+  N   E D++Q                               
Subjt:  ETNSEKNKSFEME-LELASIQTPTENQPAETDVRVEEGAETQ-----------AETQAENIPPEPDSLQ-------------------------------

Query:  -NYNLTRDRQRRE------IRRPARYASADIVH-----------YALFTEMNSIDEEPLTYHEAINSI-----------NSDKWKEAMQEEMNSLLKNNT
         N N +R+ +  E      I  P +    +I++            +   E NS+++  L  H   N +           +   W+EA+  E+N+   NNT
Subjt:  -NYNLTRDRQRRE------IRRPARYASADIVH-----------YALFTEMNSIDEEPLTYHEAINSI-----------NSDKWKEAMQEEMNSLLKNNT

Query:  WELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL
        W +  RP NK +V  +W++ VK + +   P RYKARLVA+G+TQK  +DY E F+PV R SS R +LSL
Subjt:  WELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.8e-9330.41Show/hide
Query:  EIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESY
        E A S I L+L+D+V+  + + +TA  +W +L+ +Y++K+LTNKLY+K++ +   M    +   +L+ +N ++  LAN+  K+ +E++AI+LLNSLP SY
Subjt:  EIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESY

Query:  NEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFT--RGRTEKKS------------SRNNSRTEVLTVLE----GNF--------------
        + + + I + + ++ +  V SAL   + +++K    + +AL T  RGR+ ++S            S+N S++ V         G+F              
Subjt:  NEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFT--RGRTEKKS------------SRNNSRTEVLTVLE----GNF--------------

Query:  -----------------------------------DSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH-------------QNNI-----------
                                           +SEW++D+  S H TP +  F  +   D G V +GN              + N+           
Subjt:  -----------------------------------DSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH-------------QNNI-----------

Query:  ----------------------------KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-------
                                    ++ KG++V AKG+    LY  +A    G    A  +D+    LWH R+GHMSE+GL+ L+K+ L+       
Subjt:  ----------------------------KIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL-------

Query:  -----------------------------------------------------VED-------------------------LVEKQTERKLKCLRTDNGL
                                                             ++D                         LVE++T RKLK LR+DNG 
Subjt:  -----------------------------------------------------VED-------------------------LVEKQTERKLKCLRTDNGL

Query:  EFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFG
        E+ S EF+E+C  + GI    TV GTPQ NG+AERMNRT++EK+R ++  A LPK FWGEA+ TA YL+NRSPS  + F+ P   W+N     S+L+ FG
Subjt:  EFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFG

Query:  CIAYAH---SKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAWEKNQSETETN------------SEKNKSFEMELELASIQ
        C A+AH    +  KLD+++  C+F+GY     GYRLW  +  ++K I SRDV F ES +    + SE   N            S    S E   +  S Q
Subjt:  CIAYAH---SKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAWEKNQSETETN------------SEKNKSFEMELELASIQ

Query:  TPTENQPAETDVRVEEGA-ETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEE
             +  E   +++EG  E +  TQ E         Q+  L R  + R   R  RY S + V       + S D EP +  E ++    ++  +AMQEE
Subjt:  TPTENQPAETDVRVEEGA-ETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEE

Query:  MNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL
        M SL KN T++LV+ P  K  + CKW++K+K+  D  +  RYKARLV KG+ QK+G+D+ EIFSPVV+ +SIRT+LSL
Subjt:  MNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL

P92512 Uncharacterized mitochondrial protein AtMg007108.8e-1750.6Show/hide
Query:  MNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKK
        MNRT++EK+R ++    LPK F  +A  TA +++N+ PSTAI+F  P E W    P  S LR FGC+AY H  EGKL  RAKK
Subjt:  MNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.5e-3626.73Show/hide
Query:  EDLVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLT-VRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAID
        ++L+E + + ++    +DNG EF++    E+   ++  I HLT    TP+ NGL+ER +R ++E    L+S+A +PK +W  A   A YL+NR P+  + 
Subjt:  EDLVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLT-VRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAID

Query:  FKTPMEKWSNHPPDLSNLRTFGCIAYAHSK---EGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFD---------------------ES
         ++P +K     P+   LR FGC  Y   +   + KLD+++++C+FLGY      Y   C+     +  ISR V FD                     ES
Subjt:  FKTPMEKWSNHPPDLSNLRTFGCIAYAHSK---EGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFD---------------------ES

Query:  VIAW-----------------------------------------------------------------------EKNQSETETNSEKNKS-----FEME
           W                                                                       +  Q++T+T+S +N S      E  
Subjt:  VIAW-----------------------------------------------------------------------EKNQSETETNSEKNKS-----FEME

Query:  LELA-SIQTPTENQPAETDVRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIV----HYALFTEMNSIDEEPLTYHEAINSIN
         +LA S+ TP ++  +         + + + T    +   P  L       ++           A A I+     Y+L   + + + EP T   AI ++ 
Subjt:  LELA-SIQTPTENQPAETDVRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIV----HYALFTEMNSIDEEPLTYHEAINSIN

Query:  SDKWKEAMQEEMNSLLKNNTWELV-DRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL
         ++W+ AM  E+N+ + N+TW+LV   PS+  +VGC+WI+  K + D S   RYKARLVAKGY Q+ G+DY E FSPV++ +SIR +L +
Subjt:  SDKWKEAMQEEMNSLLKNNTWELV-DRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.3e-3525.65Show/hide
Query:  LLVEDLVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTA
        ++ + LVE + + ++  L +DNG EF+    +++     GI    +   TP+ NGL+ER +R ++E    L+S+A +PK +W  A   A YL+NR P+  
Subjt:  LLVEDLVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTA

Query:  IDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSK---EGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAWEKNQSETETNSEKNK
        +  ++P +K    PP+   L+ FGC  Y   +     KL++++K+C F+GY      Y   C+     +   SR V FDE    +        T+ E+  
Subjt:  IDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSK---EGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAWEKNQSETETNSEKNK

Query:  S----------------------------------------------FEMELELASIQTPTENQPAETDVRVEEGAETQAETQAEN--------------
                                                           L  +SI +P+ ++P        +      +TQ  N              
Subjt:  S----------------------------------------------FEMELELASIQTPTENQPAETDVRVEEGAETQAETQAEN--------------

Query:  ----------------------------------------------IPPEPDSLQ-------NYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDE
                                                      + P P  +Q       N +    R +  IR+P +  S     YA     NS   
Subjt:  ----------------------------------------------IPPEPDSLQ-------NYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDE

Query:  EPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKI-LVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTL
        EP T   AI ++  D+W++AM  E+N+ + N+TW+LV  P   + +VGC+WI+  K + D S   RYKARLVAKGY Q+ G+DY E FSPV++ +SIR +
Subjt:  EPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKI-LVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTL

Query:  LSL
        L +
Subjt:  LSL

Arabidopsis top hitse value%identityAlignment
AT3G10690.1 DNA GYRASE A1.7e-0752.54Show/hide
Query:  VDFVPTFDNSQKEPSLLPAQLPTLLSNGSSGITVIHLRRV--RGIGELI-VPSYFFHKP
        VDFV  FDNSQKEP++LPA+LP LL NG+SGI V     +    +GEL+ V     H P
Subjt:  VDFVPTFDNSQKEPSLLPAQLPTLLSNGSSGITVIHLRRV--RGIGELI-VPSYFFHKP

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.7e-2045.19Show/hide
Query:  EEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTL
        +EP TY+EA   +    W  AM +E+ ++   +TWE+   P NK  +GCKW+YK+K + D    +RYKARLVAKGYTQ+EG+D+ E FSPV + +S++ +
Subjt:  EEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTL

Query:  LSLN
        L+++
Subjt:  LSLN

ATMG00300.1 Gag-Pol-related retrotransposon family protein3.8e-0740.91Show/hide
Query:  IKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL
        +K++KG     KG  H+ LY+L  +   G + +A E  + +T+LWH+RL HMS+RG+  L K+G L
Subjt:  IKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.2e-1850.6Show/hide
Query:  MNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKK
        MNRT++EK+R ++    LPK F  +A  TA +++N+ PSTAI+F  P E W    P  S LR FGC+AY H  EGKL  RAKK
Subjt:  MNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.2e-1843.64Show/hide
Query:  TEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVR
        T   +I +EP      I ++    W +AMQEE+++L +N TW LV  P N+ ++GCKW++K K   D +   R KARLVAKG+ Q+EG+ + E +SPVVR
Subjt:  TEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVR

Query:  HSSIRTLLSL
         ++IRT+L++
Subjt:  HSSIRTLLSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTTTTGGGCGCGTTCCACCTCCTGCCCGTCTACGTTGATTTTGTTCCAACTTTTGATAATTCCCAAAAGGAACCTTCACTTCTACCAGCTCAGCTCCCAACTCT
ATTGTCGAACGGTTCCTCAGGGATTACGGTGATCCATCTCAGACGTGTAAGAGGCATTGGTGAATTAATTGTACCTTCCTACTTCTTTCACAAGCCAATTCCATCAATTT
CCAAGAATGTCGTAGTAGAGGATGAACCAAAAGGTTGGAAAAAAAATGCACAACTCCAGAGACAATACAATGAAATGGAGAAGAAATTGCGTGAAGTGCAGTCACAAATG
GAGAGTGGTAGACTAACACCAATGTCAGATCAAGGTAGTTGCCCACAAGTACCAGACCCTCAGCCTTCTGATGAAAAACCTGACCAGGTTGTTCATTGGAGGAGCACTGG
TCCATTAGGTCCCACCGGTAGCTCATTTAGGGCGTTGAGGCAAAATTCTAAGGAAGGAATGAATATTCGAAGAAAGAAGATCCTCCAGTTAATCTTTAGAGTTTTTGCTC
CCACAAGCTCTCAACGCAAAATCCTGTTCGAGAATATTGGTGCGATTGCTTGGTGGTGTTCAAGGGTGATTTTCAGAAAGATAAAGGTTCTTTTGGTTGCTGGATTTTTT
GCAAAAATCAAAGGAAAAGGCGAAACGTTCAAGAATTTCTACAAGGATATTCACAAGCTCACAGTCAGCCTTGAAGAGTCAAAGAACCTCTCACCAGCATGGACTGTCAT
AGTCACTTTCCAGCTCAGATTAAATTCTCCTAGTCAGCAAGCCAAGCGATCCATTCGGCAACTGGGTCTCTGTGATAGAGCGATTGCTGTAGAAGCCGTGGTGTGCTTGT
TCGAGAAGATAACGAATAATCCCTATTTGTTTTCGATTATGGCTGCTAAGTTTGAGGTAGAGCGTTTTGATGGTAGAGGCCTACCTGAATCTGAAATAAAAGAAACTAAG
GAGATTGCCTTTAGCACCATAATCTTATACTTAGCTGATAATGTTCTTCGCCAAGTTCATGAAGCTAATACAGCTGAAGAAGTCTGGAAACAATTAGATAAAATTTACCT
GACGAAATCCTTAACAAATAAGTTGTACATCAAAGAGAGATTCTTTGGTTTTAAAATGGATCCCAACAAAGACCTGGAACATAACCTTGATGAATATAATCGTATTGTGT
TAGACCTTGCAAACATTGATGAAAAAATGTCAGATGAAAATAGGGCTATTATTTTGTTGAATTCACTCCCTGAATCCTACAATGAGGTGAAATCTGCAATAAGGTACGAT
AGAGACAGCCTTTCAATGGATATAGTGTTGAGTGCTCTTAGGTCCAGAGATTTAGAACTCAAGAAGGGGAAATCAAAAGAGAGTGAAGCTTTATTTACAAGGGGAAGAAC
AGAGAAGAAATCCTCTAGAAATAATAGCAGAACAGAAGTCCTTACTGTGTTAGAAGGTAATTTTGACTCTGAATGGATCCTTGATTCTGGTTGTTCGTTCCATATGACAC
CTAATAAGCATTGGTTCCTGAATTTTGAAGAAATTGATGGTGGCAAAGTGCTACTAGGCAATCACCAAAATAACATCAAAATTGTCAAAGGAGCTATTGTCAAAGCAAAA
GGAATTTTACATAATGGTCTATATGTCCTAAGCGCGAATACAATGGTGGGGACAACTGCTGTTGCATCTGAAAGAGATCAAAAACAAACAAAGCTATGGCATGCTAGGCT
TGGGCATATGAGTGAAAGGGGTTTGAGGGAACTGTCCAAACAGGGTCTGTTAGTGGAAGACCTTGTTGAGAAACAAACTGAAAGGAAGCTAAAATGTTTGAGAACTGATA
ACGGGTTGGAATTCTTAAGCAATGAATTTAAAGAATTTTGTAAATTAACAGAAGGTATAATTAGACATTTGACTGTGAGAGGCACACCACAACAAAATGGCTTGGCTGAA
CGGATGAATAGAACTCTTCTTGAAAAAATAAGATGTTTGATGTCTAATGCGTGTTTGCCGAAAAAATTCTGGGGTGAAGCACTTATGACTGCAACCTACTTGGTCAATAG
AAGTCCTTCAACAGCCATTGATTTTAAAACACCAATGGAGAAGTGGTCCAATCACCCTCCTGATTTAAGTAATCTAAGAACCTTTGGTTGCATTGCTTATGCACATTCTA
AAGAAGGAAAATTAGATAATCGTGCCAAGAAATGTCTGTTTTTGGGGTATCAATCTGGTATAAAAGGTTATAGACTATGGTGCATTGAAAAAGGTGAAGAAAAATGCATA
ATTAGTAGGGATGTAACCTTTGATGAATCAGTAATTGCTTGGGAAAAGAATCAGAGTGAAACAGAAACAAATTCTGAAAAGAATAAATCTTTTGAAATGGAATTAGAGTT
AGCATCCATACAAACACCAACTGAAAACCAACCTGCTGAAACTGATGTTCGTGTTGAAGAAGGTGCTGAAACACAAGCTGAAACACAAGCTGAAAACATTCCACCAGAAC
CTGATTCTTTGCAAAATTATAACCTAACCCGTGATAGACAAAGAAGAGAAATAAGAAGACCAGCAAGATATGCTAGTGCAGATATTGTTCACTATGCCTTATTCACTGAA
ATGAATTCAATTGATGAAGAACCTCTAACTTATCATGAAGCTATAAACTCCATAAACAGTGATAAGTGGAAAGAAGCAATGCAAGAAGAAATGAATTCTCTTCTGAAAAA
TAATACCTGGGAATTAGTAGATAGGCCATCAAACAAAATATTGGTTGGTTGTAAGTGGATTTATAAGGTAAAACAAAGTGTTGATCCTTCACAACCTAAAAGGTACAAAG
CAAGGTTGGTTGCCAAGGGGTACACTCAAAAGGAAGGAGTGGATTATGGAGAGATTTTCTCCCCTGTAGTTAGGCATTCATCTATAAGAACCTTGCTGTCCCTTAACAAT
CTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATTTTTGGGCGCGTTCCACCTCCTGCCCGTCTACGTTGATTTTGTTCCAACTTTTGATAATTCCCAAAAGGAACCTTCACTTCTACCAGCTCAGCTCCCAACTCT
ATTGTCGAACGGTTCCTCAGGGATTACGGTGATCCATCTCAGACGTGTAAGAGGCATTGGTGAATTAATTGTACCTTCCTACTTCTTTCACAAGCCAATTCCATCAATTT
CCAAGAATGTCGTAGTAGAGGATGAACCAAAAGGTTGGAAAAAAAATGCACAACTCCAGAGACAATACAATGAAATGGAGAAGAAATTGCGTGAAGTGCAGTCACAAATG
GAGAGTGGTAGACTAACACCAATGTCAGATCAAGGTAGTTGCCCACAAGTACCAGACCCTCAGCCTTCTGATGAAAAACCTGACCAGGTTGTTCATTGGAGGAGCACTGG
TCCATTAGGTCCCACCGGTAGCTCATTTAGGGCGTTGAGGCAAAATTCTAAGGAAGGAATGAATATTCGAAGAAAGAAGATCCTCCAGTTAATCTTTAGAGTTTTTGCTC
CCACAAGCTCTCAACGCAAAATCCTGTTCGAGAATATTGGTGCGATTGCTTGGTGGTGTTCAAGGGTGATTTTCAGAAAGATAAAGGTTCTTTTGGTTGCTGGATTTTTT
GCAAAAATCAAAGGAAAAGGCGAAACGTTCAAGAATTTCTACAAGGATATTCACAAGCTCACAGTCAGCCTTGAAGAGTCAAAGAACCTCTCACCAGCATGGACTGTCAT
AGTCACTTTCCAGCTCAGATTAAATTCTCCTAGTCAGCAAGCCAAGCGATCCATTCGGCAACTGGGTCTCTGTGATAGAGCGATTGCTGTAGAAGCCGTGGTGTGCTTGT
TCGAGAAGATAACGAATAATCCCTATTTGTTTTCGATTATGGCTGCTAAGTTTGAGGTAGAGCGTTTTGATGGTAGAGGCCTACCTGAATCTGAAATAAAAGAAACTAAG
GAGATTGCCTTTAGCACCATAATCTTATACTTAGCTGATAATGTTCTTCGCCAAGTTCATGAAGCTAATACAGCTGAAGAAGTCTGGAAACAATTAGATAAAATTTACCT
GACGAAATCCTTAACAAATAAGTTGTACATCAAAGAGAGATTCTTTGGTTTTAAAATGGATCCCAACAAAGACCTGGAACATAACCTTGATGAATATAATCGTATTGTGT
TAGACCTTGCAAACATTGATGAAAAAATGTCAGATGAAAATAGGGCTATTATTTTGTTGAATTCACTCCCTGAATCCTACAATGAGGTGAAATCTGCAATAAGGTACGAT
AGAGACAGCCTTTCAATGGATATAGTGTTGAGTGCTCTTAGGTCCAGAGATTTAGAACTCAAGAAGGGGAAATCAAAAGAGAGTGAAGCTTTATTTACAAGGGGAAGAAC
AGAGAAGAAATCCTCTAGAAATAATAGCAGAACAGAAGTCCTTACTGTGTTAGAAGGTAATTTTGACTCTGAATGGATCCTTGATTCTGGTTGTTCGTTCCATATGACAC
CTAATAAGCATTGGTTCCTGAATTTTGAAGAAATTGATGGTGGCAAAGTGCTACTAGGCAATCACCAAAATAACATCAAAATTGTCAAAGGAGCTATTGTCAAAGCAAAA
GGAATTTTACATAATGGTCTATATGTCCTAAGCGCGAATACAATGGTGGGGACAACTGCTGTTGCATCTGAAAGAGATCAAAAACAAACAAAGCTATGGCATGCTAGGCT
TGGGCATATGAGTGAAAGGGGTTTGAGGGAACTGTCCAAACAGGGTCTGTTAGTGGAAGACCTTGTTGAGAAACAAACTGAAAGGAAGCTAAAATGTTTGAGAACTGATA
ACGGGTTGGAATTCTTAAGCAATGAATTTAAAGAATTTTGTAAATTAACAGAAGGTATAATTAGACATTTGACTGTGAGAGGCACACCACAACAAAATGGCTTGGCTGAA
CGGATGAATAGAACTCTTCTTGAAAAAATAAGATGTTTGATGTCTAATGCGTGTTTGCCGAAAAAATTCTGGGGTGAAGCACTTATGACTGCAACCTACTTGGTCAATAG
AAGTCCTTCAACAGCCATTGATTTTAAAACACCAATGGAGAAGTGGTCCAATCACCCTCCTGATTTAAGTAATCTAAGAACCTTTGGTTGCATTGCTTATGCACATTCTA
AAGAAGGAAAATTAGATAATCGTGCCAAGAAATGTCTGTTTTTGGGGTATCAATCTGGTATAAAAGGTTATAGACTATGGTGCATTGAAAAAGGTGAAGAAAAATGCATA
ATTAGTAGGGATGTAACCTTTGATGAATCAGTAATTGCTTGGGAAAAGAATCAGAGTGAAACAGAAACAAATTCTGAAAAGAATAAATCTTTTGAAATGGAATTAGAGTT
AGCATCCATACAAACACCAACTGAAAACCAACCTGCTGAAACTGATGTTCGTGTTGAAGAAGGTGCTGAAACACAAGCTGAAACACAAGCTGAAAACATTCCACCAGAAC
CTGATTCTTTGCAAAATTATAACCTAACCCGTGATAGACAAAGAAGAGAAATAAGAAGACCAGCAAGATATGCTAGTGCAGATATTGTTCACTATGCCTTATTCACTGAA
ATGAATTCAATTGATGAAGAACCTCTAACTTATCATGAAGCTATAAACTCCATAAACAGTGATAAGTGGAAAGAAGCAATGCAAGAAGAAATGAATTCTCTTCTGAAAAA
TAATACCTGGGAATTAGTAGATAGGCCATCAAACAAAATATTGGTTGGTTGTAAGTGGATTTATAAGGTAAAACAAAGTGTTGATCCTTCACAACCTAAAAGGTACAAAG
CAAGGTTGGTTGCCAAGGGGTACACTCAAAAGGAAGGAGTGGATTATGGAGAGATTTTCTCCCCTGTAGTTAGGCATTCATCTATAAGAACCTTGCTGTCCCTTAACAAT
CTTTGA
Protein sequenceShow/hide protein sequence
MKFLGAFHLLPVYVDFVPTFDNSQKEPSLLPAQLPTLLSNGSSGITVIHLRRVRGIGELIVPSYFFHKPIPSISKNVVVEDEPKGWKKNAQLQRQYNEMEKKLREVQSQM
ESGRLTPMSDQGSCPQVPDPQPSDEKPDQVVHWRSTGPLGPTGSSFRALRQNSKEGMNIRRKKILQLIFRVFAPTSSQRKILFENIGAIAWWCSRVIFRKIKVLLVAGFF
AKIKGKGETFKNFYKDIHKLTVSLEESKNLSPAWTVIVTFQLRLNSPSQQAKRSIRQLGLCDRAIAVEAVVCLFEKITNNPYLFSIMAAKFEVERFDGRGLPESEIKETK
EIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYD
RDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRTEVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNHQNNIKIVKGAIVKAK
GILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLLVEDLVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAE
RMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCI
ISRDVTFDESVIAWEKNQSETETNSEKNKSFEMELELASIQTPTENQPAETDVRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTE
MNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSLNN
L