; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G11750 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G11750
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr7:9954033..9960114
RNA-Seq ExpressionCSPI07G11750
SyntenyCSPI07G11750
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN64335.1 hypothetical protein VITISV_001808 [Vitis vinifera]2.1e-18352.99Show/hide
Query:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIG----------
        N  F+++C + G +HNF +PRT QQNGVVERKNRTLQE AR+MLNE  LPKYFW EA+NT+CYV NR+L+RP L KTPYELW  K PNI           
Subjt:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIG----------

Query:  ----------DDLEKDFGDLL-----------VNDKGKEIVPSMQDVNI--------IEKKEEGSSSLPKE----WRYAL--------SHPKDLILGNPE
                   D + D G  L            N +   +  S+ D  +         +  ++    +P++    W Y L        +HP+D I+GNP 
Subjt:  ----------DDLEKDFGDLL-----------VNDKGKEIVPSMQDVNI--------IEKKEEGSSSLPKE----WRYAL--------SHPKDLILGNPE

Query:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIY
         GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AMQ+ELNQFE ++V +LVPRP N S+IGTKWVFRNKMDENG I+RNKARLVAQG+ QEE I 
Subjt:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIY

Query:  YEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------WLKTSSKSLCMHNEFEMSMMG
        YEETFA VARLEAIRMLLAFA +K FI YQMDVK AFLNG+I EE+YVEQP GF+  +                     W +  SK L +   F+M  + 
Subjt:  YEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------WLKTSSKSLCMHNEFEMSMMG

Query:  ELSFFLGFQIKQLKDDIFIS-------------------QEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDI
           F    +   L   I++                      KY ++LLK+F + + +V KT MS++ KLD DEKGK +D   YRGMI SLLYLTAS+PDI
Subjt:  ELSFFLGFQIKQLKDDIFIS-------------------QEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDI

Query:  MFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIA
        M+SVCLCARFQSCPKESH  AVKRIL+YL GT+ +GLWYP+   F LIG+SDA+FAG  ++ KSTS TC  LG SLVSW SKKQNS+ALST EAEY A +
Subjt:  MFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIA

Query:  SCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFLS
         C A+I+WMKQ L DF L F++VPI CDN SAIN++KNP+ HSRTKHI+IRHHF+R+H Q G ITLEF+S
Subjt:  SCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFLS

KYP33754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]4.5e-18651.46Show/hide
Query:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------
        N  F  FCEENG  HNFS+PRTPQQNGVVERKNR+L+E AR+MLN+  LPKYFW EAVNTACY  NR L+RP L KTPYEL++G+ PNI           
Subjt:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------

Query:  ------------DLEKDFGDLL---VNDKG--------------------------------KEIVPSMQDVNIIEKK--------------EEGSSSL-
                    D + D G  L   +N K                                  EIV S +D +I E+               +EG +++ 
Subjt:  ------------DLEKDFGDLL---VNDKG--------------------------------KEIVPSMQDVNIIEKK--------------EEGSSSL-

Query:  -PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG
          +EWR + +HP + I+G+  +GV TR+SL    +N++FVS+IE ++  +A  DE WI AMQEELNQFE N+V  LV RP N  IIGTKW+FRNK+DE+G
Subjt:  -PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------
         +IRNKARLVA+G+ QEE I YEET+A VARLEAIRMLLA+AS   F  YQMDVK AFLNG+I EEVYVEQP GFE                        
Subjt:  NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------

Query:  WLKTSSKSLC------------------------------------------------MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKK
        W +  SK L                                                 M +EFEMSMMGEL+FFLG QI+Q K+ IFI+Q KY + LLK+
Subjt:  WLKTSSKSLC------------------------------------------------MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKK

Query:  FKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY
        F +   +   T MSTT  LDKDE GK +D+K YRGMI SLLYL+AS+PDIMFSVC CAR+QS PKESH  AVKRI++YLL T ++GLWYP+N+ FNL+GY
Subjt:  FKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY

Query:  SDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDI
        SD++FAG   D KSTS TC F+GS+LVSW SKKQNSVALST EAEYIA  SCCA+I+WMKQ L D+GL  D++PI CDN SAINL+KNP+ HSRTKHI+I
Subjt:  SDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDI

Query:  RHHFIREHVQNGHITLEFL
        RHHF+R+HVQ G   LEF+
Subjt:  RHHFIREHVQNGHITLEFL

KYP66812.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]4.5e-18651.46Show/hide
Query:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------
        N  F  FCEENG  HNFS+PRTPQQNGVVERKNR+L+E AR+MLN+  LPKYFW EAVNTACY  NR L+RP L KTPYEL++G+ PNI           
Subjt:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------

Query:  ------------DLEKDFGDLL---VNDKG--------------------------------KEIVPSMQDVNIIEKK--------------EEGSSSL-
                    D + D G  L   +N K                                  EIV S +D ++ E+               +EG +++ 
Subjt:  ------------DLEKDFGDLL---VNDKG--------------------------------KEIVPSMQDVNIIEKK--------------EEGSSSL-

Query:  -PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG
          +EWR + +HP + I+G+  +GV TR+SL    +N++FVS+IE ++  +A  DE WI AMQEELNQFE N+V  LV RP N  IIGTKW+FRNK+DE+G
Subjt:  -PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------
         +IRNKARLVA+G+ QEE I YEET+A VARLEAIRMLLA+AS   F  YQMDVK AFLNG+I EEVYVEQP GFE                        
Subjt:  NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------

Query:  WLKTSSKSLC------------------------------------------------MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKK
        W +  SK L                                                 M +EFEMSMMGEL+FFLG QI+Q K+ IFI+Q KY + LLK+
Subjt:  WLKTSSKSLC------------------------------------------------MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKK

Query:  FKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY
        F +   +   T MSTT  LDKDE GK +D+K YRGMI SLLYL+AS+PDIMFSVCLCAR+QS PKESH  AVKRI++ LLGT ++GLWYP+N+ FNL+GY
Subjt:  FKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY

Query:  SDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDI
        SD++FAG   D KSTS TC F+GS+LVSW SKKQNSVALST EAEYIA  SCCA+I+WMKQ L D+GL  D++PI CDN SAINL+KNP+ HSRTKHI+I
Subjt:  SDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDI

Query:  RHHFIREHVQNGHITLEFL
        RHHF+R+HVQ G   LEF+
Subjt:  RHHFIREHVQNGHITLEFL

RVW71911.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.7e-20453.63Show/hide
Query:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIG----------
        N  F+++C ++G +HNFS+PRTPQQNGVVERKNRTLQE AR+MLNE  LPKYFW EAVNT+CYV NR+L+RP L KTPYELW  K PNI           
Subjt:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIG----------

Query:  --------------------------------------------------------------DD--LEKDFGDLLVNDKGKEIV----PSMQDVNII---
                                                                      DD  LE   G L + DK ++      P  +D  +    
Subjt:  --------------------------------------------------------------DD--LEKDFGDLLVNDKGKEIV----PSMQDVNII---

Query:  --EKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTK
          + + E S  LPK+W++ ++HP+D I+GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AMQEELNQFE ++V +LVPRP N S+IGTK
Subjt:  --EKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTK

Query:  WVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFE-------------
        WVFRNKMDENG I+RNKARLVAQG+ QEE I YEETFA VARLEAIRMLLAFA +K FI YQMDVK AFLNG+I EEVYVEQP GF+             
Subjt:  WVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFE-------------

Query:  ----------------------KG--------SLWLKTSSK-------------------------SLCMHNEFEMSMMGELSFFLGFQIKQLKDDIFIS
                              KG        +L++KT  K                         S CMH+EFEMSMMGEL++FLG QIKQLK+  FI+
Subjt:  ----------------------KG--------SLWLKTSSK-------------------------SLCMHNEFEMSMMGELSFFLGFQIKQLKDDIFIS

Query:  QEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWY
        Q KY ++LLK+F + + +V KT MS++ KLD DEKGK +D   YRGMI SLLYLTAS+PDIM+SVCLCARFQSCPKESH  AVKRIL+YL GT+++GLWY
Subjt:  QEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWY

Query:  PRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNP
        P+   F LIG+SDA+FAG  ++ KSTS TC FLG SLVSW SKKQNSVALST EAEYIA   CCA+I+WMKQ L DF L F++VPI CDN SAIN++KNP
Subjt:  PRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNP

Query:  IHHSRTKHIDIRHHFIREHVQNGHITLEFLS
        + HSRTKHI+IRHHF+R+H Q G ITLEF+S
Subjt:  IHHSRTKHIDIRHHFIREHVQNGHITLEFLS

XP_042980087.1 uncharacterized protein LOC122310269, partial [Carya illinoinensis]2.3e-18255.78Show/hide
Query:  INDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGDDLEKDFGD
        +N   + FC+ENGF HNFS+PRTPQQNGVVERKNR+LQE AR+MLNE  LP YFW EAV+TACYV NRV++R  LDKTPYELW+ K PNIG         
Subjt:  INDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGDDLEKDFGD

Query:  LLVNDK---GKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFE
         ++ND+   GK    S + + +      G S+  K +R  + + K L             ++    ++ F  QIEP++  DA  DE WILAMQEELNQFE
Subjt:  LLVNDK---GKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFE

Query:  MNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYV
         N V  LVPRP N +IIGTKWVFRNK DE+G I RNKARLVAQGF QEE I Y+ET+A VARLEAIRMLLA+A YK F  +QMDVK AFLNG+I EEVYV
Subjt:  MNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYV

Query:  EQPLGF-----------------------------------EKG--------SLWLK-------------------TSSKSLC------MHNEFEMSMMG
        EQP GF                                   EKG        +L++K                    +++++C      M  EFEMSMMG
Subjt:  EQPLGF-----------------------------------EKG--------SLWLK-------------------TSSKSLC------MHNEFEMSMMG

Query:  ELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHF
        EL+FFLG QIKQ K   FI+Q KY + LLKKF +   +   T MS +TKLDKDE GK VD K YRGMI SLLYLTAS+PDIMFSVCLCARFQS PKESH 
Subjt:  ELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHF

Query:  HAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLK
         AVKRIL+YL GTI++GLWYP++  F+LI Y+DA++AG  +D KSTS  C FLG +LVSWFSKKQNSVALSTTEAEY+A  SCCA++++MKQ L DF L 
Subjt:  HAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLK

Query:  FDNVPIFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEF
        ++++PI CDN SAINL+KNPI HSRTKHI+IR+HF+R+HVQ G I LEF
Subjt:  FDNVPIFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEF

TrEMBL top hitse value%identityAlignment
A0A151QU14 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-18651.46Show/hide
Query:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------
        N  F  FCEENG  HNFS+PRTPQQNGVVERKNR+L+E AR+MLN+  LPKYFW EAVNTACY  NR L+RP L KTPYEL++G+ PNI           
Subjt:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------

Query:  ------------DLEKDFGDLL---VNDKG--------------------------------KEIVPSMQDVNIIEKK--------------EEGSSSL-
                    D + D G  L   +N K                                  EIV S +D +I E+               +EG +++ 
Subjt:  ------------DLEKDFGDLL---VNDKG--------------------------------KEIVPSMQDVNIIEKK--------------EEGSSSL-

Query:  -PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG
          +EWR + +HP + I+G+  +GV TR+SL    +N++FVS+IE ++  +A  DE WI AMQEELNQFE N+V  LV RP N  IIGTKW+FRNK+DE+G
Subjt:  -PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------
         +IRNKARLVA+G+ QEE I YEET+A VARLEAIRMLLA+AS   F  YQMDVK AFLNG+I EEVYVEQP GFE                        
Subjt:  NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------

Query:  WLKTSSKSLC------------------------------------------------MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKK
        W +  SK L                                                 M +EFEMSMMGEL+FFLG QI+Q K+ IFI+Q KY + LLK+
Subjt:  WLKTSSKSLC------------------------------------------------MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKK

Query:  FKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY
        F +   +   T MSTT  LDKDE GK +D+K YRGMI SLLYL+AS+PDIMFSVC CAR+QS PKESH  AVKRI++YLL T ++GLWYP+N+ FNL+GY
Subjt:  FKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY

Query:  SDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDI
        SD++FAG   D KSTS TC F+GS+LVSW SKKQNSVALST EAEYIA  SCCA+I+WMKQ L D+GL  D++PI CDN SAINL+KNP+ HSRTKHI+I
Subjt:  SDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDI

Query:  RHHFIREHVQNGHITLEFL
        RHHF+R+HVQ G   LEF+
Subjt:  RHHFIREHVQNGHITLEFL

A0A151TAG4 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment)2.5e-18251.04Show/hide
Query:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------
        N  F  FCEENG  HNFS+PRTPQQNGVVERKNR+L+E AR+MLN+  LPKYFW EAVNTACY  NR L+RP L KTPYEL++G+ PNI           
Subjt:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------

Query:  ------------DLEKDFGDLL---VNDKG--------------------------------KEIVPSMQDVNIIEKK--------------EEGSSSL-
                    D + D G  L   +N K                                  EIV S +D +I E+               +EG +++ 
Subjt:  ------------DLEKDFGDLL---VNDKG--------------------------------KEIVPSMQDVNIIEKK--------------EEGSSSL-

Query:  -PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG
          +EWR + +HP + I+G+  +GV TR+SL    +N++FVS+IE ++  +A  DE WI AMQEELNQFE N+V  LV RP N  IIGTKW+FRNK+DE+G
Subjt:  -PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------
         +IRNKARLVA+G+ QEE I YEET+A VARLEAIRMLLA+AS   F  YQMDVK AFLNG+I EEVYVEQP GFE                        
Subjt:  NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------

Query:  WLKTSSKSLC------------------------------------------------MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKK
        W +  SK L                                                 M +EFEMSMMGEL+FFLG QI+Q K+ IFI+Q KY + LLK+
Subjt:  WLKTSSKSLC------------------------------------------------MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKK

Query:  FKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY
        F +   +   T MSTT  LDKDE GK +D+K YRGMI SLLYL+ S+P+IMFSVCLC R+QS PKESH  AVKRI++YLLGT ++GLWY +N+ FNL+GY
Subjt:  FKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY

Query:  SDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDI
        SD++FAG   D KS S TC F+GS+LVSW SKKQNSVALST EAEYIA  S CA+I+WMKQ L DFGL  D+VPI CDN SAINL+KN + HSRTKHI+I
Subjt:  SDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDI

Query:  RHHFIREHVQNGHITLEFL
        RHHF+R+HVQ G   LEF+
Subjt:  RHHFIREHVQNGHITLEFL

A0A151TIF5 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-18651.46Show/hide
Query:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------
        N  F  FCEENG  HNFS+PRTPQQNGVVERKNR+L+E AR+MLN+  LPKYFW EAVNTACY  NR L+RP L KTPYEL++G+ PNI           
Subjt:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGD---------

Query:  ------------DLEKDFGDLL---VNDKG--------------------------------KEIVPSMQDVNIIEKK--------------EEGSSSL-
                    D + D G  L   +N K                                  EIV S +D ++ E+               +EG +++ 
Subjt:  ------------DLEKDFGDLL---VNDKG--------------------------------KEIVPSMQDVNIIEKK--------------EEGSSSL-

Query:  -PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG
          +EWR + +HP + I+G+  +GV TR+SL    +N++FVS+IE ++  +A  DE WI AMQEELNQFE N+V  LV RP N  IIGTKW+FRNK+DE+G
Subjt:  -PKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENG

Query:  NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------
         +IRNKARLVA+G+ QEE I YEET+A VARLEAIRMLLA+AS   F  YQMDVK AFLNG+I EEVYVEQP GFE                        
Subjt:  NIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------

Query:  WLKTSSKSLC------------------------------------------------MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKK
        W +  SK L                                                 M +EFEMSMMGEL+FFLG QI+Q K+ IFI+Q KY + LLK+
Subjt:  WLKTSSKSLC------------------------------------------------MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKK

Query:  FKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY
        F +   +   T MSTT  LDKDE GK +D+K YRGMI SLLYL+AS+PDIMFSVCLCAR+QS PKESH  AVKRI++ LLGT ++GLWYP+N+ FNL+GY
Subjt:  FKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGY

Query:  SDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDI
        SD++FAG   D KSTS TC F+GS+LVSW SKKQNSVALST EAEYIA  SCCA+I+WMKQ L D+GL  D++PI CDN SAINL+KNP+ HSRTKHI+I
Subjt:  SDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDI

Query:  RHHFIREHVQNGHITLEFL
        RHHF+R+HVQ G   LEF+
Subjt:  RHHFIREHVQNGHITLEFL

A0A438GI90 Retrovirus-related Pol polyprotein from transposon TNT 1-948.1e-20553.63Show/hide
Query:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIG----------
        N  F+++C ++G +HNFS+PRTPQQNGVVERKNRTLQE AR+MLNE  LPKYFW EAVNT+CYV NR+L+RP L KTPYELW  K PNI           
Subjt:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIG----------

Query:  --------------------------------------------------------------DD--LEKDFGDLLVNDKGKEIV----PSMQDVNII---
                                                                      DD  LE   G L + DK ++      P  +D  +    
Subjt:  --------------------------------------------------------------DD--LEKDFGDLLVNDKGKEIV----PSMQDVNII---

Query:  --EKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTK
          + + E S  LPK+W++ ++HP+D I+GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AMQEELNQFE ++V +LVPRP N S+IGTK
Subjt:  --EKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTK

Query:  WVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFE-------------
        WVFRNKMDENG I+RNKARLVAQG+ QEE I YEETFA VARLEAIRMLLAFA +K FI YQMDVK AFLNG+I EEVYVEQP GF+             
Subjt:  WVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFE-------------

Query:  ----------------------KG--------SLWLKTSSK-------------------------SLCMHNEFEMSMMGELSFFLGFQIKQLKDDIFIS
                              KG        +L++KT  K                         S CMH+EFEMSMMGEL++FLG QIKQLK+  FI+
Subjt:  ----------------------KG--------SLWLKTSSK-------------------------SLCMHNEFEMSMMGELSFFLGFQIKQLKDDIFIS

Query:  QEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWY
        Q KY ++LLK+F + + +V KT MS++ KLD DEKGK +D   YRGMI SLLYLTAS+PDIM+SVCLCARFQSCPKESH  AVKRIL+YL GT+++GLWY
Subjt:  QEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWY

Query:  PRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNP
        P+   F LIG+SDA+FAG  ++ KSTS TC FLG SLVSW SKKQNSVALST EAEYIA   CCA+I+WMKQ L DF L F++VPI CDN SAIN++KNP
Subjt:  PRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNP

Query:  IHHSRTKHIDIRHHFIREHVQNGHITLEFLS
        + HSRTKHI+IRHHF+R+H Q G ITLEF+S
Subjt:  IHHSRTKHIDIRHHFIREHVQNGHITLEFLS

A5C8K0 Uncharacterized protein1.0e-18352.99Show/hide
Query:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIG----------
        N  F+++C + G +HNF +PRT QQNGVVERKNRTLQE AR+MLNE  LPKYFW EA+NT+CYV NR+L+RP L KTPYELW  K PNI           
Subjt:  NDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIG----------

Query:  ----------DDLEKDFGDLL-----------VNDKGKEIVPSMQDVNI--------IEKKEEGSSSLPKE----WRYAL--------SHPKDLILGNPE
                   D + D G  L            N +   +  S+ D  +         +  ++    +P++    W Y L        +HP+D I+GNP 
Subjt:  ----------DDLEKDFGDLL-----------VNDKGKEIVPSMQDVNI--------IEKKEEGSSSLPKE----WRYAL--------SHPKDLILGNPE

Query:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIY
         GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AMQ+ELNQFE ++V +LVPRP N S+IGTKWVFRNKMDENG I+RNKARLVAQG+ QEE I 
Subjt:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIY

Query:  YEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------WLKTSSKSLCMHNEFEMSMMG
        YEETFA VARLEAIRMLLAFA +K FI YQMDVK AFLNG+I EE+YVEQP GF+  +                     W +  SK L +   F+M  + 
Subjt:  YEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGSL--------------------WLKTSSKSLCMHNEFEMSMMG

Query:  ELSFFLGFQIKQLKDDIFIS-------------------QEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDI
           F    +   L   I++                      KY ++LLK+F + + +V KT MS++ KLD DEKGK +D   YRGMI SLLYLTAS+PDI
Subjt:  ELSFFLGFQIKQLKDDIFIS-------------------QEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDI

Query:  MFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIA
        M+SVCLCARFQSCPKESH  AVKRIL+YL GT+ +GLWYP+   F LIG+SDA+FAG  ++ KSTS TC  LG SLVSW SKKQNS+ALST EAEY A +
Subjt:  MFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIA

Query:  SCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFLS
         C A+I+WMKQ L DF L F++VPI CDN SAIN++KNP+ HSRTKHI+IRHHF+R+H Q G ITLEF+S
Subjt:  SCCAKIIWMKQILCDFGLKFDNVPIFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFLS

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.1e-6926.86Show/hide
Query:  INDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD--KTPYELWHGKIPNI--------
        +++  + FC + G S++ + P TPQ NGV ER  RT+ E AR+M++   L K FW EAV TA Y+ NR+  R  +D  KTPYE+WH K P +        
Subjt:  INDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD--KTPYELWHGKIPNI--------

Query:  ----------GDDLEKDFGDLL-------------VNDK---GKEIV-----------PSMQDVNIIEKKEEGSSSLPKEWRYAL---------------
                  G   +K F  +              VN+K    +++V              + V + + KE  + + P + R  +               
Subjt:  ----------GDDLEKDFGDLL-------------VNDK---GKEIV-----------PSMQDVNIIEKKEEGSSSLPKEWRYAL---------------

Query:  -------------------------------------------------------------------------------SHPKDLILGNP----------
                                                                                        H K++ + NP          
Subjt:  -------------------------------------------------------------------------------SHPKDLILGNP----------

Query:  --EQGVKTR---------SSLN-LFSNLAFVSQIEPRSFKDAECDE---FWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNK
           + +KT+         +SLN +  N   +    P SF + +  +    W  A+  ELN  ++N    +  RP N +I+ ++WVF  K +E GN IR K
Subjt:  --EQGVKTR---------SSLN-LFSNLAFVSQIEPRSFKDAECDE---FWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNK

Query:  ARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGS------------------LWLK-----
        ARLVA+GF Q+  I YEETFA VAR+ + R +L+         +QMDVK AFLNG + EE+Y+  P G    S                   W +     
Subjt:  ARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGS------------------LWLK-----

Query:  -------TSSKSLCMH--------------------------------------NEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKG
                SS   C++                                       +F M+ + E+  F+G +I+  +D I++SQ  Y + +L KF +   
Subjt:  -------TSSKSLCMH--------------------------------------NEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKG

Query:  QVAKTRMSTTTKLDKDEKGKCVDIKT-YRGMIKSLLY-LTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEF--NLIGYSD
            T +   +K++ +      D  T  R +I  L+Y +  ++PD+  +V + +R+ S      +  +KR+L+YL GTID+ L + +N+ F   +IGY D
Subjt:  QVAKTRMSTTTKLDKDEKGKCVDIKT-YRGMIKSLLY-LTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEF--NLIGYSD

Query:  ANFAGSLLDHKSTS-RTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDN-VPIFCDNISAINLTKNPIHHSRTKHIDI
        +++AGS +D KST+    +    +L+ W +K+QNSVA S+TEAEY+A+     + +W+K +L    +K +N + I+ DN   I++  NP  H R KHIDI
Subjt:  ANFAGSLLDHKSTS-RTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDN-VPIFCDNISAINLTKNPIHHSRTKHIDI

Query:  RHHFIREHVQNGHITLEFL
        ++HF RE VQN  I LE++
Subjt:  RHHFIREHVQNGHITLEFL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-6829.76Show/hide
Query:  FKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGK------------------
        F+++C  +G  H  + P TPQ NGV ER NRT+ E  RSML    LPK FW EAV TACY+ NR    P   + P  +W  K                  
Subjt:  FKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGK------------------

Query:  ------------IPNI----GDDLEKDFGDLLVNDKGKEIVPSMQDV----------NIIEKKEEG-----------------SSSLPKEWRYALSHPKD
                    IP I    GD+   +FG  L +   K+++ S   V          ++ EK + G                 + S   E       P +
Subjt:  ------------IPNI----GDDLEKDFGDLLVNDKGKEIVPSMQDV----------NIIEKKEEG-----------------SSSLPKEWRYALSHPKD

Query:  LI------------LGNPEQGVKTRSSL----------NLFSNLAFV---SQIEPRSFKDA----ECDEFWILAMQEELNQFEMNKVRKLVPRPYNASII
        +I            + +P QG +    L            + +  +V      EP S K+     E ++  + AMQEE+   + N   KLV  P     +
Subjt:  LI------------LGNPEQGVKTRSSL----------NLFSNLAFV---SQIEPRSFKDA----ECDEFWILAMQEELNQFEMNKVRKLVPRPYNASII

Query:  GTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFE----------
          KWVF+ K D +  ++R KARLV +GF Q++ I ++E F+ V ++ +IR +L+ A+       Q+DVK AFL+G + EE+Y+EQP GFE          
Subjt:  GTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFE----------

Query:  -----------------------KGSLWLKTSS------------------------------KSLC------MHNEFEMSMMGELSFFLGFQI--KQLK
                               K   +LKT S                              K L       +   F+M  +G     LG +I  ++  
Subjt:  -----------------------KGSLWLKTSS------------------------------KSLC------MHNEFEMSMMGELSFFLGFQI--KQLK

Query:  DDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDK-------DEKGKCVDIKTYRGMIKSLLY-LTASKPDIMFSVCLCARFQSCPKESHFHAVKRI
          +++SQEKY   +L++F +   +   T ++   KL K       +EKG    +  Y   + SL+Y +  ++PDI  +V + +RF   P + H+ AVK I
Subjt:  DDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDK-------DEKGKCVDIKTYRGMIKSLLY-LTASKPDIMFSVCLCARFQSCPKESHFHAVKRI

Query:  LKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPI
        L+YL GT    L +  +    L GY+DA+ AG + + KS++          +SW SK Q  VALSTTEAEYIA      ++IW+K+ L + GL      +
Subjt:  LKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVPI

Query:  FCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQN
        +CD+ SAI+L+KN ++H+RTKHID+R+H+IRE V +
Subjt:  FCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQN

P92519 Uncharacterized mitochondrial protein AtMg008101.9e-2533.5Show/hide
Query:  MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEK---GKCVDIKTYRGMIKSLLYLTASKPDIMFSVC
        + + F M  +G + +FLG QIK     +F+SQ KY   +L     N G +    MST   L  +      K  D   +R ++ +L YLT ++PDI ++V 
Subjt:  MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEK---GKCVDIKTYRGMIKSLLYLTASKPDIMFSVC

Query:  LCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAK
        +  +    P  + F  +KR+L+Y+ GTI  GL+  +N + N+  + D+++AG     +ST+  C FLG +++SW +K+Q +V+ S+TE EY A+A   A+
Subjt:  LCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAK

Query:  IIW
        + W
Subjt:  IIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.8e-6131.89Show/hide
Query:  LAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLV-PRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAI
        ++  ++ EPR+   A  DE W  AM  E+N    N    LV P P + +I+G +W+F  K + +G++ R KARLVA+G+ Q   + Y ETF+ V +  +I
Subjt:  LAFVSQIEPRSFKDAECDEFWILAMQEELNQFEMNKVRKLV-PRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAI

Query:  RMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGF-------------------------------------------EKGSLWLKTSSKSL---
        R++L  A  +++   Q+DV  AFL G + ++VY+ QP GF                                              SL++    KS+   
Subjt:  RMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGF-------------------------------------------EKGSLWLKTSSKSL---

Query:  ---------------CMHN-------EFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRG
                        +HN        F +    EL +FLG + K++   + +SQ +Y  +LL +  +   +   T M+ + KL      K  D   YRG
Subjt:  ---------------CMHN-------EFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRG

Query:  MIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQN
        ++ SL YL  ++PDI ++V   ++F   P E H  A+KRIL+YL GT + G++  +    +L  YSDA++AG   D+ ST+    +LG   +SW SKKQ 
Subjt:  MIKSLLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQN

Query:  SVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVP-IFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFLS
         V  S+TEAEY ++A+  +++ W+  +L + G++    P I+CDN+ A  L  NP+ HSR KHI I +HFIR  VQ+G + +  +S
Subjt:  SVALSTTEAEYIAIASCCAKIIWMKQILCDFGLKFDNVP-IFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.0e-0735.63Show/hide
Query:  AFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPN
        A  ++  ++G SH  S P TP+ NG+ ERK+R + E   ++L+   +PK +W  A   A Y+ NR L  P L  ++P++   G  PN
Subjt:  AFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-6131.11Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFEMNKVRKLV-PRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFA
        EPR+   A  D+ W  AM  E+N    N    LV P P + +I+G +W+F  K + +G++ R KARLVA+G+ Q   + Y ETF+ V +  +IR++L  A
Subjt:  EPRSFKDAECDEFWILAMQEELNQFEMNKVRKLV-PRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFA

Query:  SYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGF-------------------------------------------EKGSLWLKTSSKSL----------
          +++   Q+DV  AFL G + +EVY+ QP GF                                              SL++    +S+          
Subjt:  SYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGF-------------------------------------------EKGSLWLKTSSKSL----------

Query:  ---------------CMHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLY
                        +   F +    +L +FLG + K++   + +SQ +YT +LL +  +   +   T M+T+ KL      K  D   YRG++ SL Y
Subjt:  ---------------CMHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLY

Query:  LTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTT
        L  ++PD+ ++V   +++   P + H++A+KR+L+YL GT D G++  +    +L  YSDA++AG   D+ ST+    +LG   +SW SKKQ  V  S+T
Subjt:  LTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTT

Query:  EAEYIAIASCCAKIIWMKQILCDFGLKFDNVP-IFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFLS
        EAEY ++A+  +++ W+  +L + G++  + P I+CDN+ A  L  NP+ HSR KHI + +HFIR  VQ+G + +  +S
Subjt:  EAEYIAIASCCAKIIWMKQILCDFGLKFDNVP-IFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFLS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-0836.47Show/hide
Query:  KDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPN
        +D+  ++G SH  S P TP+ NG+ ERK+R + E   ++L+   +PK +W  A + A Y+ NR L  P L  ++P++   G+ PN
Subjt:  KDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPN

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.6e-5829.72Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFAS
        EP ++ +A+    W  AM +E+   E     ++   P N   IG KWV++ K + +G I R KARLVA+G+ Q+E I + ETF+ V +L +++++LA ++
Subjt:  EPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFAS

Query:  YKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGF----------------EKGSLWLKTSSKS---------------------------------------
           F  +Q+D+  AFLNG + EE+Y++ P G+                +K    LK +S+                                        
Subjt:  YKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGF----------------EKGSLWLKTSSKS---------------------------------------

Query:  ----LCMHNE-------------FEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKS
            +C +N+             F++  +G L +FLG +I +    I I Q KY  +LL +  L   + +   M  +        G  VD K YR +I  
Subjt:  ----LCMHNE-------------FEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKS

Query:  LLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVAL
        L+YL  ++ DI F+V   ++F   P+ +H  AV +IL Y+ GT+  GL+Y    E  L  +SDA+F       +ST+  C FLG+SL+SW SKKQ  V+ 
Subjt:  LLYLTASKPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVAL

Query:  STTEAEYIAIASCCAKIIWMKQILCDFGLKFDN-VPIFCDNISAINLTKNPIHHSRTKHIDIRHHFIREH-VQNGHITLEFLSW----GILRYLFVIHKR
        S+ EAEY A++    +++W+ Q   +  L       +FCDN +AI++  N + H RTKHI+   H +RE  V    ++  F ++    G   YL  I + 
Subjt:  STTEAEYIAIASCCAKIIWMKQILCDFGLKFDN-VPIFCDNISAINLTKNPIHHSRTKHIDIRHHFIREH-VQNGHITLEFLSW----GILRYLFVIHKR

Query:  LGVVVISV
          + ++S+
Subjt:  LGVVVISV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-0441.82Show/hide
Query:  NRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIP
        NRT+ E  RSML E GLPK F  +A NTA ++ N+          P E+W   +P
Subjt:  NRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIP

ATMG00810.1 DNA/RNA polymerases superfamily protein1.3e-2633.5Show/hide
Query:  MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEK---GKCVDIKTYRGMIKSLLYLTASKPDIMFSVC
        + + F M  +G + +FLG QIK     +F+SQ KY   +L     N G +    MST   L  +      K  D   +R ++ +L YLT ++PDI ++V 
Subjt:  MHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEK---GKCVDIKTYRGMIKSLLYLTASKPDIMFSVC

Query:  LCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAK
        +  +    P  + F  +KR+L+Y+ GTI  GL+  +N + N+  + D+++AG     +ST+  C FLG +++SW +K+Q +V+ S+TE EY A+A   A+
Subjt:  LCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAK

Query:  IIW
        + W
Subjt:  IIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.0e-1849.49Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFA
        EP+S   A  D  W  AMQEEL+    NK   LVP P N +I+G KWVF+ K+  +G + R KARLVA+GF QEE IY+ ET++ V R   IR +L  A
Subjt:  EPRSFKDAECDEFWILAMQEELNQFEMNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTGTCCCATGGGATCACCAATTATTATGCATCCATCGGGAGCATTAGACTGATATGTGTTTATGCAGAACACTTGACTGAACTGTACGTCCCTCAAGGCGTTAG
ATTGATACGTATATTCTATGGGATCACAAGACTGACTATGCAGGGTTTAGGCTATATTGATGAATCATCTACTCCTTCAAGTTCTAAAACTACATTTGTTCAAGCATCAC
TTATTGTGCCTAAGCTTAACATGCCTAATGATGTGTCTAATCATGTTAAATCTAGTTTTTGGTTGCTCAAGACACATGACGAGAGACCGATCCAAGTTATCTCTTTCTCC
AAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGAGTGATCACGAAGGAGAATTTGATAATTAATGATGCTTTTAAAGATTTTTGTGAAGAAAATGGTTT
TTCCCATAATTTTTCTTCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTAC
CTAAATATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATT
CCAAATATTGGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAATATCATAGAAAAGAAAGA
AGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATT
TATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAA
ATGAACAAAGTTCGGAAATTAGTCCCTAGGCCGTATAATGCATCTATAATTGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAA
AGCTAGACTTGTAGCTCAAGGTTTTTGTCAAGAAGAAGATATATATTATGAAGAGACTTTTGCAACGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTT
CTTATAAAACATTCATTTTCTATCAAATGGATGTAAAATGTGCTTTTTTAAACGGTTATATTGTGGAGGAAGTTTACGTAGAACAACCTCTGGGCTTTGAAAAAGGCTCT
TTATGGCTTAAAACAAGCTCCAAGAGCTTGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTTTTCAAATCAAACAACTCAAGGA
TGACATCTTCATAAGTCAAGAAAAATACACAAGGAATTTGCTTAAGAAATTCAAATTAAATAAAGGTCAAGTTGCAAAAACTCGTATGAGCACTACCACTAAGCTTGACA
AAGATGAAAAGGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCAAATCTTTACTTTATTTGACCGCTAGTAAACCCGATATCATGTTTAGTGTATGTCTTTGT
GCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGA
GTTTAATTTGATAGGATATTCCGATGCGAATTTTGCCGGTAGTTTACTTGACCATAAAAGTACTAGTAGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTA
GTAAAAAGCAAAATTCGGTTGCTTTATCCACTACCGAAGCGGAATATATTGCGATTGCTAGTTGTTGTGCAAAAATTATTTGGATGAAACAAATTCTTTGTGATTTTGGA
TTAAAATTTGATAATGTGCCTATATTTTGTGATAATATTAGTGCCATAAATTTGACTAAGAATCCGATTCATCATTCTAGAACTAAGCATATAGATATTAGGCATCACTT
TATTAGAGAGCATGTACAAAATGGTCATATTACTCTTGAGTTTCTCTCCTGGGGCATCCTTCGATACTTATTTGTGATCCACAAGAGACTGGGAGTAGTTGTGATCTCTG
TCCATAGTAGTCATCCAGCCAGGAAGGGAAAGACCACGACACGCAGTGGAATACGAAAAAGAGCATCGAGCACAGCTTTCGTGTCACCATTGAGAAATCTGATGTTGTTA
GTGCTAGGGGAAGAAGATGGCGGTGAATTGGCGATTTTGAGGAAGAAAAAGAAGAAAAGAGAGTGTGAGAGAAGGGGAGAGGAGGGAGAAAGTAGAGAGAGAGGCAGTAG
TTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGTGTCCCATGGGATCACCAATTATTATGCATCCATCGGGAGCATTAGACTGATATGTGTTTATGCAGAACACTTGACTGAACTGTACGTCCCTCAAGGCGTTAG
ATTGATACGTATATTCTATGGGATCACAAGACTGACTATGCAGGGTTTAGGCTATATTGATGAATCATCTACTCCTTCAAGTTCTAAAACTACATTTGTTCAAGCATCAC
TTATTGTGCCTAAGCTTAACATGCCTAATGATGTGTCTAATCATGTTAAATCTAGTTTTTGGTTGCTCAAGACACATGACGAGAGACCGATCCAAGTTATCTCTTTCTCC
AAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGAGTGATCACGAAGGAGAATTTGATAATTAATGATGCTTTTAAAGATTTTTGTGAAGAAAATGGTTT
TTCCCATAATTTTTCTTCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTAC
CTAAATATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATT
CCAAATATTGGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAATATCATAGAAAAGAAAGA
AGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATT
TATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAA
ATGAACAAAGTTCGGAAATTAGTCCCTAGGCCGTATAATGCATCTATAATTGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAA
AGCTAGACTTGTAGCTCAAGGTTTTTGTCAAGAAGAAGATATATATTATGAAGAGACTTTTGCAACGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTT
CTTATAAAACATTCATTTTCTATCAAATGGATGTAAAATGTGCTTTTTTAAACGGTTATATTGTGGAGGAAGTTTACGTAGAACAACCTCTGGGCTTTGAAAAAGGCTCT
TTATGGCTTAAAACAAGCTCCAAGAGCTTGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTTTTCAAATCAAACAACTCAAGGA
TGACATCTTCATAAGTCAAGAAAAATACACAAGGAATTTGCTTAAGAAATTCAAATTAAATAAAGGTCAAGTTGCAAAAACTCGTATGAGCACTACCACTAAGCTTGACA
AAGATGAAAAGGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCAAATCTTTACTTTATTTGACCGCTAGTAAACCCGATATCATGTTTAGTGTATGTCTTTGT
GCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGA
GTTTAATTTGATAGGATATTCCGATGCGAATTTTGCCGGTAGTTTACTTGACCATAAAAGTACTAGTAGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTA
GTAAAAAGCAAAATTCGGTTGCTTTATCCACTACCGAAGCGGAATATATTGCGATTGCTAGTTGTTGTGCAAAAATTATTTGGATGAAACAAATTCTTTGTGATTTTGGA
TTAAAATTTGATAATGTGCCTATATTTTGTGATAATATTAGTGCCATAAATTTGACTAAGAATCCGATTCATCATTCTAGAACTAAGCATATAGATATTAGGCATCACTT
TATTAGAGAGCATGTACAAAATGGTCATATTACTCTTGAGTTTCTCTCCTGGGGCATCCTTCGATACTTATTTGTGATCCACAAGAGACTGGGAGTAGTTGTGATCTCTG
TCCATAGTAGTCATCCAGCCAGGAAGGGAAAGACCACGACACGCAGTGGAATACGAAAAAGAGCATCGAGCACAGCTTTCGTGTCACCATTGAGAAATCTGATGTTGTTA
GTGCTAGGGGAAGAAGATGGCGGTGAATTGGCGATTTTGAGGAAGAAAAAGAAGAAAAGAGAGTGTGAGAGAAGGGGAGAGGAGGGAGAAAGTAGAGAGAGAGGCAGTAG
TTAG
Protein sequenceShow/hide protein sequence
MAVSHGITNYYASIGSIRLICVYAEHLTELYVPQGVRLIRIFYGITRLTMQGLGYIDESSTPSSSKTTFVQASLIVPKLNMPNDVSNHVKSSFWLLKTHDERPIQVISFS
KKNGGMVTFGDNKKGVITKENLIINDAFKDFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKI
PNIGDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFE
MNKVRKLVPRPYNASIIGTKWVFRNKMDENGNIIRNKARLVAQGFCQEEDIYYEETFATVARLEAIRMLLAFASYKTFIFYQMDVKCAFLNGYIVEEVYVEQPLGFEKGS
LWLKTSSKSLCMHNEFEMSMMGELSFFLGFQIKQLKDDIFISQEKYTRNLLKKFKLNKGQVAKTRMSTTTKLDKDEKGKCVDIKTYRGMIKSLLYLTASKPDIMFSVCLC
ARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLIGYSDANFAGSLLDHKSTSRTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAIASCCAKIIWMKQILCDFG
LKFDNVPIFCDNISAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFLSWGILRYLFVIHKRLGVVVISVHSSHPARKGKTTTRSGIRKRASSTAFVSPLRNLMLL
VLGEEDGGELAILRKKKKKRECERRGEEGESRERGSS