CuGenDBv2

CSPI06G16080 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G16080
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr6:14501795..14504766
RNA-Seq Expression: CSPI06G16080
Synteny: CSPI06G16080
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (e value, %identity, alignment)
CAN64335.1 hypothetical protein VITISV_001808 [Vitis vinifera]; e value: 4.7e-234; %identity: 62.52
Query:  KASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSSTLIENVHLVDGLKHDLLSISQL---------------------
        K SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG+IIG+GNIGN +S+LIE+V LVDGLKH+LLSISQL                     
Subjt:  KASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSSTLIENVHLVDGLKHDLLSISQL---------------------

Query:  ---------DENVYTLDLNNYPIIDKCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQL
                  ENVY ++++ Y   D+C S +H+ SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C+ACQMGKQ K+SFK+KN IST+RPL+L
Subjt:  ---------DENVYTLDLNNYPIIDKCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQL

Query:  LHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNG
        LHMDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+ +C + G +HNF +PRT QQNG
Subjt:  LHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNG

Query:  VVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS
        VVERKNRTLQE AR+MLNE  LPKYFW EA+NT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++S
Subjt:  VVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS

Query:  KAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTR
        KA+RVFNK+T+V+EESIH  +   W N    +       KD      N K  E +P        +K       L  + ++ ++HP+D I+GNP  GV+TR
Subjt:  KAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTR

Query:  SSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFA
        SSL N+ +NLAF+SQIEP++ KDA  DE W++AMQ+ELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFA
Subjt:  SSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFA

Query:  PVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK
        PVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EE+YVEQPPG +SF+ PNHV+K
Subjt:  PVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK

KYP33441.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]; e value: 4.6e-221; %identity: 60.33
Query:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSS-TLIENVHLVDGLKHDLLSISQLD-----------------
        CL+A K   WYLDSGCSRHMTGD SK  S   KN G VT+GDN KGKI+G GN+GN SS TLIENV LVDGLKH+LLSISQL                  
Subjt:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSS-TLIENVHLVDGLKHDLLSISQLD-----------------

Query:  -------------ENVYTLDLNNYPIID--KCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT
                     +N+Y LDL +   I   KCL     + WLWHRR  H  M  ++ +S+  LV GLP  KF KDK+CDACQ GKQ K+SFKSKN ISTT
Subjt:  -------------ENVYTLDLNNYPIID--KCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT

Query:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT
        RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Subjt:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT

Query:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL
        PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE+++G+ PNI YF+VFGCKCF+LNN KE+LGKFD+K D  IFL
Subjt:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL

Query:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNES---ICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG
        GYS+ SKAYR++NK+TLV+EES+HVVFDES    + ++     +D L++   +   N+K KE           E  +E S  LPKEW+ +     D I+G
Subjt:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNES---ICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG

Query:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE
        N  +GV TRS++ N+ + +AFVSQ+EP+S  +A  DE W++AMQEELNQFERN+VW LVP P++  IIGTKWVFRNK+DE+G IIRNKARLVA+GY QEE
Subjt:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE

Query:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK
        GIDY+ETFAPVAR+EAIR+LLA++S  NF LYQMDVKSAFLNG I EEVYVEQPPG   F  PNHVYK
Subjt:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK

KYP78729.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]; e value: 6.0e-221; %identity: 59.55
Query:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSS-TLIENVHLVDGLKHDLLSISQLD-----------------
        CL+A K   WYLDSGCSRHMTGD SK  +   KN G VT+GDN KGKI+G GN+GN SS TLIENV LVDGLKH+LLSISQL                  
Subjt:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSS-TLIENVHLVDGLKHDLLSISQLD-----------------

Query:  -------------ENVYTLDLNNYPIID--KCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT
                     +N+Y LDL +   +   KCL     + WLWHRR  H  M  ++ + +  LV GLP  KF KDK+CDACQ GKQ K+SFKSKN IST+
Subjt:  -------------ENVYTLDLNNYPIID--KCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT

Query:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT
        RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Subjt:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT

Query:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL
        PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE++ GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFL
Subjt:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL

Query:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPE
        GYS+ SKAYR++NK+TLV+EES+HVVFDES N         +DL +     L+ ++  E+    +    +EK +E    LPKEW+ +     D I+GN  
Subjt:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPE

Query:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID
        +GV TRS++ N+ + +AFVSQ+EP++  +A  DE W++AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEEGID
Subjt:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID

Query:  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK
        Y+ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG+I EEVYVEQPPG   +  PNHVYK
Subjt:  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK

RVW71911.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]; e value: 1.0e-249; %identity: 62.72
Query:  MANLGLMAHSDKDDEHDDKVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSSTLIENVHLVDGLKHDLLSISQ
        +AN+  MA  D D+          SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG+IIG+GNIGN +S+LIE+V LVDGLKH+LLSISQ
Subjt:  MANLGLMAHSDKDDEHDDKVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSSTLIENVHLVDGLKHDLLSISQ

Query:  L------------------------------DENVYTLDLNNYPIIDKCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQM
        L                               ENVY ++++ Y   D+C S +H+ SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C+ACQM
Subjt:  L------------------------------DENVYTLDLNNYPIIDKCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQM

Query:  GKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFK
        GKQ K+SFK+KN IST+RPL+LLHMDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+
Subjt:  GKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFK

Query:  AFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNK
         +C ++G +HNFS+PRTPQQNGVVERKNRTLQE AR+MLNE  LPKYFW EAVNT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K
Subjt:  AFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNK

Query:  EKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKK
        + LGKFD+K+DVGIFLGYS++SKA+RVFNK+T+V+EESIHV+FDES N++       DD  LE   G L + DK ++      P  +D  +      + +
Subjt:  EKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKK

Query:  EEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRN
         E S  LPK+W++ ++HP+D I+GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRN
Subjt:  EEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRN

Query:  KMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK
        KMDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQPPG +SF+ PNHV+K
Subjt:  KMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK

XP_024024455.1 uncharacterized protein LOC112092461 [Morus notabilis]; e value: 5.8e-232; %identity: 61.83
Query:  KKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSSTLIENVHLVDGLKHDLLSISQL------------------------
        +K KW+LDSGCS+HMTGD S L +F++KNGG VTFGDN KGKI+G G++GN +S LIENV LVD LKH+LLSISQL                        
Subjt:  KKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSSTLIENVHLVDGLKHDLLSISQL------------------------

Query:  ------DENVYTLDLNNYPIIDKCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHM
               ENVYT+D+      ++CLSVL++DSWLWHRRLGHASMHL+S +S+  +VRGLP+  F+KDK+CDACQ GKQ ++SFKS   ISTTRPLQLLHM
Subjt:  ------DENVYTLDLNNYPIIDKCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHM

Query:  DLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVVE
        DLFGPSR  S GG +YAFVIVDDFSR+TWV+ +  KD+ALK+F+ F KRVQNE+G+ I+ I+SDHGGEFDNDAF+  C ENG+ HNFS+PRTPQQNGVVE
Subjt:  DLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVVE

Query:  RKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAY
        RKNR LQE ARSMLNE GLPKYFW EAVNT+CY+ NRVL+RP +DKTPYELW G+ PNIGYF+VFGCKCFILN K+ LGKFD+K+DVGIFLGYS+TSKAY
Subjt:  RKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAY

Query:  RVFNKKTLVIEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVND------KGKE----IVPSMQDVNIIEKKEEGSSS-LPKEWRYALSHPKDLIL
        RVFNK+TLV+EES+HVVFDE+ N    +S+  DD  LE    ++ +ND      K KE      P  Q+  + + K++ +S+ LPKEWRY+ SHPKD   
Subjt:  RVFNKKTLVIEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVND------KGKE----IVPSMQDVNIIEKKEEGSSS-LPKEWRYALSHPKDLIL

Query:  GNPEQGVKTRSSLNLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE
                                               ILAMQEELNQFER+ VW LVPRPS  S+IGTKWVFRNK DE+G I+RNKARLVAQG+ QEE
Subjt:  GNPEQGVKTRSSLNLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE

Query:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK
        GIDYEETFAPVARLE+IRMLLAFAS+K F LYQMDVKSAFLNGYI+EEVYVEQPPG +    P+H+++
Subjt:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK

TrEMBL top hits (e value, %identity, alignment)
A0A151QSZ9 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 2.2e-221; %identity: 60.33
Query:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSS-TLIENVHLVDGLKHDLLSISQLD-----------------
        CL+A K   WYLDSGCSRHMTGD SK  S   KN G VT+GDN KGKI+G GN+GN SS TLIENV LVDGLKH+LLSISQL                  
Subjt:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSS-TLIENVHLVDGLKHDLLSISQLD-----------------

Query:  -------------ENVYTLDLNNYPIID--KCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT
                     +N+Y LDL +   I   KCL     + WLWHRR  H  M  ++ +S+  LV GLP  KF KDK+CDACQ GKQ K+SFKSKN ISTT
Subjt:  -------------ENVYTLDLNNYPIID--KCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT

Query:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT
        RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Subjt:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT

Query:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL
        PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE+++G+ PNI YF+VFGCKCF+LNN KE+LGKFD+K D  IFL
Subjt:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL

Query:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNES---ICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG
        GYS+ SKAYR++NK+TLV+EES+HVVFDES    + ++     +D L++   +   N+K KE           E  +E S  LPKEW+ +     D I+G
Subjt:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNES---ICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG

Query:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE
        N  +GV TRS++ N+ + +AFVSQ+EP+S  +A  DE W++AMQEELNQFERN+VW LVP P++  IIGTKWVFRNK+DE+G IIRNKARLVA+GY QEE
Subjt:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE

Query:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK
        GIDY+ETFAPVAR+EAIR+LLA++S  NF LYQMDVKSAFLNG I EEVYVEQPPG   F  PNHVYK
Subjt:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK

A0A151RY83 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 1.9e-220; %identity: 59.76
Query:  KVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSS-TLIENVHLVDGLKHDLLSISQLD---------------
        K  L  +K   WYLDSGCSRHMTGD SK  S   KN G VT+GDN KGKI+G GN+GN SS TLIENV LVDGLKH+LLSISQL                
Subjt:  KVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSS-TLIENVHLVDGLKHDLLSISQLD---------------

Query:  ---------------ENVYTLDLNNYPIID--KCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVIS
                       +N+Y LDL +   +   KCL      +WLWHRR  H  M  ++ +S+  LV GLP  KF KDK+CDACQ GKQ K+SFKSKN IS
Subjt:  ---------------ENVYTLDLNNYPIID--KCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVIS

Query:  TTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSP
        T+RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+P
Subjt:  TTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSP

Query:  RTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGI
        RTPQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V NRVL+RP L KTPYE++ GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  I
Subjt:  RTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGI

Query:  FLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLV----NDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDL
        FLGYS+ SK+YR++NK+TLV+EES+HVVFDES N         +DL +     L+    N+K KE V         EK++     LPKEW+ +     D 
Subjt:  FLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLV----NDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDL

Query:  ILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYC
        I+GN  +GV TRS++ N+ + +AFVSQ+EP++  +A  DE W++AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY 
Subjt:  ILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYC

Query:  QEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK
        QEEGIDY+ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG I EEVYVEQPPG   F  PNHVYK
Subjt:  QEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK

A0A151UHG7 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 2.9e-221; %identity: 59.55
Query:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSS-TLIENVHLVDGLKHDLLSISQLD-----------------
        CL+A K   WYLDSGCSRHMTGD SK  +   KN G VT+GDN KGKI+G GN+GN SS TLIENV LVDGLKH+LLSISQL                  
Subjt:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSS-TLIENVHLVDGLKHDLLSISQLD-----------------

Query:  -------------ENVYTLDLNNYPIID--KCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT
                     +N+Y LDL +   +   KCL     + WLWHRR  H  M  ++ + +  LV GLP  KF KDK+CDACQ GKQ K+SFKSKN IST+
Subjt:  -------------ENVYTLDLNNYPIID--KCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT

Query:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT
        RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Subjt:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT

Query:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL
        PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE++ GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFL
Subjt:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL

Query:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPE
        GYS+ SKAYR++NK+TLV+EES+HVVFDES N         +DL +     L+ ++  E+    +    +EK +E    LPKEW+ +     D I+GN  
Subjt:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPE

Query:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID
        +GV TRS++ N+ + +AFVSQ+EP++  +A  DE W++AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEEGID
Subjt:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID

Query:  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK
        Y+ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG+I EEVYVEQPPG   +  PNHVYK
Subjt:  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK

A0A438GI90 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 5.1e-250; %identity: 62.72
Query:  MANLGLMAHSDKDDEHDDKVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSSTLIENVHLVDGLKHDLLSISQ
        +AN+  MA  D D+          SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG+IIG+GNIGN +S+LIE+V LVDGLKH+LLSISQ
Subjt:  MANLGLMAHSDKDDEHDDKVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSSTLIENVHLVDGLKHDLLSISQ

Query:  L------------------------------DENVYTLDLNNYPIIDKCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQM
        L                               ENVY ++++ Y   D+C S +H+ SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C+ACQM
Subjt:  L------------------------------DENVYTLDLNNYPIIDKCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQM

Query:  GKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFK
        GKQ K+SFK+KN IST+RPL+LLHMDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+
Subjt:  GKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFK

Query:  AFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNK
         +C ++G +HNFS+PRTPQQNGVVERKNRTLQE AR+MLNE  LPKYFW EAVNT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K
Subjt:  AFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNK

Query:  EKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKK
        + LGKFD+K+DVGIFLGYS++SKA+RVFNK+T+V+EESIHV+FDES N++       DD  LE   G L + DK ++      P  +D  +      + +
Subjt:  EKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKK

Query:  EEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRN
         E S  LPK+W++ ++HP+D I+GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRN
Subjt:  EEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRN

Query:  KMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK
        KMDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQPPG +SF+ PNHV+K
Subjt:  KMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK

A5C8K0 Uncharacterized protein; e value: 2.3e-234; %identity: 62.52
Query:  KASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSSTLIENVHLVDGLKHDLLSISQL---------------------
        K SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG+IIG+GNIGN +S+LIE+V LVDGLKH+LLSISQL                     
Subjt:  KASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSSTLIENVHLVDGLKHDLLSISQL---------------------

Query:  ---------DENVYTLDLNNYPIIDKCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQL
                  ENVY ++++ Y   D+C S +H+ SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C+ACQMGKQ K+SFK+KN IST+RPL+L
Subjt:  ---------DENVYTLDLNNYPIIDKCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQL

Query:  LHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNG
        LHMDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+ +C + G +HNF +PRT QQNG
Subjt:  LHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNG

Query:  VVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS
        VVERKNRTLQE AR+MLNE  LPKYFW EA+NT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++S
Subjt:  VVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS

Query:  KAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTR
        KA+RVFNK+T+V+EESIH  +   W N    +       KD      N K  E +P        +K       L  + ++ ++HP+D I+GNP  GV+TR
Subjt:  KAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTR

Query:  SSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFA
        SSL N+ +NLAF+SQIEP++ KDA  DE W++AMQ+ELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFA
Subjt:  SSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFA

Query:  PVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK
        PVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EE+YVEQPPG +SF+ PNHV+K
Subjt:  PVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK

SwissProt top hits (e value, %identity, alignment)
P04146 Copia protein; e value: 2.9e-69; %identity: 29.32
Query:  LDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKII-----GKGNIGNDSSTLIENVHLVDGLKHDLLSISQLDENVYTLD-----------------
        LDSG S H+  D S L + S +    +     K+G+ I     G   + ND    +E+V        +L+S+ +L E   +++                 
Subjt:  LDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKII-----GKGNIGNDSSTLIENVHLVDGLKHDLLSISQLDENVYTLD-----------------

Query:  ----LNNYPIID---KCLSVLHNDSW-LWHRRLGHASMHLISNISKNCLV--RGLPSFKFEKDKVCDACQMGKQTKSSFKS-KNVISTTRPLQLLHMDLF
            LNN P+I+     ++  H +++ LWH R GH S   +  I +  +   + L +      ++C+ C  GKQ +  FK  K+     RPL ++H D+ 
Subjt:  ----LNNYPIID---KCLSVLHNDSW-LWHRRLGHASMHLISNISKNCLV--RGLPSFKFEKDKVCDACQMGKQTKSSFKS-KNVISTTRPLQLLHMDLF

Query:  GPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVVERKN
        GP    +     Y  + VD F+ +    +IK+K D    F  F  + +      +  +  D+G E+ ++  + FC + G S++ + P TPQ NGV ER  
Subjt:  GPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVVERKN

Query:  RTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD--KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS-KAY
        RT+ E AR+M++   L K FW EAV TA Y+ NR+  R  +D  KTPYE+WH K P + + +VFG   ++ + K K GKFD K+   IF+GY     K +
Subjt:  RTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD--KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS-KAY

Query:  RVFNKKTLVIEESIHVVFDESWNNVSN-----ESICSDDLEKDFGDLLVNDKGKEI-------------VPSMQDVNIIEKK------------------
           N+K +V  +   VV DE+ N V++     E++   D ++       ND  K I             +  ++D    E K                  
Subjt:  RVFNKKTLVIEESIHVVFDESWNNVSN-----ESICSDDLEKDFGDLLVNDKGKEI-------------VPSMQDVNIIEKK------------------

Query:  ---------------------------------EEGSSSLPKEWRYA--LSHPKDLILGNP------------EQGVKTR---------SSLN-LFSNLA
                                         E   S  P E R +    H K++ + NP             + +KT+         +SLN +  N  
Subjt:  ---------------------------------EEGSSSLPKEWRYA--LSHPKDLILGNP------------EQGVKTR---------SSLN-LFSNLA

Query:  FVSQIEPRSFKDAECDE---FWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAI
         +    P SF + +  +    W  A+  ELN  + N  W +  RP N +I+ ++WVF  K +E GN IR KARLVA+G+ Q+  IDYEETFAPVAR+ + 
Subjt:  FVSQIEPRSFKDAECDE---FWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAI

Query:  RMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGI
        R +L+     N  ++QMDVK+AFLNG + EE+Y+  P GI
Subjt:  RMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 9.0e-79; %identity: 31.81
Query:  HDDKVCLKAS-KKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNI---GNDSSTLI-ENVHLVDGLKHDLLSISQLDENVY----
        ++++ C+  S  +++W +D+  S H T  R     +   + G V  G+    KI G G+I    N   TL+ ++V  V  L+ +L+S   LD + Y    
Subjt:  HDDKVCLKAS-KKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNI---GNDSSTLI-ENVHLVDGLKHDLLSISQLDENVY----

Query:  ---------------------TLDLNNYPIIDKCLSVLHNDSW--LWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNV
                             TL   N  I    L+   ++    LWH+R+GH S   +  ++K  L+      K    K CD C  GKQ + SF++ + 
Subjt:  ---------------------TLDLNNYPIIDKCLSVLHNDSW--LWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNV

Query:  ISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFS
              L L++ D+ GP  I S GGN Y    +DD SR  WV ++K KD   + F  F   V+ E G  + ++RSD+GGE+ +  F+ +C  +G  H  +
Subjt:  ISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFS

Query:  SPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVG
         P TPQ NGV ER NRT+ E  RSML    LPK FW EAV TACY+ NR    P   + P  +W  K  +  + KVFGC+ F    KE+  K D K+   
Subjt:  SPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVG

Query:  IFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG
        IF+GY      YR+++     +  S  VVF ES   V   +  S+ ++       V        P+  +    E  E+G    P E    +   + L  G
Subjt:  IFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG

Query:  -----NPEQGVKTRSSL----------NLFSNLAFV---SQIEPRSFKDA----ECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMD
             +P QG +    L            + +  +V      EP S K+     E ++  + AMQEE+   ++N  +KLV  P     +  KWVF+ K D
Subjt:  -----NPEQGVKTRSSL----------NLFSNLAFV---SQIEPRSFKDA----ECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMD

Query:  ENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIE
         +  ++R KARLV +G+ Q++GID++E F+PV ++ +IR +L+ A+  +  + Q+DVK+AFL+G + EE+Y+EQP G E
Subjt:  ENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIE

P92520 Uncharacterized mitochondrial protein AtMg00820; e value: 7.3e-20; %identity: 51.52
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        EP+S   A  D  W  AMQEEL+   RNK W LVP P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI + ET++PV R   IR +L  A
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 1.7e-64; %identity: 28.02
Query:  NKWYLDSGCSRHMTGDRSKLISFSKKNGG---MVTFGDNKKGKIIGKGNIGNDSSTL-IENVHLVDGLKHDLLSISQL-----------DENVYTLDLN-
        N W LDSG + H+T D + L       GG   MV  G        G  ++   S  L + N+  V  +  +L+S+ +L             +    DLN 
Subjt:  NKWYLDSGCSRHMTGDRSKLISFSKKNGG---MVTFGDNKKGKIIGKGNIGNDSSTL-IENVHLVDGLKHDLLSISQL-----------DENVYTLDLN-

Query:  --------------NYPII-DKCLSVLHNDS-----WLWHRRLGHASMHLISNISKNCLVRGL-PSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPL
                       +PI   + +S+  + S       WH RLGH +  +++++  N  +  L PS KF     C  C + K  K  F S++ I++TRPL
Subjt:  --------------NYPII-DKCLSVLHNDS-----WLWHRRLGHASMHLISNISKNCLVRGL-PSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPL

Query:  QLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQ
        + ++ D++  S I S+    Y  + VD F+R+TW+  +K K    ++FI+F   ++N     I    SD+GGEF   A   +  ++G SH  S P TP+ 
Subjt:  QLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQ

Query:  NGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYS
        NG+ ERK+R + E   ++L+   +PK +W  A   A Y+ NR L  P L  ++P++   G  PN    +VFGC C+         K D K+   +FLGYS
Subjt:  NGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYS

Query:  STSKAYRVFNKKTLVIEESIHVVFDESWNNVSN-------------ESIC-------------------------------------------SDDLEKD
         T  AY   + +T  +  S HV FDE+    SN             ES C                                           S +L+  
Subjt:  STSKAYRVFNKKTLVIEESIHVVFDESWNNVSN-------------ESIC-------------------------------------------SDDLEKD

Query:  FGDLL----------VNDKGKEIVPSM----------------------QDVNIIEKKEEGSSSLPKEWRYALSH---------------PKDLILGNPE
        F               N       P+                       Q    +    + SSS P     A S                P   I+ N  
Subjt:  FGDLL----------VNDKGKEIVPSM----------------------QDVNIIEKKEEGSSSLPKEWRYALSH---------------PKDLILGNPE

Query:  Q------GVKTRSSLNLFS-------NLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIRNKAR
        Q       + TR+   +          ++  ++ EPR+   A  DE W  AM  E+N    N  W LV P PS+ +I+G +W+F  K + +G++ R KAR
Subjt:  Q------GVKTRSSLNLFS-------NLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIRNKAR

Query:  LVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK
        LVA+GY Q  G+DY ETF+PV +  +IR++L  A  +++ + Q+DV +AFL G + ++VY+ QPPG    D PN+V K
Subjt:  LVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 4.2e-60; %identity: 26.31
Query:  NKWYLDSGCSRHMTGDRSKLISFSKKNGG---MVTFGDNKKGKIIGKGNIGNDSSTL-IENVHLVDGLKHDLLSISQL-----------DENVYTLDLN-
        N W LDSG + H+T D + L       GG   M+  G        G  ++   S +L +  V  V  +  +L+S+ +L             +    DLN 
Subjt:  NKWYLDSGCSRHMTGDRSKLISFSKKNGG---MVTFGDNKKGKIIGKGNIGNDSSTL-IENVHLVDGLKHDLLSISQL-----------DENVYTLDLN-

Query:  --------------NYPIIDK---------CLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGL-PSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT
                       +PI            C    H+    WH RLGH S+ +++++  N  +  L PS K      C  C + K  K  F S + I+++
Subjt:  --------------NYPIIDK---------CLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGL-PSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT

Query:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT
        +PL+ ++ D++  S I S     Y  + VD F+R+TW+  +K K     +FI F   V+N     I  + SD+GGEF     + +  ++G SH  S P T
Subjt:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT

Query:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFL
        P+ NG+ ERK+R + E   ++L+   +PK +W  A + A Y+ NR L  P L  ++P++   G+ PN    KVFGC C+         K + K+    F+
Subjt:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFL

Query:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDV---------------------NIIEKKEEGSSS
        GYS T  AY   +  T  +  S HV FDE     S  +      ++   D   N      +P+   V                     + +   +  SS+
Subjt:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDV---------------------NIIEKKEEGSSS

Query:  LP------------------------KEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNL----------------------------------------
        LP                        +  +   S+    IL NP     + +S N  S L                                        
Subjt:  LP------------------------KEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNL----------------------------------------

Query:  -----------------------------------AFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGN
                                           +  +  EPR+   A  D+ W  AM  E+N    N  W LV P P + +I+G +W+F  K + +G+
Subjt:  -----------------------------------AFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGN

Query:  IIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHV
        + R KARLVA+GY Q  G+DY ETF+PV +  +IR++L  A  +++ + Q+DV +AFL G + +EVY+ QPPG    D P++V
Subjt:  IIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHV

Arabidopsis top hits    e value    % identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    8.8e-29    43.51
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS
        EP ++ +A+    W  AM +E+   E    W++   P N   IG KWV++ K + +G I R KARLVA+GY Q+EGID+ ETF+PV +L +++++LA ++
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS

Query:  YKNFILYQMDVKSAFLNGYIMEEVYVEQPPG
          NF L+Q+D+ +AFLNG + EE+Y++ PPG
Subjt:  YKNFILYQMDVKSAFLNGYIMEEVYVEQPPG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein    1.0e-08    39.47
Query:  NRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKL
        NRT+ E  RSML E GLPK F  +A NTA ++ N+          P E+W   +P   Y + FGC  +I  ++ KL
Subjt:  NRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    5.2e-21    51.52
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        EP+S   A  D  W  AMQEEL+   RNK W LVP P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI + ET++PV R   IR +L  A
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA


Sequences
CDS sequence
ATGGCAAATCTTGGTCTCATGGCTCATAGTGACAAAGATGATGAACATGATGATAAGGTTTGTTTGAAAGCCTCCAAGAAAAACAAATGGTACTTGGATAGTGGTTGCTC
AAGACACATGACGGGAGACCGATCCAAGCTTATCTCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGTAAAATAATTGGTAAGGGTAATA
TAGGAAATGATTCATCTACTTTGATTGAAAATGTTCATTTGGTTGATGGTTTAAAGCATGATTTGCTTAGTATTAGTCAATTGGATGAAAATGTGTACACTCTTGATTTG
AATAATTATCCTATTATTGATAAATGTCTTTCGGTTTTGCATAATGATTCTTGGTTATGGCATAGAAGACTAGGACATGCTAGCATGCACTTAATTTCAAATATTTCTAA
AAATTGTTTGGTTAGAGGTCTTCCTAGTTTTAAATTTGAAAAAGACAAAGTTTGTGATGCTTGTCAAATGGGTAAGCAAACTAAGTCCTCTTTCAAATCTAAAAATGTGA
TCTCTACTACTAGACCCTTACAACTATTACACATGGACTTATTTGGCCCTTCTAGAATTGCTAGTTATGGAGGAAATTATTATGCTTTTGTGATAGTTGATGATTTTTCT
AGATTTACTTGGGTTTTGATGATAAAACATAAGGATGATGCTTTGAAAAGTTTTATTAGTTTTGCAAAAAGAGTACAAAATGAAAAAGGATTTTTTATTTCCAAAATTAG
GAGTGATCACGGAGGAGAATTTGATAATGATGCTTTTAAAGCTTTTTGTGAAGAAAATGGTTTTTCCCATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTAG
TTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTACCTAAATATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCA
AATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATTCCAAATATTGGGTATTTCAAAGTTTTCGGTTGTAAATGTTTTATTTT
GAATAACAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGATGTTGGTATTTTTCTAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAG
TTATTGAGGAATCTATTCATGTTGTATTTGATGAATCTTGGAATAATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAAT
GACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAACATCATAGAAAAGAAAGAAGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCC
CAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAG
ATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAAGAAACAAAGTTTGGAAATTAGTCCCTAGGCCTTCTAATGCATCTATAATC
GGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGA
AGAGACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTTCTTATAAAAATTTCATTTTGTATCAAATGGATGTAAAAAGTGCTTTTTTAA
ACGGTTATATTATGGAGGAAGTTTACGTAGAACAACCTCCGGGCATTGAAAGTTTTGATTTACCTAATCATGTTTATAAGTAG
mRNA sequence
ATGGCAAATCTTGGTCTCATGGCTCATAGTGACAAAGATGATGAACATGATGATAAGGTTTGTTTGAAAGCCTCCAAGAAAAACAAATGGTACTTGGATAGTGGTTGCTC
AAGACACATGACGGGAGACCGATCCAAGCTTATCTCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGTAAAATAATTGGTAAGGGTAATA
TAGGAAATGATTCATCTACTTTGATTGAAAATGTTCATTTGGTTGATGGTTTAAAGCATGATTTGCTTAGTATTAGTCAATTGGATGAAAATGTGTACACTCTTGATTTG
AATAATTATCCTATTATTGATAAATGTCTTTCGGTTTTGCATAATGATTCTTGGTTATGGCATAGAAGACTAGGACATGCTAGCATGCACTTAATTTCAAATATTTCTAA
AAATTGTTTGGTTAGAGGTCTTCCTAGTTTTAAATTTGAAAAAGACAAAGTTTGTGATGCTTGTCAAATGGGTAAGCAAACTAAGTCCTCTTTCAAATCTAAAAATGTGA
TCTCTACTACTAGACCCTTACAACTATTACACATGGACTTATTTGGCCCTTCTAGAATTGCTAGTTATGGAGGAAATTATTATGCTTTTGTGATAGTTGATGATTTTTCT
AGATTTACTTGGGTTTTGATGATAAAACATAAGGATGATGCTTTGAAAAGTTTTATTAGTTTTGCAAAAAGAGTACAAAATGAAAAAGGATTTTTTATTTCCAAAATTAG
GAGTGATCACGGAGGAGAATTTGATAATGATGCTTTTAAAGCTTTTTGTGAAGAAAATGGTTTTTCCCATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTAG
TTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTACCTAAATATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCA
AATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATTCCAAATATTGGGTATTTCAAAGTTTTCGGTTGTAAATGTTTTATTTT
GAATAACAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGATGTTGGTATTTTTCTAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAG
TTATTGAGGAATCTATTCATGTTGTATTTGATGAATCTTGGAATAATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAAT
GACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAACATCATAGAAAAGAAAGAAGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCC
CAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAG
ATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAAGAAACAAAGTTTGGAAATTAGTCCCTAGGCCTTCTAATGCATCTATAATC
GGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGA
AGAGACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTTCTTATAAAAATTTCATTTTGTATCAAATGGATGTAAAAAGTGCTTTTTTAA
ACGGTTATATTATGGAGGAAGTTTACGTAGAACAACCTCCGGGCATTGAAAGTTTTGATTTACCTAATCATGTTTATAAGTAG
Protein sequence
MANLGLMAHSDKDDEHDDKVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGKIIGKGNIGNDSSTLIENVHLVDGLKHDLLSISQLDENVYTLDL
NNYPIIDKCLSVLHNDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFS
RFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVS
NRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVN
DKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASII
GTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIMEEVYVEQPPGIESFDLPNHVYK