; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G25980 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G25980
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr2:22077758..22081117
RNA-Seq ExpressionCSPI02G25980
SyntenyCSPI02G25980
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN63777.1 hypothetical protein VITISV_043745 [Vitis vinifera]2.7e-17668.67Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE
        MLNE  LPKYFWAE VNT+CYV +R+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++SK +RVFNK+T+V+EE
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE

Query:  SIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKE--------------IVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKT
        SIHV+F ES N++       DD  LE   G L + DK ++               +P  Q V     + E S  LPK+W++ ++HP+D I+GNP  GV+T
Subjt:  SIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKE--------------IVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKT

Query:  RSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETF
        RSSL N+ +NLAF+ QIEP++ KDA  DE W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QE  IDYEETF
Subjt:  RSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETF

Query:  APVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIK
        APVARLEAIRMLLAFA +K+FILYQMDVKS FLNG+I EEVY EQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFL +  FKMGKID TLFIK
Subjt:  APVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIK

Query:  VKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG
         K  DML+VQIYVDDIIFG+TN SLCE+FSKCMH+EFEMSMMGEL+FFLG
Subjt:  VKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG

CAN64335.1 hypothetical protein VITISV_001808 [Vitis vinifera]3.1e-18058.85Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE
        MLNE  LPKYFWAEA+NT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++SKA+RVFNK+T+V+EE
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE

Query:  SIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQ
        SIH  +   W N    +       KD      N K  E +P        +K       L  + ++ ++HP+D I+GNP  GV+TRSSL N+ +NLAF+SQ
Subjt:  SIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQ

Query:  IEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        IEP++ KDA  DE W++AMQ+ELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA
Subjt:  IEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA

Query:  SYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDI
         +K+FILYQMDVKSAFLNG+I EE+YVEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K NDML+VQIYVDDI
Subjt:  SYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDI

Query:  IFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGKVAKTPMSTTTKLDKDEK------------------------------------------------
         FG+TN SLCE+FSKCMH ++   ++   +    KV KTPMS++ KLD DEK                                                
Subjt:  IFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGKVAKTPMSTTTKLDKDEK------------------------------------------------

Query:  --------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAKYIA
                                  G+SDADFAG  ++RKSTSGTC  LG SLVSW SKKQNS+ALST EA+Y A
Subjt:  --------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAKYIA

KYP38726.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]8.1e-17353.93Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE
        MLNE  LPKYFWA+A+NTAC+V N+VL+RP L KTPYE++ GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFLGYS+ SKAYR++NK+TLV+E
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE

Query:  ESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVS
        ES+HVVFDES N         +DL +     L+ ++  E VP   + +  EK +E    LPKEW+ +     D I+GN  +GV TRS++ N+ + +AFVS
Subjt:  ESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVS

Query:  QIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAF
        Q+EP+S  +A  DE W++AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEEGIDY+ETFAPVAR+EAIR+LLA+
Subjt:  QIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAF

Query:  ASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDD
        +S KNF LYQMDVKSAFLNG I EEVYVEQPPGF  F  PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK+DNTLF+K   ND + VQIYVDD
Subjt:  ASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDD

Query:  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-----------------
        I+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLG                              K A TP+S    LD DEK                 
Subjt:  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-----------------

Query:  ---------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALST
                                                                 GYSD+D+AG  LDRKSTSGTC  LGS+LVSW SKKQ  VALST
Subjt:  ---------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALST

Query:  TEAKYIADAQVTPRLARQASKLR
         EA+YIA      ++     +LR
Subjt:  TEAKYIADAQVTPRLARQASKLR

KYP78729.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]4.7e-17353.29Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE
        MLNE  LPKYFWA+A+NTAC+V N+VL+RP L KTPYE++ GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFLGYS+ SKAYR++NK+TLV+E
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE

Query:  ESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVS
        ES+HVVFDES N         +DL +     L+ ++  E+    +    +EK +E    LPKEW+ +     D I+GN  +GV TRS++ N+ + +AFVS
Subjt:  ESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVS

Query:  QIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAF
        Q+EP++  +A  DE W++AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEEGIDY+ETFAPVAR+EAIR+LLA+
Subjt:  QIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAF

Query:  ASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDD
        +S KNF LYQMDVKSAFLNG+I EEVYVEQPPGF  +  PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK+DNTLF+K   ND + VQIYVDD
Subjt:  ASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDD

Query:  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-----------------
        I+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLG                              K A TP+S    LD DEK                 
Subjt:  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-----------------

Query:  ---------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALST
                                                                 GYSD+D+AG  LDRKSTSGTC  LGS+LVSW SKKQ  VALST
Subjt:  ---------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALST

Query:  TEAKYIADAQVTPRLARQASKLR
         EA+YIA      ++     +LR
Subjt:  TEAKYIADAQVTPRLARQASKLR

RVW71911.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]8.6e-19959.97Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE
        MLNE  LPKYFWAEAVNT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++SKA+RVFNK+T+V+EE
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE

Query:  SIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-
        SIHV+FDES N++       DD  LE   G L + DK ++      P  +D  +      + + E S  LPK+W++ ++HP+D I+GNP  GV+TRSSL 
Subjt:  SIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-

Query:  NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVAR
        N+ +NLAF+SQIEP++ KDA  DE W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAPVAR
Subjt:  NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVAR

Query:  LEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNND
        LEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K  D
Subjt:  LEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNND

Query:  MLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-------
        ML+VQIYVDDIIFG+TN SLCE+FSKCMH+EFEMSMMGEL++FLG                              KV KTPMS++ KLD DEK       
Subjt:  MLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-------

Query:  -------------------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFS
                                                                           G+SDADFAG  ++RKSTSGTC FLG SLVSW S
Subjt:  -------------------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFS

Query:  KKQNSVALSTTEAKYIA
        KKQNSVALST EA+YIA
Subjt:  KKQNSVALSTTEAKYIA

TrEMBL top hitse value%identityAlignment
A0A151R893 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-17353.93Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE
        MLNE  LPKYFWA+A+NTAC+V N+VL+RP L KTPYE++ GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFLGYS+ SKAYR++NK+TLV+E
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE

Query:  ESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVS
        ES+HVVFDES N         +DL +     L+ ++  E VP   + +  EK +E    LPKEW+ +     D I+GN  +GV TRS++ N+ + +AFVS
Subjt:  ESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVS

Query:  QIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAF
        Q+EP+S  +A  DE W++AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEEGIDY+ETFAPVAR+EAIR+LLA+
Subjt:  QIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAF

Query:  ASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDD
        +S KNF LYQMDVKSAFLNG I EEVYVEQPPGF  F  PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK+DNTLF+K   ND + VQIYVDD
Subjt:  ASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDD

Query:  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-----------------
        I+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLG                              K A TP+S    LD DEK                 
Subjt:  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-----------------

Query:  ---------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALST
                                                                 GYSD+D+AG  LDRKSTSGTC  LGS+LVSW SKKQ  VALST
Subjt:  ---------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALST

Query:  TEAKYIADAQVTPRLARQASKLR
         EA+YIA      ++     +LR
Subjt:  TEAKYIADAQVTPRLARQASKLR

A0A151UHG7 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-17353.29Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE
        MLNE  LPKYFWA+A+NTAC+V N+VL+RP L KTPYE++ GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFLGYS+ SKAYR++NK+TLV+E
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE

Query:  ESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVS
        ES+HVVFDES N         +DL +     L+ ++  E+    +    +EK +E    LPKEW+ +     D I+GN  +GV TRS++ N+ + +AFVS
Subjt:  ESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVS

Query:  QIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAF
        Q+EP++  +A  DE W++AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEEGIDY+ETFAPVAR+EAIR+LLA+
Subjt:  QIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAF

Query:  ASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDD
        +S KNF LYQMDVKSAFLNG+I EEVYVEQPPGF  +  PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK+DNTLF+K   ND + VQIYVDD
Subjt:  ASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDD

Query:  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-----------------
        I+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLG                              K A TP+S    LD DEK                 
Subjt:  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-----------------

Query:  ---------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALST
                                                                 GYSD+D+AG  LDRKSTSGTC  LGS+LVSW SKKQ  VALST
Subjt:  ---------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALST

Query:  TEAKYIADAQVTPRLARQASKLR
         EA+YIA      ++     +LR
Subjt:  TEAKYIADAQVTPRLARQASKLR

A0A438GI90 Retrovirus-related Pol polyprotein from transposon TNT 1-944.2e-19959.97Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE
        MLNE  LPKYFWAEAVNT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++SKA+RVFNK+T+V+EE
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE

Query:  SIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-
        SIHV+FDES N++       DD  LE   G L + DK ++      P  +D  +      + + E S  LPK+W++ ++HP+D I+GNP  GV+TRSSL 
Subjt:  SIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-

Query:  NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVAR
        N+ +NLAF+SQIEP++ KDA  DE W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAPVAR
Subjt:  NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVAR

Query:  LEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNND
        LEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K  D
Subjt:  LEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNND

Query:  MLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-------
        ML+VQIYVDDIIFG+TN SLCE+FSKCMH+EFEMSMMGEL++FLG                              KV KTPMS++ KLD DEK       
Subjt:  MLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTTTKLDKDEK-------

Query:  -------------------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFS
                                                                           G+SDADFAG  ++RKSTSGTC FLG SLVSW S
Subjt:  -------------------------------------------------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFS

Query:  KKQNSVALSTTEAKYIA
        KKQNSVALST EA+YIA
Subjt:  KKQNSVALSTTEAKYIA

A5BS59 Uncharacterized protein1.3e-17668.67Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE
        MLNE  LPKYFWAE VNT+CYV +R+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++SK +RVFNK+T+V+EE
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE

Query:  SIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKE--------------IVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKT
        SIHV+F ES N++       DD  LE   G L + DK ++               +P  Q V     + E S  LPK+W++ ++HP+D I+GNP  GV+T
Subjt:  SIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKE--------------IVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKT

Query:  RSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETF
        RSSL N+ +NLAF+ QIEP++ KDA  DE W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QE  IDYEETF
Subjt:  RSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETF

Query:  APVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIK
        APVARLEAIRMLLAFA +K+FILYQMDVKS FLNG+I EEVY EQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFL +  FKMGKID TLFIK
Subjt:  APVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIK

Query:  VKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG
         K  DML+VQIYVDDIIFG+TN SLCE+FSKCMH+EFEMSMMGEL+FFLG
Subjt:  VKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG

A5C8K0 Uncharacterized protein1.5e-18058.85Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE
        MLNE  LPKYFWAEA+NT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++SKA+RVFNK+T+V+EE
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE

Query:  SIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQ
        SIH  +   W N    +       KD      N K  E +P        +K       L  + ++ ++HP+D I+GNP  GV+TRSSL N+ +NLAF+SQ
Subjt:  SIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSL-NLFSNLAFVSQ

Query:  IEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        IEP++ KDA  DE W++AMQ+ELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGIDYEETFAPVARLEAIRMLLAFA
Subjt:  IEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA

Query:  SYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDI
         +K+FILYQMDVKSAFLNG+I EE+YVEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K NDML+VQIYVDDI
Subjt:  SYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDI

Query:  IFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGKVAKTPMSTTTKLDKDEK------------------------------------------------
         FG+TN SLCE+FSKCMH ++   ++   +    KV KTPMS++ KLD DEK                                                
Subjt:  IFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGKVAKTPMSTTTKLDKDEK------------------------------------------------

Query:  --------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAKYIA
                                  G+SDADFAG  ++RKSTSGTC  LG SLVSW SKKQNS+ALST EA+Y A
Subjt:  --------------------------GYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAKYIA

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-5530.71Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLD--KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS-KAYRVFNKKTLV
        M++   L K FW EAV TA Y+ NR+  R  +D  KTPYE+WH K P + + +VFG   ++ + K K GKFD K+   IF+GY     K +   N+K +V
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLD--KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS-KAYRVFNKKTLV

Query:  IEESIHVVFDESWNNVSN-----ESICSDDLEKDFGDLLVNDKGKEI-------------VPSMQDVNIIEKK---------------------------
          +   VV DE+ N V++     E++   D ++       ND  K I             +  ++D    E K                           
Subjt:  IEESIHVVFDESWNNVSN-----ESICSDDLEKDFGDLLVNDKGKEI-------------VPSMQDVNIIEKK---------------------------

Query:  ------------------------EEGSSSLPKEWRYA--LSHPKDLILGNP------------EQGVKTR---------SSLN-LFSNLAFVSQIEPRS
                                E   S  P E R +    H K++ + NP             + +KT+         +SLN +  N   +    P S
Subjt:  ------------------------EEGSSSLPKEWRYA--LSHPKDLILGNP------------EQGVKTR---------SSLN-LFSNLAFVSQIEPRS

Query:  FKDAECDE---FWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASY
        F + +  +    W  A+  ELN  + N  W +  RP N +I+ ++WVF  K +E GN IR KARLVA+G+ Q+  IDYEETFAPVAR+ + R +L+    
Subjt:  FKDAECDE---FWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASY

Query:  KNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKN--NDMLIVQIYVDDI
         N  ++QMDVK+AFLNG + EE+Y+  P G       ++V KL KA+YGLKQA R W++   + L E +F    +D  ++I  K   N+ + V +YVDD+
Subjt:  KNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKN--NDMLIVQIYVDDI

Query:  IFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG
        +  + + +    F + +  +F M+ + E+  F+G
Subjt:  IFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-5628.3Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE
        ML    LPK FW EAV TACY+ NR    P   + P  +W  K  +  + KVFGC+ F    KE+  K D K+   IF+GY      YR+++     +  
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEE

Query:  SIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG-----NPEQGVKTRSSL-------
        S  VVF ES   V   +  S+ ++       V        P+  +    E  E+G    P E    +   + L  G     +P QG +    L       
Subjt:  SIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG-----NPEQGVKTRSSL-------

Query:  ---NLFSNLAFV---SQIEPRSFKDA----ECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID
             + +  +V      EP S K+     E ++  + AMQEE+   ++N  +KLV  P     +  KWVF+ K D +  ++R KARLV +G+ Q++GID
Subjt:  ---NLFSNLAFV---SQIEPRSFKDA----ECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID

Query:  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDN
        ++E F+PV ++ +IR +L+ A+  +  + Q+DVK+AFL+G + EE+Y+EQP GFE     + V KL K+LYGLKQAPR WY +   F+    +     D 
Subjt:  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDN

Query:  TLFIK-VKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG--------------------------------KVAKTPMSTTTK
         ++ K    N+ +I+ +YVDD++    +  L  +    +   F+M  +G     LG                                K   TP++   K
Subjt:  TLFIK-VKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG--------------------------------KVAKTPMSTTTK

Query:  LDKDE--------------------------------------------------------------------------------KGYSDADFAGSLLDR
        L K                                                                                  KGY+DAD AG + +R
Subjt:  LDKDE--------------------------------------------------------------------------------KGYSDADFAGSLLDR

Query:  KSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAKYIA
        KS++G         +SW SK Q  VALSTTEA+YIA
Subjt:  KSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAKYIA

P92520 Uncharacterized mitochondrial protein AtMg008207.4e-2051.52Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        EP+S   A  D  W  AMQEEL+   RNK W LVP P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI + ET++PV R   IR +L  A
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-5327.11Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE
        +L+   +PK +W  A   A Y+ NR L  P L  ++P++   G  PN    +VFGC C+         K D K+   +FLGYS T  AY   + +T  + 
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIE

Query:  ESIHVVFDESWNNVSN-------------ESIC-------------------------------------------SDDLEKDFGDLL----------VN
         S HV FDE+    SN             ES C                                           S +L+  F               N
Subjt:  ESIHVVFDESWNNVSN-------------ESIC-------------------------------------------SDDLEKDFGDLL----------VN

Query:  DKGKEIVPSM----------------------QDVNIIEKKEEGSSSLPKEWRYALSH---------------PKDLILGNPEQ------GVKTRSSLNL
               P+                       Q    +    + SSS P     A S                P   I+ N  Q       + TR+   +
Subjt:  DKGKEIVPSM----------------------QDVNIIEKKEEGSSSLPKEWRYALSH---------------PKDLILGNPEQ------GVKTRSSLNL

Query:  FS-------NLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEET
                  ++  ++ EPR+   A  DE W  AM  E+N    N  W LV P PS+ +I+G +W+F  K + +G++ R KARLVA+GY Q  G+DY ET
Subjt:  FS-------NLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEET

Query:  FAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFI
        F+PV +  +IR++L  A  +++ + Q+DV +AFL G + ++VY+ QPPGF   D PN+V KL+KALYGLKQAPRAWY  L  +LL   F     D +LF+
Subjt:  FAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFI

Query:  KVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGKVAK------------------------------TPMSTTTKL------
          +   ++ + +YVDDI+    + +L       +   F +    EL +FLG  AK                              TPM+ + KL      
Subjt:  KVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGKVAK------------------------------TPMSTTTKL------

Query:  ---DKDE----------------------------------------------------------KG-------YSDADFAGSLLDRKSTSGTCQFLGSS
           D  E                                                          KG       YSDAD+AG   D  ST+G   +LG  
Subjt:  ---DKDE----------------------------------------------------------KG-------YSDADFAGSLLDRKSTSGTCQFLGSS

Query:  LVSWFSKKQNSVALSTTEAKYIADAQVTPRLARQASKLRQLKHRI
         +SW SKKQ  V  S+TEA+Y + A  +  +    S L +L  R+
Subjt:  LVSWFSKKQNSVALSTTEAKYIADAQVTPRLARQASKLRQLKHRI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-4732.08Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        EPR+   A  D+ W  AM  E+N    N  W LV P P + +I+G +W+F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV +  +IR++L  A
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA

Query:  SYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDI
          +++ + Q+DV +AFL G + +EVY+ QPPGF   D P++V +L+KA+YGLKQAPRAWY  L  +LL   F     D +LF+  +   ++ + +YVDDI
Subjt:  SYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDI

Query:  IFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGKVAK------------------------------TPMSTTTKL---------DKDE----------
        +    ++ L +     +   F +    +L +FLG  AK                              TPM+T+ KL         D  E          
Subjt:  IFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGKVAK------------------------------TPMSTTTKL---------DKDE----------

Query:  ------------------------------------------------KG-------YSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTT
                                                        KG       YSDAD+AG   D  ST+G   +LG   +SW SKKQ  V  S+T
Subjt:  ------------------------------------------------KG-------YSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTT

Query:  EAKYIADAQVTPRLARQASKLRQL
        EA+Y + A  +  L    S L +L
Subjt:  EAKYIADAQVTPRLARQASKLRQL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.7e-4829.39Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS
        EP ++ +A+    W  AM +E+   E    W++   P N   IG KWV++ K + +G I R KARLVA+GY Q+EGID+ ETF+PV +L +++++LA ++
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS

Query:  YKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFD----LPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYV
          NF L+Q+D+ +AFLNG + EE+Y++ PPG+ +       PN V  LKK++YGLKQA R W+ + S  L+   F     D+T F+K+     L V +YV
Subjt:  YKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFD----LPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYV

Query:  DDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTT-----------------------
        DDII  S N +  +E    + + F++  +G L +FLG                              K +  PM  +                       
Subjt:  DDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLG------------------------------KVAKTPMSTT-----------------------

Query:  ------TKLD------------------------------KDEKG---------------YSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVAL
              T+LD                              K   G               +SDA F      R+ST+G C FLG+SL+SW SKKQ  V+ 
Subjt:  ------TKLD------------------------------KDEKG---------------YSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVAL

Query:  STTEAKYIADAQVTPRLARQASKLRQLKHRIPSAASLVARVSSFLFVEAETQPSHPLFVIRLQPLHLKPLCHS
        S+ EA+Y A +  T  +   A   R+L+  +     L    ++ + +           V   +  H++  CHS
Subjt:  STTEAKYIADAQVTPRLARQASKLRQLKHRIPSAASLVARVSSFLFVEAETQPSHPLFVIRLQPLHLKPLCHS

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.2e-0637.88Show/hide
Query:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKL
        ML E GLPK F A+A NTA ++ N+          P E+W   +P   Y + FGC  +I  ++ KL
Subjt:  MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKL

ATMG00810.1 DNA/RNA polymerases superfamily protein1.4e-0536.84Show/hide
Query:  KGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAKYIADAQVTPRLARQASKLRQLKHRIPSA
        + + D+D+AG    R+ST+G C FLG +++SW +K+Q +V+ S+TE +Y A A     L   ++     + R PSA
Subjt:  KGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAKYIADAQVTPRLARQASKLRQLKHRIPSA

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.3e-2151.52Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        EP+S   A  D  W  AMQEEL+   RNK W LVP P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI + ET++PV R   IR +L  A
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAATGAGTATGGTTTACCTAAATATTTTTGGGCGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTA
TGAACTTTGGCATGGAAAAATTCCAAATATTGGGTATTTCAAAGTTTTCGGTTGTAAATGTTTTATTTTGAATAATAAAGAAAAACTTGGAAAATTTGATTCTAAGACGG
ATGTTGGTATTTTTCTAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTATTGAGGAATCTATTCATGTTGTATTTGATGAATCTTGG
AATAATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAA
TATCATAGAAAAGAAAGAAGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAA
CTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAA
GAATTAAATCAATTTGAAAGAAACAAAGTTTGGAAATTAGTCCCTAGGCCTTCTAATGCATCTATAATCGGAACAAAATGGGTTTTTAGAAATAAGATGGATGAAAATGG
AAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAA
TGTTGCTTGCTTTTGCTTCTTATAAAAATTTCATTTTGTATCAAATGGATGTAAAAAGTGCTTTTTTAAACGGTTATATTGTAGAGGAAGTTTACGTAGAACAACCTCCG
GGCTTTGAAAGTTTTGATTTACCTAATCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACT
TGAGAATGACTTTAAGATGGGAAAAATTGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTA
CTAATTCATCTTTGTGTGAAGAATTTTCCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTAAAGTTGCAAAAACTCCTATG
AGCACTACCACTAAGCTTGACAAAGATGAAAAAGGATATTCCGATGCGGACTTTGCCGGTAGTTTACTTGACCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAG
TTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTTATCCACTACCGAAGCGAAATATATTGCGGATGCTCAAGTCACACCGCGTCTTGCAAGACAAGCAA
GCAAACTTCGGCAACTCAAGCACCGCATCCCGAGCGCTGCATCACTGGTCGCTCGTGTTTCGTCTTTCCTGTTCGTTGAAGCCGAAACCCAACCGTCGCATCCTTTGTTC
GTGATTCGTCTCCAACCGTTGCACCTCAAGCCGCTGTGTCACTCACACTTTGAGTTCGACCAAAGCCCAGTCGAAGCTACCAATCGTTCGTGGTTCTTCTTCCACGTTCG
AACTTCCCGCACCGACTCTCTACTAGCTGCTAGTTACAATTCGACCCACGCCCAAATCAATTGCACTTTCTTTTTCGCATTGTCTTCGAAGACGCATGTGCCTTGGTGTG
AACTATTGGGTTTGTGTTTGTCAACGAGCGAATCCTTTGAAATTCTCTCTAGTTTGGAGTGGGTTCGTTCCTCCACTAAATGGGTTGTAATTGTTGAAGTTAGTTCGACC
AAGGGGTTTGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAATGAGTATGGTTTACCTAAATATTTTTGGGCGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTA
TGAACTTTGGCATGGAAAAATTCCAAATATTGGGTATTTCAAAGTTTTCGGTTGTAAATGTTTTATTTTGAATAATAAAGAAAAACTTGGAAAATTTGATTCTAAGACGG
ATGTTGGTATTTTTCTAGGATATTCCTCTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTATTGAGGAATCTATTCATGTTGTATTTGATGAATCTTGG
AATAATGTTTCTAATGAGTCTATTTGTAGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAA
TATCATAGAAAAGAAAGAAGAGGGTTCTTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAA
CTCGTTCTTCTCTTAATTTATTTAGTAATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAA
GAATTAAATCAATTTGAAAGAAACAAAGTTTGGAAATTAGTCCCTAGGCCTTCTAATGCATCTATAATCGGAACAAAATGGGTTTTTAGAAATAAGATGGATGAAAATGG
AAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAA
TGTTGCTTGCTTTTGCTTCTTATAAAAATTTCATTTTGTATCAAATGGATGTAAAAAGTGCTTTTTTAAACGGTTATATTGTAGAGGAAGTTTACGTAGAACAACCTCCG
GGCTTTGAAAGTTTTGATTTACCTAATCATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACT
TGAGAATGACTTTAAGATGGGAAAAATTGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTA
CTAATTCATCTTTGTGTGAAGAATTTTCCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTAAAGTTGCAAAAACTCCTATG
AGCACTACCACTAAGCTTGACAAAGATGAAAAAGGATATTCCGATGCGGACTTTGCCGGTAGTTTACTTGACCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAG
TTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTTATCCACTACCGAAGCGAAATATATTGCGGATGCTCAAGTCACACCGCGTCTTGCAAGACAAGCAA
GCAAACTTCGGCAACTCAAGCACCGCATCCCGAGCGCTGCATCACTGGTCGCTCGTGTTTCGTCTTTCCTGTTCGTTGAAGCCGAAACCCAACCGTCGCATCCTTTGTTC
GTGATTCGTCTCCAACCGTTGCACCTCAAGCCGCTGTGTCACTCACACTTTGAGTTCGACCAAAGCCCAGTCGAAGCTACCAATCGTTCGTGGTTCTTCTTCCACGTTCG
AACTTCCCGCACCGACTCTCTACTAGCTGCTAGTTACAATTCGACCCACGCCCAAATCAATTGCACTTTCTTTTTCGCATTGTCTTCGAAGACGCATGTGCCTTGGTGTG
AACTATTGGGTTTGTGTTTGTCAACGAGCGAATCCTTTGAAATTCTCTCTAGTTTGGAGTGGGTTCGTTCCTCCACTAAATGGGTTGTAATTGTTGAAGTTAGTTCGACC
AAGGGGTTTGCATAA
Protein sequenceShow/hide protein sequence
MLNEYGLPKYFWAEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESW
NNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNLAFVSQIEPRSFKDAECDEFWILAMQE
ELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPP
GFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGKVAKTPM
STTTKLDKDEKGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAKYIADAQVTPRLARQASKLRQLKHRIPSAASLVARVSSFLFVEAETQPSHPLF
VIRLQPLHLKPLCHSHFEFDQSPVEATNRSWFFFHVRTSRTDSLLAASYNSTHAQINCTFFFALSSKTHVPWCELLGLCLSTSESFEILSSLEWVRSSTKWVVIVEVSST
KGFA