; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G08510 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G08510
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr7:6330788..6334401
RNA-Seq ExpressionCSPI07G08510
SyntenyCSPI07G08510
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN64335.1 hypothetical protein VITISV_001808 [Vitis vinifera]0.0e+0061.85Show/hide
Query:  DLVPVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI-----MMLLKIFVKKMVFPIIFSLQELLNK--MVWLKGKIV
        +++    K SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG I  +  I      ++  + +   +   + S+ +L NK   V  +    
Subjt:  DLVPVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI-----MMLLKIFVKKMVFPIIFSLQELLNK--MVWLKGKIV

Query:  LCKNLLDH-----GNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVIS
        + K++ +      G+R ENVY ++++ Y   D+C S  HD SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C+ACQMGKQ K+SFK+KN IS
Subjt:  LCKNLLDH-----GNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVIS

Query:  TTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSP
        T+RPL+LLHMDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+ +C + G +HNF +P
Subjt:  TTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSP

Query:  RTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIF
        RT QQNGVVERKNRTLQE AR+MLNE  LPKYFW EA+NT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIF
Subjt:  RTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIF

Query:  LGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNP
        LGYS++SKA+RVFNK+T+V+EESIH  +   W N    +       KD      N K  E +P        +K       L  + ++ ++HP+D I+GNP
Subjt:  LGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNP

Query:  EQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI
          GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AMQ+ELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGI
Subjt:  EQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI

Query:  DYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKID
        DYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EE+YVEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID
Subjt:  DYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKID

Query:  NTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLD
         TLFIK K NDML+VQIYVDDI FG+TN SLCE+FSKCMH                               KY +DLLK+F + E KV KTPMS++ KLD
Subjt:  NTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLD

Query:  KDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQ
         DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCLCARFQSCPKESH  AVKRIL+YL GT+ +GLWYP+   F L+G+SDADFAG  ++RKSTSGTC 
Subjt:  KDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQ

Query:  FLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ
         LG SLVSW SKKQNS+ALST EAEY A + C AQILWMKQ
Subjt:  FLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ

KYP33441.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]0.0e+0062.21Show/hide
Query:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC-
        CL+A K   WYLDSGCSRHMTGD SK  S   KN G VT+GDN KG I         + + LI  +L +   K     I  L +   K+ +     ++C 
Subjt:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC-

Query:  ---KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT
           K +   G R +N+Y LDL +   I   KCL    ++ WLWHRR  H  M  ++ +S+  LV GLP  KF KDK+CDACQ GKQ K+SFKSKN ISTT
Subjt:  ---KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT

Query:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT
        RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Subjt:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT

Query:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL
        PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE+++G+ PNI YF+VFGCKCF+LNN KE+LGKFD+K D  IFL
Subjt:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL

Query:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNES---ICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG
        GYS+ SKAYR++NK+TLV+EES+HVVFDES    + ++     +D L++   +   N+K KE           E  +E S  LPKEW+ +     D I+G
Subjt:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNES---ICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG

Query:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE
        N  +GV TRS++ N+ + +AFVSQ+EP+S  +A  DE W++AMQEELNQFERN+VW LVP P++  IIGTKWVFRNK+DE+G IIRNKARLVA+GY QEE
Subjt:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE

Query:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGK
        GIDY+ETFAPVAR+EAIR+LLA++S  NF LYQMDVKSAFLNG I EEVYVEQPPGF  F  PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END+  GK
Subjt:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGK

Query:  IDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTK
        +DNTLF+K   ND + VQIYVDDI+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DGIFISQ KY  +LLKKF +   K   TP+S    
Subjt:  IDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTK

Query:  LDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT
        LD DEKG  VD   YRG+IGSLLYLTASRPDIMF VCLCARFQ+ PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+D+AG  LDRKSTSGT
Subjt:  LDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT

Query:  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMK
        C  LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMK
Subjt:  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMK

KYP47407.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]0.0e+0062.14Show/hide
Query:  LKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC--
        L  +K   WYLDSGCSRHMTGD SK  S   KN G VT+GDN KG I         + + LI  +L +   K     I  L +   K+ +     ++C  
Subjt:  LKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC--

Query:  --KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTR
          K +   G R +N+Y LDL +   +   KCL    + +WLWHRR  H  M  ++ +S+  LV GLP  KF KDK+CDACQ GKQ K+SFKSKN IST+R
Subjt:  --KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTR

Query:  PLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTP
        PLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRTP
Subjt:  PLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTP

Query:  QQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLG
        QQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V NRVL+RP L KTPYE++ GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFLG
Subjt:  QQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLG

Query:  YSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLV----NDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG
        YS+ SK+YR++NK+TLV+EES+HVVFDES N         +DL +     L+    N+K KE V         EK++     LPKEW+ +     D I+G
Subjt:  YSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLV----NDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG

Query:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE
        N  +GV TRS++ N+ + +AFVSQ+EP++  +A  DE W++AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEE
Subjt:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE

Query:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGK
        GIDY+ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG I EEVYVEQPPGF  F  PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK
Subjt:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGK

Query:  IDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTK
        +DNTLF+K   ND + VQIYVDDI+FGSTNSSLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DG FISQ KY  +LLKKF +   K A TP+S    
Subjt:  IDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTK

Query:  LDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT
        LD DEKG  VD   YRG+IGSLLYLTASRPDIMF+VCLCARFQ+ PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+DFAG  LDRKSTSGT
Subjt:  LDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT

Query:  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ
        C  LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMKQ
Subjt:  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ

KYP78729.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]0.0e+0061.81Show/hide
Query:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC-
        CL+A K   WYLDSGCSRHMTGD SK  +   KN G VT+GDN KG I         + + LI  +L +   K     I  L +   K+ +     ++C 
Subjt:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC-

Query:  ---KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT
           K +   G R +N+Y LDL +   +   KCL    ++ WLWHRR  H  M  ++ + +  LV GLP  KF KDK+CDACQ GKQ K+SFKSKN IST+
Subjt:  ---KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT

Query:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT
        RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Subjt:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT

Query:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL
        PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE++ GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFL
Subjt:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL

Query:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPE
        GYS+ SKAYR++NK+TLV+EES+HVVFDES N         +DL +     L+ ++  E+    +    +EK +E    LPKEW+ +     D I+GN  
Subjt:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPE

Query:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID
        +GV TRS++ N+ + +AFVSQ+EP++  +A  DE W++AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEEGID
Subjt:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID

Query:  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDN
        Y+ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG+I EEVYVEQPPGF  +  PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK+DN
Subjt:  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDN

Query:  TLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDK
        TLF+K   ND + VQIYVDDI+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DGIFISQ KY  +LLKKF +   K A TP+S    LD 
Subjt:  TLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDK

Query:  DEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQF
        DEKG  VD   YRG+IGSLLYLTASRPDIMF+VCLCARFQ+ PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+D+AG  LDRKSTSGTC  
Subjt:  DEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQF

Query:  LGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ
        LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMKQ
Subjt:  LGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ

RVW71911.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]0.0e+0066.14Show/hide
Query:  KASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI-----MMLLKIFVKKMVFPIIFSLQELLNK--MVWLKGKIVLCKNLLD
        + SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG I  +  I      ++  + +   +   + S+ +L +K   V  +    + K++ +
Subjt:  KASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI-----MMLLKIFVKKMVFPIIFSLQELLNK--MVWLKGKIVLCKNLLD

Query:  H-----GNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQL
              G+R ENVY ++++ Y   D+C S  HD SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C+ACQMGKQ K+SFK+KN IST+RPL+L
Subjt:  H-----GNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQL

Query:  LHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNG
        LHMDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+ +C ++G +HNFS+PRTPQQNG
Subjt:  LHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNG

Query:  VVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS
        VVERKNRTLQE AR+MLNE  LPKYFW EAVNT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++S
Subjt:  VVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS

Query:  KAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKKEEGSSSLPKEWRYALSHPKDLI
        KA+RVFNK+T+V+EESIHV+FDES N++       DD  LE   G L + DK ++      P  +D  +      + + E S  LPK+W++ ++HP+D I
Subjt:  KAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKKEEGSSSLPKEWRYALSHPKDLI

Query:  LGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQ
        +GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY Q
Subjt:  LGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQ

Query:  EEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKM
        EEGIDYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKM
Subjt:  EEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKM

Query:  GKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTT
        GKID TLFIK K  DML+VQIYVDDIIFG+TN SLCE+FSKCMH+EFEMSMMGEL++FLGLQIKQLK+G FI+Q KY +DLLK+F + E KV KTPMS++
Subjt:  GKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTT

Query:  TKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTS
         KLD DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCLCARFQSCPKESH  AVKRIL+YL GT+++GLWYP+   F L+G+SDADFAG  ++RKSTS
Subjt:  TKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTS

Query:  GTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ
        GTC FLG SLVSW SKKQNSVALST EAEYIA   CCAQILWMKQ
Subjt:  GTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ

TrEMBL top hitse value%identityAlignment
A0A151QSZ9 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0062.21Show/hide
Query:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC-
        CL+A K   WYLDSGCSRHMTGD SK  S   KN G VT+GDN KG I         + + LI  +L +   K     I  L +   K+ +     ++C 
Subjt:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC-

Query:  ---KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT
           K +   G R +N+Y LDL +   I   KCL    ++ WLWHRR  H  M  ++ +S+  LV GLP  KF KDK+CDACQ GKQ K+SFKSKN ISTT
Subjt:  ---KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT

Query:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT
        RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Subjt:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT

Query:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL
        PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE+++G+ PNI YF+VFGCKCF+LNN KE+LGKFD+K D  IFL
Subjt:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL

Query:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNES---ICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG
        GYS+ SKAYR++NK+TLV+EES+HVVFDES    + ++     +D L++   +   N+K KE           E  +E S  LPKEW+ +     D I+G
Subjt:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNES---ICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG

Query:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE
        N  +GV TRS++ N+ + +AFVSQ+EP+S  +A  DE W++AMQEELNQFERN+VW LVP P++  IIGTKWVFRNK+DE+G IIRNKARLVA+GY QEE
Subjt:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE

Query:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGK
        GIDY+ETFAPVAR+EAIR+LLA++S  NF LYQMDVKSAFLNG I EEVYVEQPPGF  F  PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END+  GK
Subjt:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGK

Query:  IDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTK
        +DNTLF+K   ND + VQIYVDDI+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DGIFISQ KY  +LLKKF +   K   TP+S    
Subjt:  IDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTK

Query:  LDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT
        LD DEKG  VD   YRG+IGSLLYLTASRPDIMF VCLCARFQ+ PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+D+AG  LDRKSTSGT
Subjt:  LDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT

Query:  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMK
        C  LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMK
Subjt:  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMK

A0A151RY83 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0062.14Show/hide
Query:  LKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC--
        L  +K   WYLDSGCSRHMTGD SK  S   KN G VT+GDN KG I         + + LI  +L +   K     I  L +   K+ +     ++C  
Subjt:  LKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC--

Query:  --KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTR
          K +   G R +N+Y LDL +   +   KCL    + +WLWHRR  H  M  ++ +S+  LV GLP  KF KDK+CDACQ GKQ K+SFKSKN IST+R
Subjt:  --KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTR

Query:  PLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTP
        PLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRTP
Subjt:  PLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTP

Query:  QQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLG
        QQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V NRVL+RP L KTPYE++ GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFLG
Subjt:  QQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFLG

Query:  YSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLV----NDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG
        YS+ SK+YR++NK+TLV+EES+HVVFDES N         +DL +     L+    N+K KE V         EK++     LPKEW+ +     D I+G
Subjt:  YSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLV----NDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG

Query:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE
        N  +GV TRS++ N+ + +AFVSQ+EP++  +A  DE W++AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEE
Subjt:  NPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEE

Query:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGK
        GIDY+ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG I EEVYVEQPPGF  F  PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK
Subjt:  GIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGK

Query:  IDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTK
        +DNTLF+K   ND + VQIYVDDI+FGSTNSSLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DG FISQ KY  +LLKKF +   K A TP+S    
Subjt:  IDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTK

Query:  LDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT
        LD DEKG  VD   YRG+IGSLLYLTASRPDIMF+VCLCARFQ+ PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+DFAG  LDRKSTSGT
Subjt:  LDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGT

Query:  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ
        C  LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMKQ
Subjt:  CQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ

A0A151UHG7 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0061.81Show/hide
Query:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC-
        CL+A K   WYLDSGCSRHMTGD SK  +   KN G VT+GDN KG I         + + LI  +L +   K     I  L +   K+ +     ++C 
Subjt:  CLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVI---------TEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLC-

Query:  ---KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT
           K +   G R +N+Y LDL +   +   KCL    ++ WLWHRR  H  M  ++ + +  LV GLP  KF KDK+CDACQ GKQ K+SFKSKN IST+
Subjt:  ---KNLLDHGNRDENVYTLDLNYYPIID--KCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTT

Query:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT
        RPLQL+HMDLFGPSR  S GGNYY  VIVDD+SR+TWV+ + +K+DA  +F  FAK VQNEK   I+ IRSDHGGEF N  F+ FCEE+G +HNFS+PRT
Subjt:  RPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRT

Query:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL
        PQQNGVVERKNR+L+E AR+MLNE  LPKYFW +A+NTAC+V N+VL+RP L KTPYE++ GK PNI YF+VFGCKC++LNN KE+LGKFD+K D  IFL
Subjt:  PQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNN-KEKLGKFDSKTDVGIFL

Query:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPE
        GYS+ SKAYR++NK+TLV+EES+HVVFDES N         +DL +     L+ ++  E+    +    +EK +E    LPKEW+ +     D I+GN  
Subjt:  GYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPE

Query:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID
        +GV TRS++ N+ + +AFVSQ+EP++  +A  DE W++AMQEELNQFERN+VW LVP P +  IIGTKWVFRNK+DE+G I+RNKARLVA+GY QEEGID
Subjt:  QGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGID

Query:  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDN
        Y+ETFAPVAR+EAIR+LLA++S KNF LYQMDVKSAFLNG+I EEVYVEQPPGF  +  PNHVYKLKKALYGLKQAPR+WYDRLSKFL+END++ GK+DN
Subjt:  YEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDN

Query:  TLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDK
        TLF+K   ND + VQIYVDDI+FGSTN+SLC+EF+K M  EFEMSMMGEL+FFLGLQIKQ+ DGIFISQ KY  +LLKKF +   K A TP+S    LD 
Subjt:  TLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDK

Query:  DEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQF
        DEKG  VD   YRG+IGSLLYLTASRPDIMF+VCLCARFQ+ PKESH  +VKRILKYL GT +VGLWYP+ V  +L+GYSD+D+AG  LDRKSTSGTC  
Subjt:  DEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQF

Query:  LGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ
        LGS+LVSW SKKQ  VALST EAEYIA  SCCAQILWMKQ
Subjt:  LGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ

A0A438GI90 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0066.14Show/hide
Query:  KASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI-----MMLLKIFVKKMVFPIIFSLQELLNK--MVWLKGKIVLCKNLLD
        + SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG I  +  I      ++  + +   +   + S+ +L +K   V  +    + K++ +
Subjt:  KASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI-----MMLLKIFVKKMVFPIIFSLQELLNK--MVWLKGKIVLCKNLLD

Query:  H-----GNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQL
              G+R ENVY ++++ Y   D+C S  HD SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C+ACQMGKQ K+SFK+KN IST+RPL+L
Subjt:  H-----GNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQL

Query:  LHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNG
        LHMDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+ +C ++G +HNFS+PRTPQQNG
Subjt:  LHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNG

Query:  VVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS
        VVERKNRTLQE AR+MLNE  LPKYFW EAVNT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIFLGYS++S
Subjt:  VVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS

Query:  KAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKKEEGSSSLPKEWRYALSHPKDLI
        KA+RVFNK+T+V+EESIHV+FDES N++       DD  LE   G L + DK ++      P  +D  +      + + E S  LPK+W++ ++HP+D I
Subjt:  KAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDD--LEKDFGDLLVNDKGKEIV----PSMQDVNII-----EKKEEGSSSLPKEWRYALSHPKDLI

Query:  LGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQ
        +GNP  GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AMQEELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY Q
Subjt:  LGNPEQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQ

Query:  EEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKM
        EEGIDYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EEVYVEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKM
Subjt:  EEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKM

Query:  GKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTT
        GKID TLFIK K  DML+VQIYVDDIIFG+TN SLCE+FSKCMH+EFEMSMMGEL++FLGLQIKQLK+G FI+Q KY +DLLK+F + E KV KTPMS++
Subjt:  GKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTT

Query:  TKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTS
         KLD DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCLCARFQSCPKESH  AVKRIL+YL GT+++GLWYP+   F L+G+SDADFAG  ++RKSTS
Subjt:  TKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTS

Query:  GTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ
        GTC FLG SLVSW SKKQNSVALST EAEYIA   CCAQILWMKQ
Subjt:  GTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ

A5C8K0 Uncharacterized protein0.0e+0061.85Show/hide
Query:  DLVPVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI-----MMLLKIFVKKMVFPIIFSLQELLNK--MVWLKGKIV
        +++    K SK++KW+LDSGCSRHMTGD SK    +K+ GG VTFGDN KG I  +  I      ++  + +   +   + S+ +L NK   V  +    
Subjt:  DLVPVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLI-----MMLLKIFVKKMVFPIIFSLQELLNK--MVWLKGKIV

Query:  LCKNLLDH-----GNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVIS
        + K++ +      G+R ENVY ++++ Y   D+C S  HD SWLWHRRLGHA+M LIS ++K+ LVRGLP   F+KDK+C+ACQMGKQ K+SFK+KN IS
Subjt:  LCKNLLDH-----GNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVIS

Query:  TTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSP
        T+RPL+LLHMDLFGPSR  S GG  YA+VIVDDFSR+TWVL +  K +A   F  F  +VQNEKGF I+ IRSDHG EF+N  F+ +C + G +HNF +P
Subjt:  TTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSP

Query:  RTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIF
        RT QQNGVVERKNRTLQE AR+MLNE  LPKYFW EA+NT+CYV NR+L+RP L KTPYELW  K PNI YFKVFGCKCFILN K+ LGKFD+K+DVGIF
Subjt:  RTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIF

Query:  LGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNP
        LGYS++SKA+RVFNK+T+V+EESIH  +   W N    +       KD      N K  E +P        +K       L  + ++ ++HP+D I+GNP
Subjt:  LGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNP

Query:  EQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI
          GV+TRSSL N+ +NLAF+SQIEP++ KDA  DE W++AMQ+ELNQFER++VW+LVPRPSN S+IGTKWVFRNKMDENG I+RNKARLVAQGY QEEGI
Subjt:  EQGVKTRSSL-NLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGI

Query:  DYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKID
        DYEETFAPVARLEAIRMLLAFA +K+FILYQMDVKSAFLNG+I EE+YVEQPPGF+SF+ PNHV+KLKKALYGLKQAPRAWY+RLSKFLL+  FKMGKID
Subjt:  DYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKID

Query:  NTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLD
         TLFIK K NDML+VQIYVDDI FG+TN SLCE+FSKCMH                               KY +DLLK+F + E KV KTPMS++ KLD
Subjt:  NTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLD

Query:  KDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQ
         DEKGK +D   YRGMIGSLLYLTASRPDIM+SVCLCARFQSCPKESH  AVKRIL+YL GT+ +GLWYP+   F L+G+SDADFAG  ++RKSTSGTC 
Subjt:  KDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQ

Query:  FLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ
         LG SLVSW SKKQNS+ALST EAEY A + C AQILWMKQ
Subjt:  FLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-11630.2Show/hide
Query:  LDSGCSRHMTGDRSKL---------ISFSKKNGGMVTFGDNKKGVITEENLIMMLLK--IFVKKMVFPI--IFSLQELLNKMVWLKGKIVLCKN---LLD
        LDSG S H+  D S           +  +    G   +   K+G++   N   + L+  +F K+    +  +  LQE    + + K  + + KN   ++ 
Subjt:  LDSGCSRHMTGDRSKL---------ISFSKKNGGMVTFGDNKKGVITEENLIMMLLK--IFVKKMVFPI--IFSLQELLNKMVWLKGKIVLCKN---LLD

Query:  HGNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLV--RGLPSFKFEKDKVCDACQMGKQTKSSFKS-KNVISTTRPLQLLH
        +     NV  ++   Y I  K  + F     LWH R GH S   +  I +  +   + L +      ++C+ C  GKQ +  FK  K+     RPL ++H
Subjt:  HGNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLV--RGLPSFKFEKDKVCDACQMGKQTKSSFKS-KNVISTTRPLQLLH

Query:  MDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVV
         D+ GP    +     Y  + VD F+ +    +IK+K D    F  F  + +      +  +  D+G E+ ++  + FC + G S++ + P TPQ NGV 
Subjt:  MDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVV

Query:  ERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD--KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS
        ER  RT+ E AR+M++   L K FW EAV TA Y+ NR+  R  +D  KTPYE+WH K P + + +VFG   ++ + K K GKFD K+   IF+GY    
Subjt:  ERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD--KTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTS

Query:  -KAYRVFNKKTLVIEESIHVVFDESWNNVSN-----ESICSDDLEKDFGDLLVNDKGKEI-------------VPSMQDVNIIEKK--------------
         K +   N+K +V  +   VV DE+ N V++     E++   D ++       ND  K I             +  ++D    E K              
Subjt:  -KAYRVFNKKTLVIEESIHVVFDESWNNVSN-----ESICSDDLEKDFGDLLVNDKGKEI-------------VPSMQDVNIIEKK--------------

Query:  -------------------------------------EEGSSSLPKEWRYA--LSHPKDLILGNP------------EQGVKTR---------SSLN-LF
                                             E   S  P E R +    H K++ + NP             + +KT+         +SLN + 
Subjt:  -------------------------------------EEGSSSLPKEWRYA--LSHPKDLILGNP------------EQGVKTR---------SSLN-LF

Query:  SNLAFVSQIEPRSFKDAECDE---FWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVAR
         N   +    P SF + +  +    W  A+  ELN  + N  W +  RP N +I+ ++WVF  K +E GN IR KARLVA+G+ Q+  IDYEETFAPVAR
Subjt:  SNLAFVSQIEPRSFKDAECDE---FWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVAR

Query:  LEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKN--
        + + R +L+     N  ++QMDVK+AFLNG + EE+Y+  P G       ++V KL KA+YGLKQA R W++   + L E +F    +D  ++I  K   
Subjt:  LEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKN--

Query:  NDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVD
        N+ + V +YVDD++  + + +    F + +  +F M+ + E+  F+G++I+  +D I++SQ  Y + +L KF +       TP+   +K++ +      D
Subjt:  NDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVD

Query:  IKT-YRGMIGSLLY-LTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEF--NLVGYSDADFAGSLLDRKSTSG-TCQFLGS
          T  R +IG L+Y +  +RPD+  +V + +R+ S      +  +KR+L+YL GTID+ L + +N+ F   ++GY D+D+AGS +DRKST+G   +    
Subjt:  IKT-YRGMIGSLLY-LTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEF--NLVGYSDADFAGSLLDRKSTSG-TCQFLGS

Query:  SLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQLFVILD
        +L+ W +K+QNSVA S+TEAEY+A+     + LW+K L   ++
Subjt:  SLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQLFVILD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-12334.58Show/hide
Query:  LWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMI
        LWH+R+GH S   +  ++K  L+      K    K CD C  GKQ + SF++ +       L L++ D+ GP  I S GGN Y    +DD SR  WV ++
Subjt:  LWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMI

Query:  KHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACY
        K KD   + F  F   V+ E G  + ++RSD+GGE+ +  F+ +C  +G  H  + P TPQ NGV ER NRT+ E  RSML    LPK FW EAV TACY
Subjt:  KHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACY

Query:  VSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSD
        + NR    P   + P  +W  K  +  + KVFGC+ F    KE+  K D K+   IF+GY      YR+++     +  S  VVF ES   V   +  S+
Subjt:  VSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSD

Query:  DLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG-----NPEQGVKTRSSL----------NLFSNLAFV---SQIEPR
         ++       V        P+  +    E  E+G    P E    +   + L  G     +P QG +    L            + +  +V      EP 
Subjt:  DLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILG-----NPEQGVKTRSSL----------NLFSNLAFV---SQIEPR

Query:  SFKDA----ECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        S K+     E ++  + AMQEE+   ++N  +KLV  P     +  KWVF+ K D +  ++R KARLV +G+ Q++GID++E F+PV ++ +IR +L+ A
Subjt:  SFKDA----ECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA

Query:  SYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIK-VKNNDMLIVQIYVDD
        +  +  + Q+DVK+AFL+G + EE+Y+EQP GFE     + V KL K+LYGLKQAPR WY +   F+    +     D  ++ K    N+ +I+ +YVDD
Subjt:  SYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIK-VKNNDMLIVQIYVDD

Query:  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQI--KQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDK-------DEKGKCVDIKT
        ++    +  L  +    +   F+M  +G     LG++I  ++    +++SQEKY   +L++F +   K   TP++   KL K       +EKG    +  
Subjt:  IIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQI--KQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDK-------DEKGKCVDIKT

Query:  YRGMIGSLLY-LTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFS
        Y   +GSL+Y +  +RPDI  +V + +RF   P + H+ AVK IL+YL GT    L +  +    L GY+DAD AG + +RKS++G         +SW S
Subjt:  YRGMIGSLLY-LTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFS

Query:  KKQNSVALSTTEAEYIAVASCCAQILWMKQ
        K Q  VALSTTEAEYIA      +++W+K+
Subjt:  KKQNSVALSTTEAEYIAVASCCAQILWMKQ

P92519 Uncharacterized mitochondrial protein AtMg008103.4e-3436Show/hide
Query:  IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTY
        +YVDDI+   ++++L       + + F M  +G + +FLG+QIK    G+F+SQ KY   +L     N G +   PMST   L  +      K  D   +
Subjt:  IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTY

Query:  RGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKK
        R ++G+L YLT +RPDI ++V +  +    P  + F  +KR+L+Y+ GTI  GL+  +N + N+  + D+D+AG    R+ST+G C FLG +++SW +K+
Subjt:  RGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKK

Query:  QNSVALSTTEAEYIAVASCCAQILW
        Q +V+ S+TE EY A+A   A++ W
Subjt:  QNSVALSTTEAEYIAVASCCAQILW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.7e-12330.88Show/hide
Query:  NKWYLDSGCSRHMTGDRSKLISFSKKNGG---MVTFGD-------------------NKKGVITEENLIMMLLKIF-------VKKMVFPIIFSLQELLN
        N W LDSG + H+T D + L       GG   MV  G                    N   ++   N+   L+ ++       V    FP  F +++L  
Subjt:  NKWYLDSGCSRHMTGDRSKLISFSKKNGG---MVTFGD-------------------NKKGVITEENLIMMLLKIF-------VKKMVFPIIFSLQELLN

Query:  KMVWLKGKIVLCKNLLDHGNRDENVYTLDLNYYPII-DKCLSVFHDDS-----WLWHRRLGHASMHLISNISKNCLVRGL-PSFKFEKDKVCDACQMGKQ
         +  L+GK            +DE      L  +PI   + +S+F   S       WH RLGH +  +++++  N  +  L PS KF     C  C + K 
Subjt:  KMVWLKGKIVLCKNLLDHGNRDENVYTLDLNYYPII-DKCLSVFHDDS-----WLWHRRLGHASMHLISNISKNCLVRGL-PSFKFEKDKVCDACQMGKQ

Query:  TKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFC
         K  F S++ I++TRPL+ ++ D++  S I S+    Y  + VD F+R+TW+  +K K    ++FI+F   ++N     I    SD+GGEF   A   + 
Subjt:  TKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFC

Query:  EENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNNKEK
         ++G SH  S P TP+ NG+ ERK+R + E   ++L+   +PK +W  A   A Y+ NR L  P L  ++P++   G  PN    +VFGC C+       
Subjt:  EENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNNKEK

Query:  LGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSN-------------ESIC---------------------------------
          K D K+   +FLGYS T  AY   + +T  +  S HV FDE+    SN             ES C                                 
Subjt:  LGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSN-------------ESIC---------------------------------

Query:  ----------SDDLEKDFGDLL----------VNDKGKEIVPSM----------------------QDVNIIEKKEEGSSSLPKEWRYALSH--------
                  S +L+  F               N       P+                       Q    +    + SSS P     A S         
Subjt:  ----------SDDLEKDFGDLL----------VNDKGKEIVPSM----------------------QDVNIIEKKEEGSSSLPKEWRYALSH--------

Query:  -------PKDLILGNPEQ------GVKTRSSLNLFS-------NLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWV
               P   I+ N  Q       + TR+   +          ++  ++ EPR+   A  DE W  AM  E+N    N  W LV P PS+ +I+G +W+
Subjt:  -------PKDLILGNPEQ------GVKTRSSLNLFS-------NLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNASIIGTKWV

Query:  FRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKAL
        F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV +  +IR++L  A  +++ + Q+DV +AFL G + ++VY+ QPPGF   D PN+V KL+KAL
Subjt:  FRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHVYKLKKAL

Query:  YGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQE
        YGLKQAPRAWY  L  +LL   F     D +LF+  +   ++ + +YVDDI+    + +L       +   F +    EL +FLG++ K++  G+ +SQ 
Subjt:  YGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQE

Query:  KYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPR
        +Y  DLL +  +   K   TPM+ + KL      K  D   YRG++GSL YL  +RPDI ++V   ++F   P E H  A+KRIL+YL GT + G++  +
Subjt:  KYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPR

Query:  NVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQLFVIL
            +L  YSDAD+AG   D  ST+G   +LG   +SW SKKQ  V  S+TEAEY +VA+  +++ W+  L   L
Subjt:  NVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQLFVIL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-11829.21Show/hide
Query:  NKWYLDSGCSRHMTGDRSKLISFSKKNGG---MVTFGD-------------------NKKGVITEENLIMMLLKIF-------VKKMVFPIIFSLQELLN
        N W LDSG + H+T D + L       GG   M+  G                    +   V+   N+   L+ ++       V    FP  F +++L  
Subjt:  NKWYLDSGCSRHMTGDRSKLISFSKKNGG---MVTFGD-------------------NKKGVITEENLIMMLLKIF-------VKKMVFPIIFSLQELLN

Query:  KMVWLKGKIVLCKNLLDHGNRDENVYTLDLNYYPIIDK---------CLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGL-PSFKFEKDKVCDACQM
         +  L+GK            +DE      L  +PI            C    H     WH RLGH S+ +++++  N  +  L PS K      C  C +
Subjt:  KMVWLKGKIVLCKNLLDHGNRDENVYTLDLNYYPIIDK---------CLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGL-PSFKFEKDKVCDACQM

Query:  GKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFK
         K  K  F S + I++++PL+ ++ D++  S I S     Y  + VD F+R+TW+  +K K     +FI F   V+N     I  + SD+GGEF     +
Subjt:  GKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASYGGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFK

Query:  AFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNN
         +  ++G SH  S P TP+ NG+ ERK+R + E   ++L+   +PK +W  A + A Y+ NR L  P L  ++P++   G+ PN    KVFGC C+    
Subjt:  AFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLD-KTPYELWHGKIPNIGYFKVFGCKCFILNN

Query:  KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDV----------------
             K + K+    F+GYS T  AY   +  T  +  S HV FDE     S  +      ++   D   N      +P+   V                
Subjt:  KEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESICSDDLEKDFGDLLVNDKGKEIVPSMQDV----------------

Query:  -----NIIEKKEEGSSSLP------------------------KEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNL-----------------------
             + +   +  SS+LP                        +  +   S+    IL NP     + +S N  S L                       
Subjt:  -----NIIEKKEEGSSSLP------------------------KEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNL-----------------------

Query:  ----------------------------------------------------AFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNAS
                                                            +  +  EPR+   A  D+ W  AM  E+N    N  W LV P P + +
Subjt:  ----------------------------------------------------AFVSQIEPRSFKDAECDEFWILAMQEELNQFERNKVWKLV-PRPSNAS

Query:  IIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHV
        I+G +W+F  K + +G++ R KARLVA+GY Q  G+DY ETF+PV +  +IR++L  A  +++ + Q+DV +AFL G + +EVY+ QPPGF   D P++V
Subjt:  IIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPNHV

Query:  YKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKD
         +L+KA+YGLKQAPRAWY  L  +LL   F     D +LF+  +   ++ + +YVDDI+    ++ L +     +   F +    +L +FLG++ K++  
Subjt:  YKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKD

Query:  GIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTID
        G+ +SQ +YT DLL +  +   K   TPM+T+ KL      K  D   YRG++GSL YL  +RPD+ ++V   +++   P + H++A+KR+L+YL GT D
Subjt:  GIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTID

Query:  VGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQLFVIL
         G++  +    +L  YSDAD+AG   D  ST+G   +LG   +SW SKKQ  V  S+TEAEY +VA+  +++ W+  L   L
Subjt:  VGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQLFVIL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-8640.33Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS
        EP ++ +A+    W  AM +E+   E    W++   P N   IG KWV++ K + +G I R KARLVA+GY Q+EGID+ ETF+PV +L +++++LA ++
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFAS

Query:  YKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFD----LPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYV
          NF L+Q+D+ +AFLNG + EE+Y++ PPG+ +       PN V  LKK++YGLKQA R W+ + S  L+   F     D+T F+K+     L V +YV
Subjt:  YKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFD----LPNHVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYV

Query:  DDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGS
        DDII  S N +  +E    + + F++  +G L +FLGL+I +   GI I Q KY  DLL +  L   K +  PM  +        G  VD K YR +IG 
Subjt:  DDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGS

Query:  LLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVAL
        L+YL  +R DI F+V   ++F   P+ +H  AV +IL Y+ GT+  GL+Y    E  L  +SDA F      R+ST+G C FLG+SL+SW SKKQ  V+ 
Subjt:  LLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVAL

Query:  STTEAEYIAVASCCAQILWMKQLF
        S+ EAEY A++    +++W+ Q F
Subjt:  STTEAEYIAVASCCAQILWMKQLF

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.5e-0839.47Show/hide
Query:  NRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKL
        NRT+ E  RSML E GLPK F  +A NTA ++ N+          P E+W   +P   Y + FGC  +I  ++ KL
Subjt:  NRTLQEFARSMLNEYGLPKYFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKL

ATMG00810.1 DNA/RNA polymerases superfamily protein2.4e-3536Show/hide
Query:  IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTY
        +YVDDI+   ++++L       + + F M  +G + +FLG+QIK    G+F+SQ KY   +L     N G +   PMST   L  +      K  D   +
Subjt:  IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTY

Query:  RGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKK
        R ++G+L YLT +RPDI ++V +  +    P  + F  +KR+L+Y+ GTI  GL+  +N + N+  + D+D+AG    R+ST+G C FLG +++SW +K+
Subjt:  RGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKK

Query:  QNSVALSTTEAEYIAVASCCAQILW
        Q +V+ S+TE EY A+A   A++ W
Subjt:  QNSVALSTTEAEYIAVASCCAQILW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)7.5e-2151.52Show/hide
Query:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA
        EP+S   A  D  W  AMQEEL+   RNK W LVP P N +I+G KWVF+ K+  +G + R KARLVA+G+ QEEGI + ET++PV R   IR +L  A
Subjt:  EPRSFKDAECDEFWILAMQEELNQFERNKVWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATAAAGCTTATTCTTGTTACCTATCTAGATCTAGTGCCTGTTTGTTTGAAAGCCTCCAAGAAAAACAAATGGTACTTGGATAGTGGTTGCTCAAGACACATGAC
GGGAGACCGATCCAAGCTTATCTCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGAGTGATCACGGAGGAGAATTTGATAATGATGCTTT
TAAAGATTTTTGTCAAGAAAATGGTTTTTCCCATCATTTTTTCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTA
GATCACGGAAATAGGGATGAAAATGTGTACACTCTTGATTTGAATTATTATCCTATTATTGATAAATGTCTTTCGGTTTTCCATGATGATTCTTGGTTATGGCATAGAAG
ACTAGGACATGCTAGCATGCACTTAATTTCAAATATTTCTAAAAATTGTTTGGTTAGAGGTCTTCCTAGTTTTAAATTTGAAAAAGACAAAGTTTGTGATGCTTGTCAAA
TGGGTAAGCAAACTAAGTCCTCTTTCAAATCTAAAAATGTGATCTCTACTACTAGACCCTTACAACTATTACACATGGACTTATTTGGCCCTTCTAGAATTGCTAGTTAT
GGAGGAAATTATTATGCTTTTGTGATAGTTGATGATTTTTCTAGATTTACTTGGGTTTTGATGATAAAACATAAGGATGATGCTTTGAAAAGTTTTATTAGTTTTGCAAA
AAGAGTACAAAATGAAAAAGGATTTTTTATTTCCAAAATTAGGAGTGATCACGGAGGAGAATTTGATAATGATGCTTTTAAAGCTTTTTGTGAAGAAAATGGTTTTTCCC
ATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTACCTAAA
TATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATTCCAAA
TATTGGGTATTTCAAAGTTTTCGGTTGTAAATGTTTTATTTTGAATAACAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGATGTTGGTATTTTTCTAGGATATTCCT
CTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTATTGAGGAATCTATTCATGTTGTATTTGATGAATCTTGGAATAATGTTTCTAATGAGTCTATTTGT
AGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAACATCATAGAAAAGAAAGAAGAGGGTTC
TTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATTTATTTAGTA
ATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAAGAAACAAA
GTTTGGAAATTAGTCCCTAGGCCTTCTAATGCATCTATAATCGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACT
TGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTTCTTATAAAA
ATTTCATTTTGTATCAAATGGATGTAAAAAGTGCTTTTTTAAACGGTTATATTGTAGAGGAAGTTTACGTAGAACAACCTCCGGGTTTTGAAAGTTTTGATTTACCTAAT
CATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACTTGAGAATGACTTTAAGATGGGAAAAAT
TGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTACTAATTCATCTTTGTGTGAAGAATTTT
CCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAA
TACACAAGGGATTTGCTCAAGAAATTCAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAGTGTGTGGA
TATAAAGACTTATCGAGGTATGATCGGATCTTTACTTTATTTGACCGCTAGTAGACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAG
AATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGATATTCCGAT
GCGGACTTTGCCGGTAGTTTACTTGACCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTT
ATCCACTACCGAAGCGGAATATATTGCGGTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAACTCTTTGTGATTTTGGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACATAAAGCTTATTCTTGTTACCTATCTAGATCTAGTGCCTGTTTGTTTGAAAGCCTCCAAGAAAAACAAATGGTACTTGGATAGTGGTTGCTCAAGACACATGAC
GGGAGACCGATCCAAGCTTATCTCTTTCTCCAAAAAGAATGGAGGCATGGTAACCTTTGGTGACAACAAGAAAGGAGTGATCACGGAGGAGAATTTGATAATGATGCTTT
TAAAGATTTTTGTCAAGAAAATGGTTTTTCCCATCATTTTTTCTCTCCAAGAACTCCTCAACAAAATGGTGTGGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTA
GATCACGGAAATAGGGATGAAAATGTGTACACTCTTGATTTGAATTATTATCCTATTATTGATAAATGTCTTTCGGTTTTCCATGATGATTCTTGGTTATGGCATAGAAG
ACTAGGACATGCTAGCATGCACTTAATTTCAAATATTTCTAAAAATTGTTTGGTTAGAGGTCTTCCTAGTTTTAAATTTGAAAAAGACAAAGTTTGTGATGCTTGTCAAA
TGGGTAAGCAAACTAAGTCCTCTTTCAAATCTAAAAATGTGATCTCTACTACTAGACCCTTACAACTATTACACATGGACTTATTTGGCCCTTCTAGAATTGCTAGTTAT
GGAGGAAATTATTATGCTTTTGTGATAGTTGATGATTTTTCTAGATTTACTTGGGTTTTGATGATAAAACATAAGGATGATGCTTTGAAAAGTTTTATTAGTTTTGCAAA
AAGAGTACAAAATGAAAAAGGATTTTTTATTTCCAAAATTAGGAGTGATCACGGAGGAGAATTTGATAATGATGCTTTTAAAGCTTTTTGTGAAGAAAATGGTTTTTCCC
ATAATTTTTCCTCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAAAATCGTACTTTGCAAGAATTTGCTAGATCAATGTTGAATGAGTATGGTTTACCTAAA
TATTTTTGGACGGAAGCCGTTAACACCGCTTGCTATGTTTCAAATAGAGTTTTAGTTAGACCCTCTTTAGATAAAACTCCTTATGAACTTTGGCATGGAAAAATTCCAAA
TATTGGGTATTTCAAAGTTTTCGGTTGTAAATGTTTTATTTTGAATAACAAAGAAAAACTTGGAAAATTTGATTCTAAGACGGATGTTGGTATTTTTCTAGGATATTCCT
CTACTAGTAAAGCTTATAGAGTTTTCAATAAGAAAACTTTAGTTATTGAGGAATCTATTCATGTTGTATTTGATGAATCTTGGAATAATGTTTCTAATGAGTCTATTTGT
AGTGATGATTTAGAAAAAGATTTTGGAGATTTACTTGTTAATGACAAAGGCAAAGAAATTGTTCCAAGTATGCAAGATGTGAACATCATAGAAAAGAAAGAAGAGGGTTC
TTCATCCTTGCCTAAAGAGTGGAGATATGCTCTATCCCATCCCAAGGATCTAATTCTTGGCAATCCCGAACAAGGTGTCAAAACTCGTTCTTCTCTTAATTTATTTAGTA
ATCTTGCTTTTGTGTCTCAAATTGAACCTAGAAGTTTTAAAGATGCCGAATGTGATGAGTTTTGGATTTTAGCTATGCAAGAAGAATTAAATCAATTTGAAAGAAACAAA
GTTTGGAAATTAGTCCCTAGGCCTTCTAATGCATCTATAATCGGAACTAAATGGGTTTTTAGAAATAAGATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACT
TGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTTCTTATAAAA
ATTTCATTTTGTATCAAATGGATGTAAAAAGTGCTTTTTTAAACGGTTATATTGTAGAGGAAGTTTACGTAGAACAACCTCCGGGTTTTGAAAGTTTTGATTTACCTAAT
CATGTTTATAAGTTGAAAAAGGCTCTTTATGGCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACTTGAGAATGACTTTAAGATGGGAAAAAT
TGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTACTAATTCATCTTTGTGTGAAGAATTTT
CCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAA
TACACAAGGGATTTGCTCAAGAAATTCAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAGTGTGTGGA
TATAAAGACTTATCGAGGTATGATCGGATCTTTACTTTATTTGACCGCTAGTAGACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAG
AATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGATATTCCGAT
GCGGACTTTGCCGGTAGTTTACTTGACCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTT
ATCCACTACCGAAGCGGAATATATTGCGGTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAACTCTTTGTGATTTTGGATTAA
Protein sequenceShow/hide protein sequence
MDIKLILVTYLDLVPVCLKASKKNKWYLDSGCSRHMTGDRSKLISFSKKNGGMVTFGDNKKGVITEENLIMMLLKIFVKKMVFPIIFSLQELLNKMVWLKGKIVLCKNLL
DHGNRDENVYTLDLNYYPIIDKCLSVFHDDSWLWHRRLGHASMHLISNISKNCLVRGLPSFKFEKDKVCDACQMGKQTKSSFKSKNVISTTRPLQLLHMDLFGPSRIASY
GGNYYAFVIVDDFSRFTWVLMIKHKDDALKSFISFAKRVQNEKGFFISKIRSDHGGEFDNDAFKAFCEENGFSHNFSSPRTPQQNGVVERKNRTLQEFARSMLNEYGLPK
YFWTEAVNTACYVSNRVLVRPSLDKTPYELWHGKIPNIGYFKVFGCKCFILNNKEKLGKFDSKTDVGIFLGYSSTSKAYRVFNKKTLVIEESIHVVFDESWNNVSNESIC
SDDLEKDFGDLLVNDKGKEIVPSMQDVNIIEKKEEGSSSLPKEWRYALSHPKDLILGNPEQGVKTRSSLNLFSNLAFVSQIEPRSFKDAECDEFWILAMQEELNQFERNK
VWKLVPRPSNASIIGTKWVFRNKMDENGNIIRNKARLVAQGYCQEEGIDYEETFAPVARLEAIRMLLAFASYKNFILYQMDVKSAFLNGYIVEEVYVEQPPGFESFDLPN
HVYKLKKALYGLKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEK
YTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLWYPRNVEFNLVGYSD
ADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQLFVILD