; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G07530 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G07530
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationClcChr07:20004392..20015084
RNA-Seq ExpressionClc07G07530
SyntenyClc07G07530
Gene Ontology termsGO:0006396 - RNA processing (biological process)
GO:0032991 - protein-containing complex (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR007175 - Ribonuclease P subunit, Rpr2/Snm1/Rpp21
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044832.1 Rpr2 domain-containing protein [Cucumis melo var. makuwa]4.4e-11166.37Show/hide
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KKGNTK+G+SNPTSGPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV PD SLFLC RCET+LQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA
        SNC +RIEKNN K+  R  K+SN+TQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+ LTMD P   P TT D LTIDTPA
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA

Query:  IP----------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
        IP                PPTE                                  DIPT+DAPATP T+T MTLL+ KRRKRKKPSSKN+TEPESC A 
Subjt:  IP----------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS
        T+ G+K+E TSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS

XP_004146565.1 uncharacterized protein LOC101220608 [Cucumis sativus]3.1e-10965.79Show/hide
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KK NT +G+SNP  GPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV+PD SLFLC RCETILQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA
        SNC++RIEKN  K+ RR  K SNLTQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+  TMD PKI P TT D LTIDTPA
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA

Query:  IP----------------PPT----------------------------------EDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
        IP                 PT                                   DIPT+DAPATP T+TGMTLL+ KRRKRKKPSSKNQTEPESC   
Subjt:  IP----------------PPT----------------------------------EDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS
        T+ G+ +EGTSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS

XP_008452026.1 PREDICTED: uncharacterized protein LOC103493157 [Cucumis melo]2.0e-11166.37Show/hide
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KKGNTK+G+SNPTSGPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV PD SLFLC RCET+LQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA
        SNC +RIEKNN K+ RR  K+SN+TQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+ LT+D P   P TT D LTIDTPA
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA

Query:  IP----------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
        IP                PPTE                                  DIPT+DAPATP T+T MTLL+ KRRKRKKPSSKN+TEPESC A 
Subjt:  IP----------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS
        T+ G+K+E TSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS

XP_022984531.1 uncharacterized protein LOC111482797 [Cucurbita maxima]3.7e-10263.16Show/hide
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MAK+KGNTKKGASNPTSGPQDSIT+RQEITGK KPKV NNVK YLNHLENLATWASG+ SIPSLAAFFGQRLA AAESLAV PDASLF CQRCETILQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCE-------NESLTMDVPKI-----LPS
        SNCS+RIEKNN KR RR+ K SN  QN V YYCH+CSCRN KRGTPKGHMK LYD  F  +VK VDVKDGK+CE        E LT+D PKI     +P 
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCE-------NESLTMDVPKI-----LPS

Query:  T--------------TRDSLTIDTPAIP---------------------------------PPTEDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQ
        T              T+  L I++PA P                                   T  +PTVDAPATP T TG+TLL+ K+RKR KPSSKNQ
Subjt:  T--------------TRDSLTIDTPAIP---------------------------------PPTEDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQ

Query:  TEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES
        TEP SC A TADGD++EGTSKR R RKSW SLKE+A+ NE+S
Subjt:  TEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES

XP_038906436.1 uncharacterized protein LOC120092350 isoform X1 [Benincasa hispida]1.7e-11066.95Show/hide
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MAKKKGN KKG+SNPTSGPQDSITLRQEITGKIKPKV NNVKVYLNHLENLATWA GQPSIPSLA FFGQRLAAAAESLAV PDASLFLCQRCETILQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENE--------SLTMDVPKILPSTTRD
        SNCS+RIEKNN KR R+ NK SNLTQNVV YYCHYCSCRN KRGTPKGHMK LY T+FVSK+KSV V+DGK+CEN+         LT+D P I PSTT +
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENE--------SLTMDVPKILPSTTRD

Query:  SLTIDTPAIPPPTE----------------------------------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRK
           IDT AIPP  +                                                          DIPTVDAPATPPTMTG+TLL+ KRRKRK
Subjt:  SLTIDTPAIPPPTE----------------------------------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRK

Query:  KPSSKNQTEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEE
        KPSSKNQTEPES  A T  GDKT G SKRKRNRKSW SLKEIA+R+EE
Subjt:  KPSSKNQTEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEE

TrEMBL top hitse value%identityAlignment
A0A0A0KUP1 Uncharacterized protein1.5e-10965.79Show/hide
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KK NT +G+SNP  GPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV+PD SLFLC RCETILQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA
        SNC++RIEKN  K+ RR  K SNLTQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+  TMD PKI P TT D LTIDTPA
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA

Query:  IP----------------PPT----------------------------------EDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
        IP                 PT                                   DIPT+DAPATP T+TGMTLL+ KRRKRKKPSSKNQTEPESC   
Subjt:  IP----------------PPT----------------------------------EDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS
        T+ G+ +EGTSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS

A0A1S3BU13 uncharacterized protein LOC1034931579.5e-11266.37Show/hide
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KKGNTK+G+SNPTSGPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV PD SLFLC RCET+LQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA
        SNC +RIEKNN K+ RR  K+SN+TQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+ LT+D P   P TT D LTIDTPA
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA

Query:  IP----------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
        IP                PPTE                                  DIPT+DAPATP T+T MTLL+ KRRKRKKPSSKN+TEPESC A 
Subjt:  IP----------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS
        T+ G+K+E TSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS

A0A5A7TNC0 Rpr2 domain-containing protein2.1e-11166.37Show/hide
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KKGNTK+G+SNPTSGPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV PD SLFLC RCET+LQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA
        SNC +RIEKNN K+  R  K+SN+TQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+ LTMD P   P TT D LTIDTPA
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA

Query:  IP----------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
        IP                PPTE                                  DIPT+DAPATP T+T MTLL+ KRRKRKKPSSKN+TEPESC A 
Subjt:  IP----------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS
        T+ G+K+E TSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS

A0A5D3CYJ3 Rpr2 domain-containing protein9.5e-11266.37Show/hide
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KKGNTK+G+SNPTSGPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV PD SLFLC RCET+LQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA
        SNC +RIEKNN K+ RR  K+SN+TQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+ LT+D P   P TT D LTIDTPA
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPA

Query:  IP----------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
        IP                PPTE                                  DIPT+DAPATP T+T MTLL+ KRRKRKKPSSKN+TEPESC A 
Subjt:  IP----------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS
        T+ G+K+E TSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLS

A0A6J1J5I6 uncharacterized protein LOC1114827971.8e-10263.16Show/hide
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MAK+KGNTKKGASNPTSGPQDSIT+RQEITGK KPKV NNVK YLNHLENLATWASG+ SIPSLAAFFGQRLA AAESLAV PDASLF CQRCETILQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCE-------NESLTMDVPKI-----LPS
        SNCS+RIEKNN KR RR+ K SN  QN V YYCH+CSCRN KRGTPKGHMK LYD  F  +VK VDVKDGK+CE        E LT+D PKI     +P 
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCE-------NESLTMDVPKI-----LPS

Query:  T--------------TRDSLTIDTPAIP---------------------------------PPTEDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQ
        T              T+  L I++PA P                                   T  +PTVDAPATP T TG+TLL+ K+RKR KPSSKNQ
Subjt:  T--------------TRDSLTIDTPAIP---------------------------------PPTEDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQ

Query:  TEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES
        TEP SC A TADGD++EGTSKR R RKSW SLKE+A+ NE+S
Subjt:  TEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.0e-1740.87Show/hide
Query:  IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEANLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFY--KHTRNDKVVVL
        +++   ++  ++Q+DVK AFLNG L+EE++M LP G   N   + VCKL K++YGLKQ+ R WFE F +A+    F  S VD  ++   K   N+ + VL
Subjt:  IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEANLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFY--KHTRNDKVVVL

Query:  IVYVDDIILTGNDET
        + YVDD+++   D T
Subjt:  IVYVDDIILTGNDET

P04146 Copia protein1.2e-0741.89Show/hide
Query:  KGIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPTKR--ESSLIEENFW
        KGI +  T   TPQ NGV+E   R + E AR ++    + K  WG+ VLTA YLINR+P++   +SS      W
Subjt:  KGIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPTKR--ESSLIEENFW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-2623.89Show/hide
Query:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPTKRESSLIEENFW-----------------------------
        GI H+ T   TPQ NGVAE  NR ++E  R+++    +PK  WG+ V TA YLINR P+   +  I E  W                             
Subjt:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPTKRESSLIEENFW-----------------------------

Query:  DTSPLPNI-------------------------------------------ISPEIMSPSPSIPSVENSPTGGETL-------------------QTDLT
        D   +P I                                           +   I+    +IPS  N+PT  E+                    Q D  
Subjt:  DTSPLPNI-------------------------------------------ISPEIMSPSPSIPSVENSPTGGETL-------------------QTDLT

Query:  VNDPENPNMSLS------------------PSSHNMLL----------DVFDLDISIAQRKVVMEEMNALKQSGTWGIVDLPEDK---------------
        V + E+P                       PS+  +L+          +V          K + EEM +L+++GT+ +V+LP+ K               
Subjt:  VNDPENPNMSLS------------------PSSHNMLL----------DVFDLDISIAQRKVVMEEMNALKQSGTWGIVDLPEDK---------------

Query:  ----------------------------------------IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEANLGINKVCKLKKSLYGLKQSP
                                                +++  + D  + QLDVK AFL+G+LEEE++M+ P GFE     + VCKL KSLYGLKQ+P
Subjt:  ----------------------------------------IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEANLGINKVCKLKKSLYGLKQSP

Query:  RAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVVLIVYVDDIILTGNDE
        R W+ +F   + S  + ++  D  +++K    +  ++L++YVDD+++ G D+
Subjt:  RAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVVLIVYVDDIILTGNDE

P25600 Putative transposon Ty5-1 protein YCL074W6.6e-0931.91Show/hide
Query:  LDVKNAFLNGELEEEVFMDLPLGFEANLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVVLIVYVDDIIL
        +DV  AFLN  ++E +++  P GF      + V +L   +YGLKQ+P  W E     +   GF + + +H ++++ T +D  + + VYVDD+++
Subjt:  LDVKNAFLNGELEEEVFMDLPLGFEANLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVVLIVYVDDIIL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.8e-2045.69Show/hide
Query:  KIAMGFNFD--WSLYQLDVKNAFLNGELEEEVFMDLPLGFEANLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVV
        +I +G   D  W + QLDV NAFL G L ++V+M  P GF      N VCKL+K+LYGLKQ+PRAW+      + + GF  S  D ++F    R   +V 
Subjt:  KIAMGFNFD--WSLYQLDVKNAFLNGELEEEVFMDLPLGFEANLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVV

Query:  LIVYVDDIILTGNDET
        ++VYVDDI++TGND T
Subjt:  LIVYVDDIILTGNDET

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.2e-0640.68Show/hide
Query:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPT
        GI H  +   TP+ NG++E K+RH++E    L+    +PK  W      A YLINR+PT
Subjt:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-1842.98Show/hide
Query:  KIAMGFNFD--WSLYQLDVKNAFLNGELEEEVFMDLPLGFEANLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVV
        +I +G   D  W + QLDV NAFL G L +EV+M  P GF      + VC+L+K++YGLKQ+PRAW+      + + GF  S  D ++F    R   ++ 
Subjt:  KIAMGFNFD--WSLYQLDVKNAFLNGELEEEVFMDLPLGFEANLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVV

Query:  LIVYVDDIILTGND
        ++VYVDDI++TGND
Subjt:  LIVYVDDIILTGND

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.2e-0642.37Show/hide
Query:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPT
        GI H  +   TP+ NG++E K+RH++E+   L+    VPK  W      A YLINR+PT
Subjt:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPT

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.8e-2348.28Show/hide
Query:  IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEANLG----INKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVV
        +A+   ++++L+QLD+ NAFLNG+L+EE++M LP G+ A  G     N VC LKKS+YGLKQ+ R WF +F   +  +GF QS  DHT F K T    + 
Subjt:  IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEANLG----INKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVV

Query:  VLIVYVDDIILTGNDE
        VL VYVDDII+  N++
Subjt:  VLIVYVDDIILTGNDE

AT5G41270.1 CONTAINS InterPro DOMAIN/s: RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).3.7e-3141.8Show/hide
Query:  HLENLATWAS-GQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPGSNCSVRIEK--NNVKRCRRRNKSSN---LTQNVVVYYCHYCSCRN
        HL+NLA W+S G   IPSLA+  G+RLAA  ES  +T D  L  CQRCETIL+PG NC+VRIEK   NVK+ R R K SN     QN VVY+C++CS RN
Subjt:  HLENLATWAS-GQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPGSNCSVRIEK--NNVKRCRRRNKSSN---LTQNVVVYYCHYCSCRN

Query:  RKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPAIPPPTEDIPTVDAPATPPTMTGMTLLNLKRRKR-KKPSSK
         KRGT KG MK LY  K     +S   K  K+       M +P+ + S   + L+    ++    E+    D P          +L L+R +R +KP SK
Subjt:  RKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPAIPPPTEDIPTVDAPATPPTMTGMTLLNLKRRKR-KKPSSK

Query:  NQTEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES
          +EP+S        +KT G S +++ +  W S+KEIA+ N+ S
Subjt:  NQTEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAGAAGAAGGGAAATACGAAGAAAGGAGCATCTAACCCCACATCCGGTCCTCAAGATTCGATCACTCTGAGGCAGGAAATTACTGGGAAGATCAAACCCAAAGT
CCCTAACAATGTCAAAGTTTATTTGAACCATTTAGAAAACTTAGCGACTTGGGCTAGTGGTCAACCGTCTATTCCTTCATTGGCTGCTTTCTTTGGGCAGCGCCTCGCAG
CTGCAGCAGAGTCTTTGGCGGTCACTCCCGACGCTTCTCTATTTCTCTGCCAGAGGTGTGAAACAATTCTTCAACCTGGTTCTAACTGCTCAGTACGAATAGAGAAGAAT
AACGTCAAGAGATGTCGGAGACGCAACAAATCAAGTAATTTGACACAGAACGTTGTGGTGTATTATTGCCACTACTGCTCATGTAGGAACAGAAAGAGAGGTACTCCCAA
AGGTCATATGAAAGGGCTTTATGACACGAAGTTTGTAAGCAAGGTGAAATCTGTGGATGTCAAGGATGGTAAACAATGTGAAAACGAGAGTCTTACCATGGATGTTCCTA
AAATTCTTCCTTCTACAACCAGAGACAGTCTAACTATTGATACTCCTGCAATTCCTCCTCCAACTGAGGACATTCCCACGGTAGATGCTCCTGCAACTCCTCCAACCATG
ACCGGAATGACTCTGTTGAATTTGAAGAGGAGAAAGCGGAAGAAACCATCATCTAAGAATCAAACTGAACCTGAAAGTTGTTTGGCTTCAACAGCAGATGGGGACAAAAC
TGAAGGCACATCCAAAAGGAAGCGTAATAGAAAATCATGGGCAAGTTTGAAGGAAATTGCTAAGAGGAATGAAGAGAGTGATCTTCGTCGGACGTCTCACCTCAGTCACC
GAACGACGCCCAGGCTCCTAGTTGATGCTAGACCGCCGGTTGTTTATGGATCTCGTGAGCGCAGAAGAAAGCCGTGTGCCCTAGCCGTCACCGACGCCGACGCTGCTGTG
ACCTGGGTTCACTCCTTCTCTGCCGGCCGACCAAGTCTACACCATTGGCCGAGGGATTCCCGGCCAGTTCCAGCGATTCTCTGGTGGTTCCTTATTCTATCGCTGAGTGG
AAGTTTTTTGGGAATTATGGGACATGTGACTCAAATGTATTCTGATTTGGGTAACCAGTCACAAGTGTTCGAGGTGAATCTTAAGTTGGGTGATATACAACAAGGAGGTA
ACTCAGTTACACAATATTCTCACTCTATGAAGAGGATGTGGCAAGAACTTGATCTATTTGATACGTATGAGTGGAAGTCCACAGACGACCAAAAACATTATCGGAAAACT
GTTGAAGATGATCGCATTTACAAATTCCTTGCTGGCCTCAATGTTGAGTTTGATAAGGGCATCTTTCATCAAGCTACATGTCGCGATACTCCTCAGCAAAATGGTGTTGC
TGAATGGAAAAATCGACATTTGCTTGAAGTTGCTCGTGCCCTTATGTTTTCTATACATGTTCCAAAATATTTATGGGGGGATGTTGTTCTAACCGCTGCTTACCTAATCA
ATAGAATGCCTACTAAGAGGGAGTCATCTCTTATTGAAGAGAATTTTTGGGACACTTCACCTCTCCCAAACATCATTAGTCCTGAAATTATGAGCCCTAGTCCTTCGATC
CCAAGTGTGGAAAATTCTCCGACAGGGGGAGAAACACTACAAACAGATCTGACAGTGAATGATCCTGAAAATCCGAATATGTCTCTTAGTCCTTCCTCTCATAATATGTT
GCTTGATGTCTTTGATCTTGATATTTCAATTGCCCAGAGAAAAGTAGTGATGGAAGAGATGAACGCGCTGAAACAAAGTGGTACTTGGGGCATAGTTGATCTACCAGAAG
ACAAGATAGCAATGGGATTTAATTTTGATTGGTCACTTTATCAACTTGATGTTAAAAATGCATTTCTTAATGGGGAACTTGAAGAAGAAGTATTTATGGATTTACCACTT
GGTTTTGAAGCTAACCTTGGGATTAACAAGGTATGTAAATTAAAAAAATCACTATACGGCCTTAAACAGTCTCCTAGAGCTTGGTTTGAACGTTTTGGAAAGGCAGTCAC
AAGCTATGGATTCAGCCAAAGTCAAGTCGATCACACTATGTTCTACAAGCATACGAGAAATGACAAGGTTGTTGTTCTGATAGTGTATGTCGATGATATCATTCTTACAG
GCAATGATGAGACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAGAAGAAGGGAAATACGAAGAAAGGAGCATCTAACCCCACATCCGGTCCTCAAGATTCGATCACTCTGAGGCAGGAAATTACTGGGAAGATCAAACCCAAAGT
CCCTAACAATGTCAAAGTTTATTTGAACCATTTAGAAAACTTAGCGACTTGGGCTAGTGGTCAACCGTCTATTCCTTCATTGGCTGCTTTCTTTGGGCAGCGCCTCGCAG
CTGCAGCAGAGTCTTTGGCGGTCACTCCCGACGCTTCTCTATTTCTCTGCCAGAGGTGTGAAACAATTCTTCAACCTGGTTCTAACTGCTCAGTACGAATAGAGAAGAAT
AACGTCAAGAGATGTCGGAGACGCAACAAATCAAGTAATTTGACACAGAACGTTGTGGTGTATTATTGCCACTACTGCTCATGTAGGAACAGAAAGAGAGGTACTCCCAA
AGGTCATATGAAAGGGCTTTATGACACGAAGTTTGTAAGCAAGGTGAAATCTGTGGATGTCAAGGATGGTAAACAATGTGAAAACGAGAGTCTTACCATGGATGTTCCTA
AAATTCTTCCTTCTACAACCAGAGACAGTCTAACTATTGATACTCCTGCAATTCCTCCTCCAACTGAGGACATTCCCACGGTAGATGCTCCTGCAACTCCTCCAACCATG
ACCGGAATGACTCTGTTGAATTTGAAGAGGAGAAAGCGGAAGAAACCATCATCTAAGAATCAAACTGAACCTGAAAGTTGTTTGGCTTCAACAGCAGATGGGGACAAAAC
TGAAGGCACATCCAAAAGGAAGCGTAATAGAAAATCATGGGCAAGTTTGAAGGAAATTGCTAAGAGGAATGAAGAGAGTGATCTTCGTCGGACGTCTCACCTCAGTCACC
GAACGACGCCCAGGCTCCTAGTTGATGCTAGACCGCCGGTTGTTTATGGATCTCGTGAGCGCAGAAGAAAGCCGTGTGCCCTAGCCGTCACCGACGCCGACGCTGCTGTG
ACCTGGGTTCACTCCTTCTCTGCCGGCCGACCAAGTCTACACCATTGGCCGAGGGATTCCCGGCCAGTTCCAGCGATTCTCTGGTGGTTCCTTATTCTATCGCTGAGTGG
AAGTTTTTTGGGAATTATGGGACATGTGACTCAAATGTATTCTGATTTGGGTAACCAGTCACAAGTGTTCGAGGTGAATCTTAAGTTGGGTGATATACAACAAGGAGGTA
ACTCAGTTACACAATATTCTCACTCTATGAAGAGGATGTGGCAAGAACTTGATCTATTTGATACGTATGAGTGGAAGTCCACAGACGACCAAAAACATTATCGGAAAACT
GTTGAAGATGATCGCATTTACAAATTCCTTGCTGGCCTCAATGTTGAGTTTGATAAGGGCATCTTTCATCAAGCTACATGTCGCGATACTCCTCAGCAAAATGGTGTTGC
TGAATGGAAAAATCGACATTTGCTTGAAGTTGCTCGTGCCCTTATGTTTTCTATACATGTTCCAAAATATTTATGGGGGGATGTTGTTCTAACCGCTGCTTACCTAATCA
ATAGAATGCCTACTAAGAGGGAGTCATCTCTTATTGAAGAGAATTTTTGGGACACTTCACCTCTCCCAAACATCATTAGTCCTGAAATTATGAGCCCTAGTCCTTCGATC
CCAAGTGTGGAAAATTCTCCGACAGGGGGAGAAACACTACAAACAGATCTGACAGTGAATGATCCTGAAAATCCGAATATGTCTCTTAGTCCTTCCTCTCATAATATGTT
GCTTGATGTCTTTGATCTTGATATTTCAATTGCCCAGAGAAAAGTAGTGATGGAAGAGATGAACGCGCTGAAACAAAGTGGTACTTGGGGCATAGTTGATCTACCAGAAG
ACAAGATAGCAATGGGATTTAATTTTGATTGGTCACTTTATCAACTTGATGTTAAAAATGCATTTCTTAATGGGGAACTTGAAGAAGAAGTATTTATGGATTTACCACTT
GGTTTTGAAGCTAACCTTGGGATTAACAAGGTATGTAAATTAAAAAAATCACTATACGGCCTTAAACAGTCTCCTAGAGCTTGGTTTGAACGTTTTGGAAAGGCAGTCAC
AAGCTATGGATTCAGCCAAAGTCAAGTCGATCACACTATGTTCTACAAGCATACGAGAAATGACAAGGTTGTTGTTCTGATAGTGTATGTCGATGATATCATTCTTACAG
GCAATGATGAGACATGA
Protein sequenceShow/hide protein sequence
MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPGSNCSVRIEKN
NVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPAIPPPTEDIPTVDAPATPPTM
TGMTLLNLKRRKRKKPSSKNQTEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSHLSHRTTPRLLVDARPPVVYGSRERRRKPCALAVTDADAAV
TWVHSFSAGRPSLHHWPRDSRPVPAILWWFLILSLSGSFLGIMGHVTQMYSDLGNQSQVFEVNLKLGDIQQGGNSVTQYSHSMKRMWQELDLFDTYEWKSTDDQKHYRKT
VEDDRIYKFLAGLNVEFDKGIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPTKRESSLIEENFWDTSPLPNIISPEIMSPSPSI
PSVENSPTGGETLQTDLTVNDPENPNMSLSPSSHNMLLDVFDLDISIAQRKVVMEEMNALKQSGTWGIVDLPEDKIAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPL
GFEANLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVVLIVYVDDIILTGNDET