CuGenDBv2

ClCG07G008080 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG07G008080
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: CG_Chr07:21561836..21572517
RNA-Seq Expression: ClCG07G008080
Synteny: ClCG07G008080
Gene Ontology terms:
  GO:0006396 - RNA processing (biological process)
  GO:0032991 - protein-containing complex (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR007175 - Ribonuclease P subunit, Rpr2/Snm1/Rpp21
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR036397 - Ribonuclease H superfamily
  IPR043502 - DNA/RNA polymerase superfamily
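
The genome location in this record uses a `seqid:start..end` form. As a minimal sketch, the following Python splits such a string into its parts and computes the length of the span. The names `Location` and `parse_location` are hypothetical helpers introduced here, and the span calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the record).
        return self.end - self.start + 1


# Matches strings such as "CG_Chr07:21561836..21572517".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("CG_Chr07:21561836..21572517")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 10682 bases on CG_Chr07.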


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044832.1 Rpr2 domain-containing protein [Cucumis melo var. makuwa] (e value: 2.0e-108; % identity: 65.2)
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KKGNTK+G+SNPTSGPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV PD SLFLC RCET+LQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDT--
        SNC +RIEKNN K+  R  K+SN+TQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+ LTMD P   P TT D LTIDT  
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDT--

Query:  ------------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
                          PPTE                                  DIPT+DAPATP T+T MTLL+ KRRKRKKPSSKN+TEPESC A 
Subjt:  ------------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS
        T+ G+K+E TSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS

XP_004146565.1 uncharacterized protein LOC101220608 [Cucumis sativus] (e value: 1.7e-107; % identity: 64.91)
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KK NT +G+SNP  GPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV+PD SLFLC RCETILQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTP-
        SNC++RIEKN  K+ RR  K SNLTQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+  TMD PKI P TT D LTIDTP 
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTP-

Query:  -------------------PT----------------------------------EDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
                           PT                                   DIPT+DAPATP T+TGMTLL+ KRRKRKKPSSKNQTEPESC   
Subjt:  -------------------PT----------------------------------EDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS
        T+ G+ +EGTSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS

XP_008452026.1 PREDICTED: uncharacterized protein LOC103493157 [Cucumis melo] (e value: 9.1e-109; % identity: 65.2)
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KKGNTK+G+SNPTSGPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV PD SLFLC RCET+LQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDT--
        SNC +RIEKNN K+ RR  K+SN+TQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+ LT+D P   P TT D LTIDT  
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDT--

Query:  ------------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
                          PPTE                                  DIPT+DAPATP T+T MTLL+ KRRKRKKPSSKN+TEPESC A 
Subjt:  ------------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS
        T+ G+K+E TSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS

XP_022984531.1 uncharacterized protein LOC111482797 [Cucurbita maxima] (e value: 1.8e-101; % identity: 62.57)
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MAK+KGNTKKGASNPTSGPQDSIT+RQEITGK KPKV NNVK YLNHLENLATWASG+ SIPSLAAFFGQRLA AAESLAV PDASLF CQRCETILQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCE-------NESLTMDVPKI-----LPS
        SNCS+RIEKNN KR RR+ K SN  QN V YYCH+CSCRN KRGTPKGHMK LYD  F  +VK VDVKDGK+CE        E LT+D PKI     +P 
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCE-------NESLTMDVPKI-----LPS

Query:  T--------------TRDSLTIDTPPTED-------------------------------------IPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQ
        T              T+  L I++P T                                       +PTVDAPATP T TG+TLL+ K+RKR KPSSKNQ
Subjt:  T--------------TRDSLTIDTPPTED-------------------------------------IPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQ

Query:  TEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES
        TEP SC A TADGD++EGTSKR R RKSW SLKE+A+ NE+S
Subjt:  TEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES

XP_038906436.1 uncharacterized protein LOC120092350 isoform X1 [Benincasa hispida] (e value: 4.1e-109; % identity: 66.67)
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MAKKKGN KKG+SNPTSGPQDSITLRQEITGKIKPKV NNVKVYLNHLENLATWA GQPSIPSLA FFGQRLAAAAESLAV PDASLFLCQRCETILQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENE--------SLTMDVPKILPSTTRD
        SNCS+RIEKNN KR R+ NK SNLTQNVV YYCHYCSCRN KRGTPKGHMK LY T+FVSK+KSV V+DGK+CEN+         LT+D P I PSTT +
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENE--------SLTMDVPKILPSTTRD

Query:  SLTIDT---PPT-----------------------------------------------------------EDIPTVDAPATPPTMTGMTLLNLKRRKRK
           IDT   PPT                                                            DIPTVDAPATPPTMTG+TLL+ KRRKRK
Subjt:  SLTIDT---PPT-----------------------------------------------------------EDIPTVDAPATPPTMTGMTLLNLKRRKRK

Query:  KPSSKNQTEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEE
        KPSSKNQTEPES  A T  GDKT G SKRKRNRKSW SLKEIA+R+EE
Subjt:  KPSSKNQTEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KUP1 Uncharacterized protein (e value: 8.3e-108; % identity: 64.91)
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KK NT +G+SNP  GPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV+PD SLFLC RCETILQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTP-
        SNC++RIEKN  K+ RR  K SNLTQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+  TMD PKI P TT D LTIDTP 
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTP-

Query:  -------------------PT----------------------------------EDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
                           PT                                   DIPT+DAPATP T+TGMTLL+ KRRKRKKPSSKNQTEPESC   
Subjt:  -------------------PT----------------------------------EDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS
        T+ G+ +EGTSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS

A0A1S3BU13 uncharacterized protein LOC103493157 (e value: 4.4e-109; % identity: 65.2)
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KKGNTK+G+SNPTSGPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV PD SLFLC RCET+LQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDT--
        SNC +RIEKNN K+ RR  K+SN+TQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+ LT+D P   P TT D LTIDT  
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDT--

Query:  ------------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
                          PPTE                                  DIPT+DAPATP T+T MTLL+ KRRKRKKPSSKN+TEPESC A 
Subjt:  ------------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS
        T+ G+K+E TSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS

A0A5A7TNC0 Rpr2 domain-containing protein (e value: 9.8e-109; % identity: 65.2)
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KKGNTK+G+SNPTSGPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV PD SLFLC RCET+LQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDT--
        SNC +RIEKNN K+  R  K+SN+TQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+ LTMD P   P TT D LTIDT  
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDT--

Query:  ------------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
                          PPTE                                  DIPT+DAPATP T+T MTLL+ KRRKRKKPSSKN+TEPESC A 
Subjt:  ------------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS
        T+ G+K+E TSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS

A0A5D3CYJ3 Rpr2 domain-containing protein (e value: 4.4e-109; % identity: 65.2)
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MA+KKGNTK+G+SNPTSGPQ+SITLRQE TGKIKPKV NN KVYLNHLENLATWASGQPS+PSLAAFFGQRLAAAAESLAV PD SLFLC RCET+LQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDT--
        SNC +RIEKNN K+ RR  K+SN+TQNVV YYCHYCSCRN KRGTPKGHMK LY T+ VSKVKSV VKDGK+CEN+ LT+D P   P TT D LTIDT  
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDT--

Query:  ------------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS
                          PPTE                                  DIPT+DAPATP T+T MTLL+ KRRKRKKPSSKN+TEPESC A 
Subjt:  ------------------PPTE----------------------------------DIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEPESCLAS

Query:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS
        T+ G+K+E TSKRKRNRKSW SLKEIA+R EE   +  + L+
Subjt:  TADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLS

A0A6J1J5I6 uncharacterized protein LOC111482797 (e value: 8.9e-102; % identity: 62.57)
Query:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG
        MAK+KGNTKKGASNPTSGPQDSIT+RQEITGK KPKV NNVK YLNHLENLATWASG+ SIPSLAAFFGQRLA AAESLAV PDASLF CQRCETILQPG
Subjt:  MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPG

Query:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCE-------NESLTMDVPKI-----LPS
        SNCS+RIEKNN KR RR+ K SN  QN V YYCH+CSCRN KRGTPKGHMK LYD  F  +VK VDVKDGK+CE        E LT+D PKI     +P 
Subjt:  SNCSVRIEKNNVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCE-------NESLTMDVPKI-----LPS

Query:  T--------------TRDSLTIDTPPTED-------------------------------------IPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQ
        T              T+  L I++P T                                       +PTVDAPATP T TG+TLL+ K+RKR KPSSKNQ
Subjt:  T--------------TRDSLTIDTPPTED-------------------------------------IPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQ

Query:  TEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES
        TEP SC A TADGD++EGTSKR R RKSW SLKE+A+ NE+S
Subjt:  TEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 1.5e-16; % identity: 40)
Query:  IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEADLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFY--KHTRNDKVVVL
        +++   ++  ++Q+DVK AFLNG L+EE++M LP G   +   + VCKL K++YGLKQ+ R WFE F +A+    F  S VD  ++   K   N+ + VL
Subjt:  IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEADLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFY--KHTRNDKVVVL

Query:  IVYVDDIILTGNDET
        + YVDD+++   D T
Subjt:  IVYVDDIILTGNDET

P04146 Copia protein (e value: 1.2e-07; % identity: 41.89)
Query:  KGIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPTKR--ESSLIEENFW
        KGI +  T   TPQ NGV+E   R + E AR ++    + K  WG+ VLTA YLINR+P++   +SS      W
Subjt:  KGIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPTKR--ESSLIEENFW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.8e-20; % identity: 40.18)
Query:  IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEADLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVVLIV
        +++  + D  + QLDVK AFL+G+LEEE++M+ P GFE     + VCKL KSLYGLKQ+PR W+ +F   + S  + ++  D  +++K    +  ++L++
Subjt:  IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEADLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVVLIV

Query:  YVDDIILTGNDE
        YVDD+++ G D+
Subjt:  YVDDIILTGNDE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.0e-09; % identity: 45.07)
Query:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPTKRESSLIEENFW
        GI H+ T   TPQ NGVAE  NR ++E  R+++    +PK  WG+ V TA YLINR P+   +  I E  W
Subjt:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPTKRESSLIEENFW

P25600 Putative transposon Ty5-1 protein YCL074W (e value: 3.8e-09; % identity: 31.91)
Query:  LDVKNAFLNGELEEEVFMDLPLGFEADLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVVLIVYVDDIIL
        +DV  AFLN  ++E +++  P GF  +   + V +L   +YGLKQ+P  W E     +   GF + + +H ++++ T +D  + + VYVDD+++
Subjt:  LDVKNAFLNGELEEEVFMDLPLGFEADLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVVLIVYVDDIIL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 4.8e-20; % identity: 45.69)
Query:  KIAMGFNFD--WSLYQLDVKNAFLNGELEEEVFMDLPLGFEADLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVV
        +I +G   D  W + QLDV NAFL G L ++V+M  P GF      N VCKL+K+LYGLKQ+PRAW+      + + GF  S  D ++F    R   +V 
Subjt:  KIAMGFNFD--WSLYQLDVKNAFLNGELEEEVFMDLPLGFEADLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVV

Query:  LIVYVDDIILTGNDET
        ++VYVDDI++TGND T
Subjt:  LIVYVDDIILTGNDET

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 5.2e-06; % identity: 40.68)
Query:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPT
        GI H  +   TP+ NG++E K+RH++E    L+    +PK  W      A YLINR+PT
Subjt:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.6e-18; % identity: 42.98)
Query:  KIAMGFNFD--WSLYQLDVKNAFLNGELEEEVFMDLPLGFEADLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVV
        +I +G   D  W + QLDV NAFL G L +EV+M  P GF      + VC+L+K++YGLKQ+PRAW+      + + GF  S  D ++F    R   ++ 
Subjt:  KIAMGFNFD--WSLYQLDVKNAFLNGELEEEVFMDLPLGFEADLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVV

Query:  LIVYVDDIILTGND
        ++VYVDDI++TGND
Subjt:  LIVYVDDIILTGND

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 5.2e-06; % identity: 42.37)
Query:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPT
        GI H  +   TP+ NG++E K+RH++E+   L+    VPK  W      A YLINR+PT
Subjt:  GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPT

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 3.7e-23; % identity: 48.28)
Query:  IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEA----DLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVV
        +A+   ++++L+QLD+ NAFLNG+L+EE++M LP G+ A     L  N VC LKKS+YGLKQ+ R WF +F   +  +GF QS  DHT F K T    + 
Subjt:  IAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEA----DLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVV

Query:  VLIVYVDDIILTGNDE
        VL VYVDDII+  N++
Subjt:  VLIVYVDDIILTGNDE

AT5G41270.1 CONTAINS InterPro DOMAIN/s: RNAse P, Rpr2/Rpp21 subunit (InterPro:IPR007175); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). (e value: 9.6e-32; % identity: 41.84)
Query:  HLENLATWAS-GQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPGSNCSVRIEK--NNVKRCRRRNKSSN---LTQNVVVYYCHYCSCRN
        HL+NLA W+S G   IPSLA+  G+RLAA  ES  +T D  L  CQRCETIL+PG NC+VRIEK   NVK+ R R K SN     QN VVY+C++CS RN
Subjt:  HLENLATWAS-GQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPGSNCSVRIEK--NNVKRCRRRNKSSN---LTQNVVVYYCHYCSCRN

Query:  RKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPPTEDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEP
         KRGT KG MK LY  K     +S   K  K+       M +P+ + S       + +P       V+  +   T   M L   + R+ +KP SK  +EP
Subjt:  RKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPPTEDIPTVDAPATPPTMTGMTLLNLKRRKRKKPSSKNQTEP

Query:  ESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES
        +S        +KT G S +++ +  W S+KEIA+ N+ S
Subjt:  ESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEES
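
The % identity figures in the hit tables can be related to the alignment blocks by counting matches in the middle line of each block. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), which this page does not state explicitly. The function name `percent_identity` is a hypothetical helper. It handles a single alignment block; for hits that span several blocks the counts would have to be summed over all blocks first.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block.

    Assumes a BLAST-style midline: letters mark identical residues,
    '+' marks conservative substitutions, spaces mark mismatches or gaps.
    The denominator is the number of alignment columns (query line length,
    gap characters included).
    """
    if not query:
        raise ValueError("query line is empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Single-block alignment against Q94HW2 (second entry), copied from this record.
query = "GIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPT"
midline = "GI H  +   TP+ NG++E K+RH++E    L+    +PK  W      A YLINR+PT"
print(round(percent_identity(query, midline), 2))  # 24 identical of 59 columns
```

For this block the result rounds to 40.68, which agrees with the % identity listed for that hit.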


Sequences
CDS sequence
ATGGCAAAGAAGAAGGGAAATACGAAGAAAGGAGCATCTAACCCCACATCCGGTCCTCAAGATTCGATCACTCTGAGGCAGGAAATTACTGGGAAGATCAAACCCAAAGT
CCCTAACAATGTCAAAGTTTATTTGAACCATTTAGAAAACTTAGCGACTTGGGCTAGTGGTCAACCGTCTATTCCTTCATTGGCTGCTTTCTTTGGGCAGCGCCTCGCAG
CTGCAGCAGAGTCTTTGGCGGTCACTCCCGACGCTTCTCTATTTCTCTGCCAGAGGTGTGAAACAATTCTTCAACCTGGTTCTAACTGCTCAGTACGAATAGAGAAGAAT
AACGTCAAGAGATGTCGGAGACGCAACAAATCAAGTAATTTGACACAGAACGTTGTGGTGTATTATTGCCACTACTGCTCATGTAGGAACAGAAAGAGAGGTACTCCCAA
AGGTCATATGAAAGGGCTTTATGACACGAAGTTTGTAAGCAAGGTGAAATCTGTGGATGTCAAGGATGGTAAACAATGTGAAAACGAGAGTCTTACCATGGATGTTCCTA
AAATTCTTCCTTCTACAACCAGAGACAGTCTAACTATTGATACTCCTCCAACTGAGGACATTCCCACGGTAGATGCTCCTGCAACTCCTCCAACCATGACCGGAATGACT
CTGTTGAATTTGAAGAGGAGAAAGCGGAAGAAACCATCATCTAAGAATCAAACTGAACCTGAAAGTTGTTTGGCTTCAACAGCAGATGGGGACAAAACTGAAGGCACATC
CAAAAGGAAGCGTAATAGAAAATCATGGGCAAGTTTGAAGGAAATTGCTAAGAGGAATGAAGAGAGTGATCTTCGTCGGACGTCTCGCCTCAGTCACCGAACGACGCCCA
GGCTCCTAGTTGATGCTAGACCGCCGGTTGTTTATGGATCTCGTGAGCGCAGAAGAAAGCCGTGTGCCCTAGCCGTCACCGACGCCGACGCTGCTGTGACCTGGGTTCAC
TCCTTCTCTGCCGGCCGACCAAGTCTACACCATTGGCCGAGGGATTCCCGGCCAGTTCCAGCGATTCTCTGGTGGTTCCTTATTCTATCGCTGAGTGGAAGTTTTTTGGG
AATTATGGGACATGTGACTCAAATGTATTCTGATTTGGGTAACCAGTCACAAGTGTTCGAGGTGAATCTTAAGTTGGGTGATATACAACAAGGAGGTAACTCAGTTACAC
AATATTCTCACTCTATGAAGAGGATGTGGCAAGAACTTGATCTATTTGATACGTATGAGTGGAAGTCCACAGACGACCAAAAACATTATCGGAAAACTGTTGAAGATGAT
CGCATTTACAAATTCCTTGCTGGCCTCAATGTTGAGTTTGATAAGGGCATCTTTCATCAAGCTACATGTCGCGATACTCCTCAGCAAAATGGTGTTGCTGAATGGAAAAA
TCGACATTTGCTTGAAGTTGCTCGTGCCCTTATGTTTTCTATACATGTTCCAAAATATTTATGGGGGGATGTTGTTCTAACCGCTGCTTACCTAATCAATAGAATGCCTA
CTAAGAGGGAGTCATCTCTTATTGAAGAGAATTTTTGGGACACTTCACCTCTCCCAAACATCATTAGTCCTGAAATTATGAGCCCTAGTCCTTCGATCCCAAGTGTGGAA
AATTCTCCGACAGGGGGAGAAACACTACAAACAGATCTGACAGTGAATGATCCTGAAAATCCGAATATGTCTCTTAGTCCTTCCTCTCATAATATGTTGCTTGATGTCTT
TGATCTTGATATTTCAATTGCCCAGAGAAAAGTAGTGATGGAAGAGATGAACGCGCTGAAACAAAGTGGTACTTGGGGCATAGTTGATCTACCAGAAGACAAGATAGCAA
TGGGATTTAATTTTGATTGGTCACTTTATCAACTTGATGTTAAAAATGCATTTCTTAATGGGGAACTTGAAGAAGAAGTATTTATGGATTTACCACTTGGTTTTGAAGCT
GACCTTGGGATTAACAAGGTATGTAAATTAAAAAAATCACTATACGGCCTTAAACAGTCTCCTAGAGCTTGGTTTGAACGTTTTGGAAAGGCAGTCACAAGCTATGGATT
CAGCCAAAGTCAAGTCGATCACACTATGTTCTACAAGCATACGAGAAATGACAAGGTTGTTGTTCTGATAGTGTATGTCGATGATATCATTCTTACAGGCAATGATGAGA
CATGA
mRNA sequence
ATGGCAAAGAAGAAGGGAAATACGAAGAAAGGAGCATCTAACCCCACATCCGGTCCTCAAGATTCGATCACTCTGAGGCAGGAAATTACTGGGAAGATCAAACCCAAAGT
CCCTAACAATGTCAAAGTTTATTTGAACCATTTAGAAAACTTAGCGACTTGGGCTAGTGGTCAACCGTCTATTCCTTCATTGGCTGCTTTCTTTGGGCAGCGCCTCGCAG
CTGCAGCAGAGTCTTTGGCGGTCACTCCCGACGCTTCTCTATTTCTCTGCCAGAGGTGTGAAACAATTCTTCAACCTGGTTCTAACTGCTCAGTACGAATAGAGAAGAAT
AACGTCAAGAGATGTCGGAGACGCAACAAATCAAGTAATTTGACACAGAACGTTGTGGTGTATTATTGCCACTACTGCTCATGTAGGAACAGAAAGAGAGGTACTCCCAA
AGGTCATATGAAAGGGCTTTATGACACGAAGTTTGTAAGCAAGGTGAAATCTGTGGATGTCAAGGATGGTAAACAATGTGAAAACGAGAGTCTTACCATGGATGTTCCTA
AAATTCTTCCTTCTACAACCAGAGACAGTCTAACTATTGATACTCCTCCAACTGAGGACATTCCCACGGTAGATGCTCCTGCAACTCCTCCAACCATGACCGGAATGACT
CTGTTGAATTTGAAGAGGAGAAAGCGGAAGAAACCATCATCTAAGAATCAAACTGAACCTGAAAGTTGTTTGGCTTCAACAGCAGATGGGGACAAAACTGAAGGCACATC
CAAAAGGAAGCGTAATAGAAAATCATGGGCAAGTTTGAAGGAAATTGCTAAGAGGAATGAAGAGAGTGATCTTCGTCGGACGTCTCGCCTCAGTCACCGAACGACGCCCA
GGCTCCTAGTTGATGCTAGACCGCCGGTTGTTTATGGATCTCGTGAGCGCAGAAGAAAGCCGTGTGCCCTAGCCGTCACCGACGCCGACGCTGCTGTGACCTGGGTTCAC
TCCTTCTCTGCCGGCCGACCAAGTCTACACCATTGGCCGAGGGATTCCCGGCCAGTTCCAGCGATTCTCTGGTGGTTCCTTATTCTATCGCTGAGTGGAAGTTTTTTGGG
AATTATGGGACATGTGACTCAAATGTATTCTGATTTGGGTAACCAGTCACAAGTGTTCGAGGTGAATCTTAAGTTGGGTGATATACAACAAGGAGGTAACTCAGTTACAC
AATATTCTCACTCTATGAAGAGGATGTGGCAAGAACTTGATCTATTTGATACGTATGAGTGGAAGTCCACAGACGACCAAAAACATTATCGGAAAACTGTTGAAGATGAT
CGCATTTACAAATTCCTTGCTGGCCTCAATGTTGAGTTTGATAAGGGCATCTTTCATCAAGCTACATGTCGCGATACTCCTCAGCAAAATGGTGTTGCTGAATGGAAAAA
TCGACATTTGCTTGAAGTTGCTCGTGCCCTTATGTTTTCTATACATGTTCCAAAATATTTATGGGGGGATGTTGTTCTAACCGCTGCTTACCTAATCAATAGAATGCCTA
CTAAGAGGGAGTCATCTCTTATTGAAGAGAATTTTTGGGACACTTCACCTCTCCCAAACATCATTAGTCCTGAAATTATGAGCCCTAGTCCTTCGATCCCAAGTGTGGAA
AATTCTCCGACAGGGGGAGAAACACTACAAACAGATCTGACAGTGAATGATCCTGAAAATCCGAATATGTCTCTTAGTCCTTCCTCTCATAATATGTTGCTTGATGTCTT
TGATCTTGATATTTCAATTGCCCAGAGAAAAGTAGTGATGGAAGAGATGAACGCGCTGAAACAAAGTGGTACTTGGGGCATAGTTGATCTACCAGAAGACAAGATAGCAA
TGGGATTTAATTTTGATTGGTCACTTTATCAACTTGATGTTAAAAATGCATTTCTTAATGGGGAACTTGAAGAAGAAGTATTTATGGATTTACCACTTGGTTTTGAAGCT
GACCTTGGGATTAACAAGGTATGTAAATTAAAAAAATCACTATACGGCCTTAAACAGTCTCCTAGAGCTTGGTTTGAACGTTTTGGAAAGGCAGTCACAAGCTATGGATT
CAGCCAAAGTCAAGTCGATCACACTATGTTCTACAAGCATACGAGAAATGACAAGGTTGTTGTTCTGATAGTGTATGTCGATGATATCATTCTTACAGGCAATGATGAGA
CATGA
Protein sequence
MAKKKGNTKKGASNPTSGPQDSITLRQEITGKIKPKVPNNVKVYLNHLENLATWASGQPSIPSLAAFFGQRLAAAAESLAVTPDASLFLCQRCETILQPGSNCSVRIEKN
NVKRCRRRNKSSNLTQNVVVYYCHYCSCRNRKRGTPKGHMKGLYDTKFVSKVKSVDVKDGKQCENESLTMDVPKILPSTTRDSLTIDTPPTEDIPTVDAPATPPTMTGMT
LLNLKRRKRKKPSSKNQTEPESCLASTADGDKTEGTSKRKRNRKSWASLKEIAKRNEESDLRRTSRLSHRTTPRLLVDARPPVVYGSRERRRKPCALAVTDADAAVTWVH
SFSAGRPSLHHWPRDSRPVPAILWWFLILSLSGSFLGIMGHVTQMYSDLGNQSQVFEVNLKLGDIQQGGNSVTQYSHSMKRMWQELDLFDTYEWKSTDDQKHYRKTVEDD
RIYKFLAGLNVEFDKGIFHQATCRDTPQQNGVAEWKNRHLLEVARALMFSIHVPKYLWGDVVLTAAYLINRMPTKRESSLIEENFWDTSPLPNIISPEIMSPSPSIPSVE
NSPTGGETLQTDLTVNDPENPNMSLSPSSHNMLLDVFDLDISIAQRKVVMEEMNALKQSGTWGIVDLPEDKIAMGFNFDWSLYQLDVKNAFLNGELEEEVFMDLPLGFEA
DLGINKVCKLKKSLYGLKQSPRAWFERFGKAVTSYGFSQSQVDHTMFYKHTRNDKVVVLIVYVDDIILTGNDET
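
The protein sequence corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that step, assuming the standard nuclear genetic code and an unambiguous A/C/G/T sequence. The names `CODON_TABLE` and `translate` are hypothetical and not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored so a wrapped sequence can be
    pasted in directly. Ambiguous bases (e.g. N) raise KeyError in this sketch.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS in this record.
print(translate("ATGGCAAAGAAGAAGGGA"))
print(translate("GATGAGACATGA"))
```

The two fragments give `MAKKKG` and `DET`, matching the start and end of the protein sequence listed in this record.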