; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh20G010310 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh20G010310
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationCma_Chr20:7525317..7528411
RNA-Seq ExpressionCmaCh20G010310
SyntenyCmaCh20G010310
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN68148.1 hypothetical protein VITISV_035665 [Vitis vinifera]7.2e-5460Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        ALERN+TW ++PLPP HK IGC WVYKIK++SDG+IERYKARLVAKGYTQV G+DY+ETFSPTA+LTTL CLLT+ A+R W+ HQL+V NAFLHGNL EE
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT
        VYM+ P                                       GY QS  DYSLFTKS+G  FTA LIYVDDILLT NDL EI+ LKT
Subjt:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT

KAD4180157.1 hypothetical protein E3N88_28748 [Mikania micrantha]7.2e-5461.9Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        AL+ N+TWSL+PLP  HK IGC WVYKIK+NSDG+IERYKARLVAKGYTQVEG+DYKETFSPTA+LTTL CLLT+ AAR WF HQL+VQNAFLHG+L E 
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLK
        VYM+ P                                       GYTQS  DYSLFTK++G SFTA LIYVDDILLT NDL EI+ LK
Subjt:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLK

RVW14960.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.1e-5469.62Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        ALERN+TW ++PLPP HK IGC WVYKIK++ DG+IE YKARLVAKGYTQV G+DY+ETFSPT +LTTL CLLT+ A+R W+ HQL+V NAFLHGNL EE
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLP------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT
        VYM+ P       GY QS  DYSLFTKS+G  FTA LIYVDDILL  NDL EI+ LKT
Subjt:  VYMSLP------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT

RVW70215.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]7.2e-5460Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        ALERN+TW ++PLPP HK IGC WVYKIK++SDG+IERYKARLVAKGYTQV G+DY+ETFSPTA+LTTL CLLT+ A+R W+ HQL+V NAFLHGNL EE
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT
        VYM+ P                                       GY QS  DYSLFTKS+G  FTA LIYVDDILLT NDL EI+ LKT
Subjt:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT

RZB68903.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja]3.2e-5470.13Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        ALE+N+TWSL+PLP  HK IGC WVYKIK+ SDG+IERYKARLVAKGYTQVEG+DY+ETFSPTA++TTL CLLT+ AAR WF HQL+VQ+AFLHG+L E 
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLPLGYTQS----VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLK
        VYM  P G  +      DYSLF KS+GTS T  LIYVDDILLT NDL+E++ LK
Subjt:  VYMSLPLGYTQS----VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLK

TrEMBL top hitse value%identityAlignment
A0A438BVT5 Retrovirus-related Pol polyprotein from transposon RE15.4e-5569.62Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        ALERN+TW ++PLPP HK IGC WVYKIK++ DG+IE YKARLVAKGYTQV G+DY+ETFSPT +LTTL CLLT+ A+R W+ HQL+V NAFLHGNL EE
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLP------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT
        VYM+ P       GY QS  DYSLFTKS+G  FTA LIYVDDILL  NDL EI+ LKT
Subjt:  VYMSLP------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT

A0A438GDC6 Retrovirus-related Pol polyprotein from transposon RE13.5e-5460Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        ALERN+TW ++PLPP HK IGC WVYKIK++SDG+IERYKARLVAKGYTQV G+DY+ETFSPTA+LTTL CLLT+ A+R W+ HQL+V NAFLHGNL EE
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT
        VYM+ P                                       GY QS  DYSLFTKS+G  FTA LIYVDDILLT NDL EI+ LKT
Subjt:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT

A0A445H5L3 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-5470.13Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        ALE+N+TWSL+PLP  HK IGC WVYKIK+ SDG+IERYKARLVAKGYTQVEG+DY+ETFSPTA++TTL CLLT+ AAR WF HQL+VQ+AFLHG+L E 
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLPLGYTQS----VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLK
        VYM  P G  +      DYSLF KS+GTS T  LIYVDDILLT NDL+E++ LK
Subjt:  VYMSLPLGYTQS----VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLK

A0A5N6N1H3 Reverse transcriptase Ty1/copia-type domain-containing protein3.5e-5461.9Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        AL+ N+TWSL+PLP  HK IGC WVYKIK+NSDG+IERYKARLVAKGYTQVEG+DYKETFSPTA+LTTL CLLT+ AAR WF HQL+VQNAFLHG+L E 
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLK
        VYM+ P                                       GYTQS  DYSLFTK++G SFTA LIYVDDILLT NDL EI+ LK
Subjt:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLK

A5BNR5 Integrase catalytic domain-containing protein3.5e-5460Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        ALERN+TW ++PLPP HK IGC WVYKIK++SDG+IERYKARLVAKGYTQV G+DY+ETFSPTA+LTTL CLLT+ A+R W+ HQL+V NAFLHGNL EE
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT
        VYM+ P                                       GY QS  DYSLFTKS+G  FTA LIYVDDILLT NDL EI+ LKT
Subjt:  VYMSLP--------------------------------------LGYTQS-VDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.4e-2130.11Show/hide
Query:  NHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEEVYMS
        N+TW++   P +   +   WV+ +K+N  G+  RYKARLVA+G+TQ   +DY+ETF+P A +++   +L++V       HQ++V+ AFL+G L EE+YM 
Subjt:  NHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEEVYMS

Query:  LPLGYT--------------------------------------QSVDYSLFTKSKG--TSFTAALIYVDDILLTVNDLKEIQHLK
        LP G +                                       SVD  ++   KG        L+YVDD+++   D+  + + K
Subjt:  LPLGYT--------------------------------------QSVDYSLFTKSKG--TSFTAALIYVDDILLTVNDLKEIQHLK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.8e-2342.2Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        +L++N T+ L+ LP   + + C WV+K+K + D  + RYKARLV KG+ Q +G+D+ E FSP  ++T++  +L++ A+      QL+V+ AFLHG+L+EE
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLPLGY
        +YM  P G+
Subjt:  VYMSLPLGY

P92520 Uncharacterized mitochondrial protein AtMg008202.4e-1548Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTI
        AL RN TW L+P P +   +GC WV+K K +SDG+++R KARLVAKG+ Q EG+ + ET+SP     T+  +L +
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.1e-3142.46Show/hide
Query:  NHTWSLIPLPPDHKAI-GCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEEVYM
        NHTW L+P PP H  I GC W++  K+NSDGS+ RYKARLVAKGY Q  G+DY ETFSP  + T++  +L +   R W   QL+V NAFL G L ++VYM
Subjt:  NHTWSLIPLPPDHKAI-GCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEEVYM

Query:  SLP---------------------------------------LGYTQSV-DYSLFTKSKGTSFTAALIYVDDILLTVND
        S P                                       +G+  SV D SLF   +G S    L+YVDDIL+T ND
Subjt:  SLP---------------------------------------LGYTQSV-DYSLFTKSKGTSFTAALIYVDDILLTVND

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.8e-3141.08Show/hide
Query:  NHTWSLIPLPPDHKAI-GCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEEVYM
        NHTW L+P PP    I GC W++  K NSDGS+ RYKARLVAKGY Q  G+DY ETFSP  + T++  +L +   R W   QL+V NAFL G L +EVYM
Subjt:  NHTWSLIPLPPDHKAI-GCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEEVYM

Query:  SLP---------------------------------------LGYTQSV-DYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQH
        S P                                       +G+  S+ D SLF   +G S    L+YVDDIL+T ND   ++H
Subjt:  SLP---------------------------------------LGYTQSV-DYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.6e-3842.05Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE
        A+E  HTW +  LPP+ K IGC WVYKIK+NSDG+IERYKARLVAKGYTQ EG+D+ ETFSP  +LT++  +L I A   +  HQL++ NAFL+G+LDEE
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEE

Query:  VYMSLPLGYT--------------------------------------------QSVDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT
        +YM LP GY                                                D++ F K   T F   L+YVDDI++  N+   +  LK+
Subjt:  VYMSLPLGYT--------------------------------------------QSVDYSLFTKSKGTSFTAALIYVDDILLTVNDLKEIQHLKT

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.7e-1648Show/hide
Query:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTI
        AL RN TW L+P P +   +GC WV+K K +SDG+++R KARLVAKG+ Q EG+ + ET+SP     T+  +L +
Subjt:  ALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYKARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACAATTGAAGAACCGTCTCAAAATGCAAATTCAACCGAATTTGAACTCTGGAATTAATGCAACAGACTTGGCTACGGGAAAGATGATTGGCTTGGGTAAA
CAATTCAGGGGTCTCTATCATATTTCATCTTCAATCAAATATTCAGCTCACCAAGTATCTCAGCCATCTGATTTGCGGCATTTACGCCTAGCTTTAGAACGTAAT
CATACTTGGTCTCTCATTCCTCTACCACCTGACCATAAAGCTATTGGTTGTCATTGGGTGTACAAGATTAAACACAACTCTGATGGTTCTATTGAACGTTATAAA
GCTCGACTAGTAGCAAAGGGATACACTCAAGTTGAAGGTGTTGATTACAAAGAGACATTTTCTCCTACAGCAGAACTTACTACACTTTGTTGCTTACTCACTATT
GTCGCTGCTCGAAAATGGTTCGCTCATCAGTTGAATGTTCAAAATGCCTTTCTCCACGGTAATCTAGACGAGGAAGTCTATATGTCTTTACCACTAGGCTACACT
CAGTCAGTAGATTACTCTTTATTTACTAAGAGTAAAGGTACTTCTTTCACTGCAGCTCTTATCTATGTTGATGATATTTTATTGACAGTCAATGATCTCAAAGAA
ATTCAACATCTCAAGACTAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCACAATTGAAGAACCGTCTCAAAATGCAAATTCAACCGAATTTGAACTCTGGAATTAATGCAACAGACTTGGCTACGGGAAAGATGATTGGCTTGGGTAAA
CAATTCAGGGGTCTCTATCATATTTCATCTTCAATCAAATATTCAGCTCACCAAGTATCTCAGCCATCTGATTTGCGGCATTTACGCCTAGCTTTAGAACGTAAT
CATACTTGGTCTCTCATTCCTCTACCACCTGACCATAAAGCTATTGGTTGTCATTGGGTGTACAAGATTAAACACAACTCTGATGGTTCTATTGAACGTTATAAA
GCTCGACTAGTAGCAAAGGGATACACTCAAGTTGAAGGTGTTGATTACAAAGAGACATTTTCTCCTACAGCAGAACTTACTACACTTTGTTGCTTACTCACTATT
GTCGCTGCTCGAAAATGGTTCGCTCATCAGTTGAATGTTCAAAATGCCTTTCTCCACGGTAATCTAGACGAGGAAGTCTATATGTCTTTACCACTAGGCTACACT
CAGTCAGTAGATTACTCTTTATTTACTAAGAGTAAAGGTACTTCTTTCACTGCAGCTCTTATCTATGTTGATGATATTTTATTGACAGTCAATGATCTCAAAGAA
ATTCAACATCTCAAGACTAGTTAA
Protein sequenceShow/hide protein sequence
MAQLKNRLKMQIQPNLNSGINATDLATGKMIGLGKQFRGLYHISSSIKYSAHQVSQPSDLRHLRLALERNHTWSLIPLPPDHKAIGCHWVYKIKHNSDGSIERYK
ARLVAKGYTQVEGVDYKETFSPTAELTTLCCLLTIVAARKWFAHQLNVQNAFLHGNLDEEVYMSLPLGYTQSVDYSLFTKSKGTSFTAALIYVDDILLTVNDLKE
IQHLKTS