; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh20G009750 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh20G009750
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr20:5420190..5420951
RNA-Seq ExpressionCmoCh20G009750
SyntenyCmoCh20G009750
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022927092.1 uncharacterized protein LOC111434028 [Cucurbita moschata]5.7e-10375.39Show/hide
Query:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID
        M+ M+TQI VTMK VEN+  GQT+TGS+KLKF DPRPFKGNRDAKEL+NFIFD+E YFK T ACT+DIKV VA+M+L+DDAK WWR KVQDIEN LCTID
Subjt:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID

Query:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF
        SWEDLKR+LRDQF PENVE++A E    LKQT SI +YVRQFSTLMLDIRGTSEKDKVF FINGLQPWAKTK++EKKVQDLAT +ASAERL D+GS AS+
Subjt:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF

Query:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPYP
        QRK  Q PNT GK YK  G+RN   NRPNR+NDRP+ W +RP QNNQ GTSRGPYP
Subjt:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPYP

XP_022972954.1 uncharacterized protein LOC111471473 [Cucurbita maxima]6.4e-9470.2Show/hide
Query:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID
        MA M T+I VTMKAVENV AGQ +TGS+KL+F DPR FK NRDAKELENFIFD+EQYFK T+ACT+D KV VA+M+L+DDAK WWR+KVQDIE+ L TID
Subjt:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID

Query:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF
        SWEDLK++LRD+F PEN  ++A E    LK T  I +YVRQFSTLMLDI GT EKDK+F FINGLQPWAKTK++E KVQ LA A+A AERLLD+G+EA  
Subjt:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF

Query:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY
        QR+IT  PNT GK YK   HRN   NRPN  NDRP+ W DRP QNNQAGTSRGPY
Subjt:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY

XP_022975176.1 uncharacterized protein LOC111474215 [Cucurbita maxima]1.1e-9872.94Show/hide
Query:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID
        MA M T+I VTMKAVENV AGQT+TGS+KL+F +PR FKGNRDAKELENFIFD+EQYFK T+ACT+D KV VA+M+L DDAK WWR+KVQDIE+ LCTID
Subjt:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID

Query:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF
        SWEDLK++LRDQF PEN  ++A E    LK T SI +YVRQFSTLMLDIRGTSEKDKVF FINGLQPWAKTK++E KVQ LA A+A AERLLD+G+EA  
Subjt:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF

Query:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY
        QR+IT  PNT GK YK   HRN   NRPN  NDRP+ W DRP QNNQAGTSRGPY
Subjt:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY

XP_022975516.1 uncharacterized protein LOC111474945, partial [Cucurbita maxima]5.6e-9872.16Show/hide
Query:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID
        MA M T+I VTMKAVENV AGQT+TGS+KL+F +PR FKGN+DAKELENFIFD+EQYFK T+ C +D KV VA+M+L DDAK WWR+KVQDIE+ LCTID
Subjt:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID

Query:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF
        SWEDLK++LRDQF PEN E++A E    LK T SI +YVRQFSTLMLDIRGTSEKDKVF FINGLQPWAKTK++E KVQ LA A+A AERLLD+G+EA  
Subjt:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF

Query:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY
        QR+IT  PNT GK YK   HRN   NRPN  NDRP+ W DRP QNNQAGTSRGPY
Subjt:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY

XP_022975706.1 uncharacterized protein LOC111475733, partial [Cucurbita maxima]3.6e-9772.16Show/hide
Query:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID
        MA M T+I VTMKAVENV AGQT+TGS KL+F DPR FKGNRDAKELENFIFD+EQYFK T+ACT+D KV VA+M+L DDAK WWR+KVQDIE+ LCTID
Subjt:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID

Query:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF
        SWEDLK++LRDQF PEN  ++A E    LK T  I +YVRQFSTLMLDIRGTSEKDKVF FINGLQPWAKTK++E KVQ LA A+A  ERLLD+G+EA  
Subjt:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF

Query:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY
        QR+IT  PNT GK YK   HRN   NRPN  NDRP+ W DRP QNNQAGTSR PY
Subjt:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY

TrEMBL top hitse value%identityAlignment
A0A6J1EG61 uncharacterized protein LOC1114340282.8e-10375.39Show/hide
Query:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID
        M+ M+TQI VTMK VEN+  GQT+TGS+KLKF DPRPFKGNRDAKEL+NFIFD+E YFK T ACT+DIKV VA+M+L+DDAK WWR KVQDIEN LCTID
Subjt:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID

Query:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF
        SWEDLKR+LRDQF PENVE++A E    LKQT SI +YVRQFSTLMLDIRGTSEKDKVF FINGLQPWAKTK++EKKVQDLAT +ASAERL D+GS AS+
Subjt:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF

Query:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPYP
        QRK  Q PNT GK YK  G+RN   NRPNR+NDRP+ W +RP QNNQ GTSRGPYP
Subjt:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPYP

A0A6J1ID35 uncharacterized protein LOC1114714733.1e-9470.2Show/hide
Query:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID
        MA M T+I VTMKAVENV AGQ +TGS+KL+F DPR FK NRDAKELENFIFD+EQYFK T+ACT+D KV VA+M+L+DDAK WWR+KVQDIE+ L TID
Subjt:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID

Query:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF
        SWEDLK++LRD+F PEN  ++A E    LK T  I +YVRQFSTLMLDI GT EKDK+F FINGLQPWAKTK++E KVQ LA A+A AERLLD+G+EA  
Subjt:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF

Query:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY
        QR+IT  PNT GK YK   HRN   NRPN  NDRP+ W DRP QNNQAGTSRGPY
Subjt:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY

A0A6J1IDF7 uncharacterized protein LOC1114742155.4e-9972.94Show/hide
Query:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID
        MA M T+I VTMKAVENV AGQT+TGS+KL+F +PR FKGNRDAKELENFIFD+EQYFK T+ACT+D KV VA+M+L DDAK WWR+KVQDIE+ LCTID
Subjt:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID

Query:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF
        SWEDLK++LRDQF PEN  ++A E    LK T SI +YVRQFSTLMLDIRGTSEKDKVF FINGLQPWAKTK++E KVQ LA A+A AERLLD+G+EA  
Subjt:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF

Query:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY
        QR+IT  PNT GK YK   HRN   NRPN  NDRP+ W DRP QNNQAGTSRGPY
Subjt:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY

A0A6J1IEF9 uncharacterized protein LOC1114749452.7e-9872.16Show/hide
Query:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID
        MA M T+I VTMKAVENV AGQT+TGS+KL+F +PR FKGN+DAKELENFIFD+EQYFK T+ C +D KV VA+M+L DDAK WWR+KVQDIE+ LCTID
Subjt:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID

Query:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF
        SWEDLK++LRDQF PEN E++A E    LK T SI +YVRQFSTLMLDIRGTSEKDKVF FINGLQPWAKTK++E KVQ LA A+A AERLLD+G+EA  
Subjt:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF

Query:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY
        QR+IT  PNT GK YK   HRN   NRPN  NDRP+ W DRP QNNQAGTSRGPY
Subjt:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY

A0A6J1IEY4 uncharacterized protein LOC1114757331.7e-9772.16Show/hide
Query:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID
        MA M T+I VTMKAVENV AGQT+TGS KL+F DPR FKGNRDAKELENFIFD+EQYFK T+ACT+D KV VA+M+L DDAK WWR+KVQDIE+ LCTID
Subjt:  MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTID

Query:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF
        SWEDLK++LRDQF PEN  ++A E    LK T  I +YVRQFSTLMLDIRGTSEKDKVF FINGLQPWAKTK++E KVQ LA A+A  ERLLD+G+EA  
Subjt:  SWEDLKRKLRDQFFPENVEYIARE---TLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASF

Query:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY
        QR+IT  PNT GK YK   HRN   NRPN  NDRP+ W DRP QNNQAGTSR PY
Subjt:  QRKITQTPNTRGKAYKSLGHRNEGSNRPNRSNDRPNEWADRPLQNNQAGTSRGPY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAGATGAACACCCAAATTGGGGTGACCATGAAAGCCGTGGAGAATGTCATGGCAGGACAAACTCATACAGGATCCGACAAACTGAAGTTCATAGACCCCAGACC
CTTCAAAGGGAATCGGGACGCCAAAGAGTTGGAGAACTTTATTTTTGATATCGAACAGTACTTCAAAATCACATCAGCCTGTACCAACGACATAAAAGTTGCAGTAGCCA
CAATGCATCTCATGGACGATGCAAAGTTTTGGTGGCGTTCGAAGGTGCAAGACATTGAAAATGAATTATGCACCATCGACTCGTGGGAAGACCTCAAGAGAAAGTTGAGG
GACCAATTCTTCCCCGAAAACGTAGAATACATAGCAAGAGAAACACTAAAACAAACTGAAAGCATAATGGAATATGTCAGACAGTTCTCGACCCTGATGCTGGATATCAG
GGGCACGTCAGAGAAAGACAAGGTATTCCTTTTCATAAATGGGTTGCAACCGTGGGCCAAAACAAAAATATACGAGAAAAAGGTTCAAGACCTAGCCACCGCAGTCGCCA
GTGCCGAAAGACTCCTAGACTTTGGAAGCGAAGCGAGTTTCCAAAGAAAAATAACACAAACCCCAAACACTAGGGGCAAAGCATATAAGTCGTTGGGACATCGAAACGAA
GGCTCCAATAGGCCAAATAGAAGTAACGACAGACCAAATGAATGGGCAGATAGACCTCTTCAGAATAACCAAGCGGGGACATCTCGAGGACCTTACCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAGATGAACACCCAAATTGGGGTGACCATGAAAGCCGTGGAGAATGTCATGGCAGGACAAACTCATACAGGATCCGACAAACTGAAGTTCATAGACCCCAGACC
CTTCAAAGGGAATCGGGACGCCAAAGAGTTGGAGAACTTTATTTTTGATATCGAACAGTACTTCAAAATCACATCAGCCTGTACCAACGACATAAAAGTTGCAGTAGCCA
CAATGCATCTCATGGACGATGCAAAGTTTTGGTGGCGTTCGAAGGTGCAAGACATTGAAAATGAATTATGCACCATCGACTCGTGGGAAGACCTCAAGAGAAAGTTGAGG
GACCAATTCTTCCCCGAAAACGTAGAATACATAGCAAGAGAAACACTAAAACAAACTGAAAGCATAATGGAATATGTCAGACAGTTCTCGACCCTGATGCTGGATATCAG
GGGCACGTCAGAGAAAGACAAGGTATTCCTTTTCATAAATGGGTTGCAACCGTGGGCCAAAACAAAAATATACGAGAAAAAGGTTCAAGACCTAGCCACCGCAGTCGCCA
GTGCCGAAAGACTCCTAGACTTTGGAAGCGAAGCGAGTTTCCAAAGAAAAATAACACAAACCCCAAACACTAGGGGCAAAGCATATAAGTCGTTGGGACATCGAAACGAA
GGCTCCAATAGGCCAAATAGAAGTAACGACAGACCAAATGAATGGGCAGATAGACCTCTTCAGAATAACCAAGCGGGGACATCTCGAGGACCTTACCCATAA
Protein sequenceShow/hide protein sequence
MAEMNTQIGVTMKAVENVMAGQTHTGSDKLKFIDPRPFKGNRDAKELENFIFDIEQYFKITSACTNDIKVAVATMHLMDDAKFWWRSKVQDIENELCTIDSWEDLKRKLR
DQFFPENVEYIARETLKQTESIMEYVRQFSTLMLDIRGTSEKDKVFLFINGLQPWAKTKIYEKKVQDLATAVASAERLLDFGSEASFQRKITQTPNTRGKAYKSLGHRNE
GSNRPNRSNDRPNEWADRPLQNNQAGTSRGPYP