CuGenDBv2

CmaCh06G009010 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh06G009010
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Cma_Chr06:5804189..5806116
RNA-Seq Expression: CmaCh06G009010
Synteny: CmaCh06G009010
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR025724 - GAG-pre-integrase domain


Homology
GenBank top hits (e value, %identity, alignment)
KAF3680274.1 putative 50S ribosomal protein L18-like [Capsicum annuum] (e value: 1.6e-28, %identity: 43.72)
Query:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKL----------------------------------KFDEI
        M E GSV D+INEFNMIVSQL SV+INFEDE K+LILMSSL ESWDTVV  IS+    D L                                   F ++
Subjt:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKL----------------------------------KFDEI

Query:  -------------RDVVLS---------ESIR-----KREL---GNSSGSALSVDQRGIRCMNRVAVAESASNSSIWHNRLGHMNVKGMKMSAAKEVLEL
                     RDV +          E +R     K+ L   G    +  +  + G  C+N V+V ESAS S +W+NRLGHM+ KGMKM AAK  L+ 
Subjt:  -------------RDVVLS---------ESIR-----KREL---GNSSGSALSVDQRGIRCMNRVAVAESASNSSIWHNRLGHMNVKGMKMSAAKEVLEL

Query:  LKSVDMSPCVNCVMSKQKRVSFIKTVRELKK
        LKSVDM  C +CVM KQKRVSF+KT RE K+
Subjt:  LKSVDMSPCVNCVMSKQKRVSFIKTVRELKK

KAG7011443.1 hypothetical protein SDJN02_26349, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.0e-34, %identity: 36.71)
Query:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVD---------
        MSEGGS+ DYINEFNMIVS+LS VEINF+DEIKALILMSSLPESWDTVV  I++ R SDKLKFDEIRD+VL ESIR R+ G+SSG ALS D         
Subjt:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVD---------

Query:  ----------------------------------QRG-------------------------------IRCMNRVAVAESASNSSIWHNRLGHMNVKGMK
                                          Q+G                                 C+N  A   S SNSS+WHNRLGH++VKGMK
Subjt:  ----------------------------------QRG-------------------------------IRCMNRVAVAESASNSSIWHNRLGHMNVKGMK

Query:  MSAAKEVLELLKSVDM-----SPCVNCVMSKQKRVSFIKTVRELKKSIGTTKQV--GGEVELQNNSQSDVVEDTQ---------ETSKTVAEKPEKDSHS
        M  AK  LE LKSVD+     SP V+ +   +  V+FI        ++ TT  +  G  V L+     +V    +           +  V    EK    
Subjt:  MSAAKEVLELLKSVDM-----SPCVNCVMSKQKRVSFIKTVRELKKSIGTTKQV--GGEVELQNNSQSDVVEDTQ---------ETSKTVAEKPEKDSHS

Query:  DVVA-----------------------------DIQETPETLAEEPEV--KQVGVEVEVEVELLKDSLNDVVADTQETPKTVAEEPEEEQVTPKQ
        DV A                             D+      L +  E    +   +V VE+E  ++S +DV  + QETP  VAEE + EQVTP++
Subjt:  DVVA-----------------------------DIQETPETLAEEPEV--KQVGVEVEVEVELLKDSLNDVVADTQETPKTVAEEPEEEQVTPKQ

VFQ59121.1 unnamed protein product [Cuscuta campestris] (e value: 2.3e-27, %identity: 75.26)
Query:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRGIRC
        M E GSV ++IN+FNMIVSQL  VEINFEDEIK LIL+SS+PESWD VV  IS+ R S+KL+FDEIRDVVLSESIRKRE+ +SSGSALSVD+RG RC
Subjt:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRGIRC

VFQ69914.1 unnamed protein product [Cuscuta campestris] (e value: 8.6e-30, %identity: 78.72)
Query:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRG
        M E GSV ++IN+FNMIVSQL SVEINFEDEIKALIL+SS+PESWDTVV  IS+ R S+KL+FDEIRDVVLSESIRKRE+G+SSGSALSVD++G
Subjt:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRG

VFR00719.1 unnamed protein product [Cuscuta campestris] (e value: 1.1e-37, %identity: 44.44)
Query:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRG------
        M E GSV ++IN+FNMIVSQL SVEINFEDEIKALIL+SS+ ESWDTVV  IS+ R S+KL+FDEIRDVVLSESIRKRE+G+SSGSALSVDQ+G      
Subjt:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRG------

Query:  ------IRCMNRVAVAESASNSSIWH-NRLGHMNVKGMKMSAAKEVLELLKSVDMSPCVNCVMSKQKRVSFIKTVRELKKSIGTTKQVGGEVELQNNSQS
               +  NR   + + SN + W+    GH      K    +      KS D    VN   ++    + I +V +L    G    V   +  Q  S S
Subjt:  ------IRCMNRVAVAESASNSSIWH-NRLGHMNVKGMKMSAAKEVLELLKSVDMSPCVNCVMSKQKRVSFIKTVRELKKSIGTTKQVGGEVELQNNSQS

Query:  DVVEDTQETSKTVAEKPEKD-SHSDVVAD--------IQETPETLAEEPEVKQVGVEVEVEVELLKDSLNDVVADTQETPKTVAEEPEEEQVTPKQVLKR
          + + +   +   +K +K   H DV  D         Q+ PE+       KQVGVEVE+E    K +  +V A+TQ TP T+ EEPE EQVTP+QVL+R
Subjt:  DVVEDTQETSKTVAEKPEKD-SHSDVVAD--------IQETPETLAEEPEVKQVGVEVEVEVELLKDSLNDVVADTQETPKTVAEEPEEEQVTPKQVLKR

Query:  SSRAIKLPDRLQKCVALSSAEVEY
        SSR  ++PDR+ + +  SS  V Y
Subjt:  SSRAIKLPDRLQKCVALSSAEVEY

TrEMBL top hits (e value, %identity, alignment)
A0A438E1E8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.2e-26, %identity: 32.7)
Query:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSA-------------
        M+E  SV  ++NEFN I +QLSSVEI+F+DEI+ALI+++SLP SW+ +   +SN    +KLK+++IRD++L+E IR+R+ G +SGS              
Subjt:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSA-------------

Query:  -----LSVDQ---------------------------------------------------------------RGIRCMNR----------------VAV
             L+VD                                                                +G R + R                +AV
Subjt:  -----LSVDQ---------------------------------------------------------------RGIRCMNR----------------VAV

Query:  AESASNSSIWHNRLGHMNVKGMKMSAAKEVLELLKSVDMSPCVNCVMSKQKRVSFIKTVRELK
        A++++++S+WH RLGHM+ KGMKM  +K  L  LKS+D   C +C++ KQK+VSF+KT R LK
Subjt:  AESASNSSIWHNRLGHMNVKGMKMSAAKEVLELLKSVDMSPCVNCVMSKQKRVSFIKTVRELK

A0A484K039 Uncharacterized protein (e value: 1.1e-27, %identity: 75.26)
Query:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRGIRC
        M E GSV ++IN+FNMIVSQL  VEINFEDEIK LIL+SS+PESWD VV  IS+ R S+KL+FDEIRDVVLSESIRKRE+ +SSGSALSVD+RG RC
Subjt:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRGIRC

A0A484KZ82 CCHC-type domain-containing protein (e value: 4.1e-30, %identity: 78.72)
Query:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRG
        M E GSV ++IN+FNMIVSQL SVEINFEDEIKALIL+SS+PESWDTVV  IS+ R S+KL+FDEIRDVVLSESIRKRE+G+SSGSALSVD++G
Subjt:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRG

A0A484NK44 CCHC-type domain-containing protein (e value: 5.4e-38, %identity: 44.44)
Query:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRG------
        M E GSV ++IN+FNMIVSQL SVEINFEDEIKALIL+SS+ ESWDTVV  IS+ R S+KL+FDEIRDVVLSESIRKRE+G+SSGSALSVDQ+G      
Subjt:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRG------

Query:  ------IRCMNRVAVAESASNSSIWH-NRLGHMNVKGMKMSAAKEVLELLKSVDMSPCVNCVMSKQKRVSFIKTVRELKKSIGTTKQVGGEVELQNNSQS
               +  NR   + + SN + W+    GH      K    +      KS D    VN   ++    + I +V +L    G    V   +  Q  S S
Subjt:  ------IRCMNRVAVAESASNSSIWH-NRLGHMNVKGMKMSAAKEVLELLKSVDMSPCVNCVMSKQKRVSFIKTVRELKKSIGTTKQVGGEVELQNNSQS

Query:  DVVEDTQETSKTVAEKPEKD-SHSDVVAD--------IQETPETLAEEPEVKQVGVEVEVEVELLKDSLNDVVADTQETPKTVAEEPEEEQVTPKQVLKR
          + + +   +   +K +K   H DV  D         Q+ PE+       KQVGVEVE+E    K +  +V A+TQ TP T+ EEPE EQVTP+QVL+R
Subjt:  DVVEDTQETSKTVAEKPEKD-SHSDVVAD--------IQETPETLAEEPEVKQVGVEVEVEVELLKDSLNDVVADTQETPKTVAEEPEEEQVTPKQVLKR

Query:  SSRAIKLPDRLQKCVALSSAEVEY
        SSR  ++PDR+ + +  SS  V Y
Subjt:  SSRAIKLPDRLQKCVALSSAEVEY

A0A6A3C4B2 65-kDa microtubule-associated protein 3 (e value: 4.7e-26, %identity: 35.91)
Query:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGN-SSGSALSVDQRG-----
        M+EG S+  ++NE N I +QLSSV+I F+DE++ALI++SSLP+SW+ +V  +S+   + KLKF+++RD+VLSE IR+RE G  S+ SAL  + RG     
Subjt:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGN-SSGSALSVDQRG-----

Query:  --------------------IRCM----------------------------------NRVAVAESASNSSIWHNRLGHMNVKGMKMSAAKEVLELLKSV
                              C                                   N +  A++   S++WH RLGHM+ KGMK   +K  L  LK+V
Subjt:  --------------------IRCM----------------------------------NRVAVAESASNSSIWHNRLGHMNVKGMKMSAAKEVLELLKSV

Query:  DMSPCVNCVMSKQKRVSFIK
        D+  C +C+  KQK+VSF K
Subjt:  DMSPCVNCVMSKQKRVSFIK

SwissProt top hits (e value, %identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 2.8e-07, %identity: 49.12)
Query:  SSRAIKLPDRLQKCVALSSAEVEYVTLAETGKKMIWMIDYLEELGNKKHEKILQVDS
        S  AI    +LQKCVALS+ E EY+   ETGK+MIW+  +L+ELG  + E ++  DS
Subjt:  SSRAIKLPDRLQKCVALSSAEVEYVTLAETGKKMIWMIDYLEELGNKKHEKILQVDS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.8e-06, %identity: 34.04)
Query:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRG
        MSEG + + ++N FN +++QL+++ +  E+E KA++L++SLP S+D +   I + + + +LK D    ++L+E +RK+    + G AL  + RG
Subjt:  MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRG

Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCTGAAGGTGGATCTGTTGTGGACTATATAAATGAATTCAATATGATCGTAAGTCAACTGAGTTCGGTGGAAATTAATTTCGAGGATGAAATTAAAGCATTGATTTT
GATGTCATCTTTACCCGAGTCGTGGGATACTGTTGTTTTCGTAATCAGCAATTTCCGAGAATCTGATAAACTAAAGTTTGATGAAATTCGAGATGTAGTTCTCAGCGAAA
GTATTCGTAAACGAGAACTTGGAAATTCATCTGGGAGTGCTCTTAGTGTTGACCAACGGGGAATAAGGTGTATGAACAGAGTTGCTGTTGCTGAGAGTGCTTCAAATTCA
AGTATATGGCACAATAGACTTGGTCATATGAATGTTAAAGGAATGAAGATGTCAGCTGCAAAAGAAGTTTTGGAACTTCTGAAATCTGTTGATATGAGTCCTTGTGTGAA
CTGTGTTATGAGCAAACAGAAACGAGTTAGCTTCATAAAGACTGTCAGAGAATTGAAGAAAAGTATCGGGACAACGAAGCAAGTGGGAGGTGAGGTTGAGTTGCAGAACA
ATTCACAGAGTGATGTTGTAGAAGATACTCAAGAAACTTCTAAGACTGTTGCTGAGAAACCAGAGAAAGATTCACATAGTGATGTTGTAGCGGATATTCAAGAAACTCCT
GAGACTCTTGCTGAGGAACCAGAGGTGAAGCAAGTTGGAGTTGAGGTTGAGGTTGAGGTTGAGTTGCTGAAAGATTCACTTAATGATGTTGTAGCTGATACTCAAGAAAC
TCCTAAGACTGTTGCTGAAGAACCGGAAGAGGAGCAAGTAACACCTAAGCAGGTGTTGAAAAGATCATCCAGAGCCATCAAATTACCAGATAGGCTTCAGAAATGCGTTG
CTCTTTCATCTGCTGAGGTTGAGTACGTGACATTAGCTGAAACTGGAAAGAAGATGATATGGATGATAGACTATCTAGAAGAATTAGGCAATAAGAAGCACGAGAAGATT
CTTCAGGTCGATAGTTAG
mRNA sequence
ATGTCTGAAGGTGGATCTGTTGTGGACTATATAAATGAATTCAATATGATCGTAAGTCAACTGAGTTCGGTGGAAATTAATTTCGAGGATGAAATTAAAGCATTGATTTT
GATGTCATCTTTACCCGAGTCGTGGGATACTGTTGTTTTCGTAATCAGCAATTTCCGAGAATCTGATAAACTAAAGTTTGATGAAATTCGAGATGTAGTTCTCAGCGAAA
GTATTCGTAAACGAGAACTTGGAAATTCATCTGGGAGTGCTCTTAGTGTTGACCAACGGGGAATAAGGTGTATGAACAGAGTTGCTGTTGCTGAGAGTGCTTCAAATTCA
AGTATATGGCACAATAGACTTGGTCATATGAATGTTAAAGGAATGAAGATGTCAGCTGCAAAAGAAGTTTTGGAACTTCTGAAATCTGTTGATATGAGTCCTTGTGTGAA
CTGTGTTATGAGCAAACAGAAACGAGTTAGCTTCATAAAGACTGTCAGAGAATTGAAGAAAAGTATCGGGACAACGAAGCAAGTGGGAGGTGAGGTTGAGTTGCAGAACA
ATTCACAGAGTGATGTTGTAGAAGATACTCAAGAAACTTCTAAGACTGTTGCTGAGAAACCAGAGAAAGATTCACATAGTGATGTTGTAGCGGATATTCAAGAAACTCCT
GAGACTCTTGCTGAGGAACCAGAGGTGAAGCAAGTTGGAGTTGAGGTTGAGGTTGAGGTTGAGTTGCTGAAAGATTCACTTAATGATGTTGTAGCTGATACTCAAGAAAC
TCCTAAGACTGTTGCTGAAGAACCGGAAGAGGAGCAAGTAACACCTAAGCAGGTGTTGAAAAGATCATCCAGAGCCATCAAATTACCAGATAGGCTTCAGAAATGCGTTG
CTCTTTCATCTGCTGAGGTTGAGTACGTGACATTAGCTGAAACTGGAAAGAAGATGATATGGATGATAGACTATCTAGAAGAATTAGGCAATAAGAAGCACGAGAAGATT
CTTCAGGTCGATAGTTAG
Protein sequence
MSEGGSVVDYINEFNMIVSQLSSVEINFEDEIKALILMSSLPESWDTVVFVISNFRESDKLKFDEIRDVVLSESIRKRELGNSSGSALSVDQRGIRCMNRVAVAESASNS
SIWHNRLGHMNVKGMKMSAAKEVLELLKSVDMSPCVNCVMSKQKRVSFIKTVRELKKSIGTTKQVGGEVELQNNSQSDVVEDTQETSKTVAEKPEKDSHSDVVADIQETP
ETLAEEPEVKQVGVEVEVEVELLKDSLNDVVADTQETPKTVAEEPEEEQVTPKQVLKRSSRAIKLPDRLQKCVALSSAEVEYVTLAETGKKMIWMIDYLEELGNKKHEKI
LQVDS