CuGenDBv2

CmoCh16G009850 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh16G009850
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: RNA polymerase II elongation factor
Genome location: Cmo_Chr16:6762346..6762993
RNA-Seq Expression: CmoCh16G009850
Synteny: CmoCh16G009850
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity
KAG7015562.1 hypothetical protein SDJN02_23198, partial [Cucurbita argyrosperma subsp. argyrosperma] | 5.5e-115 | 99.07
Query:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
        MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
Subjt:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN

Query:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF
        VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLT LSRNFVTI LLCF
Subjt:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF

Query:  SGIVFGASKFILCGL
        SGIVFGASKFILCGL
Subjt:  SGIVFGASKFILCGL

XP_022932317.1 uncharacterized protein LOC111438709 [Cucurbita moschata] | 2.9e-116 | 100
Query:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
        MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
Subjt:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN

Query:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF
        VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF
Subjt:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF

Query:  SGIVFGASKFILCGL
        SGIVFGASKFILCGL
Subjt:  SGIVFGASKFILCGL

XP_022965180.1 uncharacterized protein LOC111465114 [Cucurbita maxima] | 4.0e-113 | 97.21
Query:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
        MATETKDSGAA H+VEIPAENQN+MISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
Subjt:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN

Query:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF
        VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTR VQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLT LSRNFVTI LLCF
Subjt:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF

Query:  SGIVFGASKFILCGL
        SGIVFGASKFILCGL
Subjt:  SGIVFGASKFILCGL

XP_023553388.1 uncharacterized protein LOC111810817 [Cucurbita pepo subsp. pepo] | 4.0e-113 | 97.67
Query:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
        MATET+DSG AAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLR+GRRETK+ESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
Subjt:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN

Query:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF
        VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLT LSRNFVTIVLLCF
Subjt:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF

Query:  SGIVFGASKFILCGL
        SGIVFGASKFILCGL
Subjt:  SGIVFGASKFILCGL

XP_038905570.1 uncharacterized protein LOC120091553 [Benincasa hispida] | 1.1e-97 | 86.57
Query:  ETKDSGAAAHVVEIPAE------NQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSS
        ETKDSG  AH+VEIP E      NQN MISVI+ HPLRQISESSGHLLLLKLWQR+EHLFGLRIGRRETK+ESLKQQIFQLCC+FFLFHALSLTLL+TSS
Subjt:  ETKDSGAAAHVVEIPAE------NQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSS

Query:  DPNVCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVL
        DPNVC KWWVP VAMGATSGVFVIVVQLKLWLYWKAS QLQREK+ENRALTRCVQELRMKGSCFNLSKEPQIG+RMKSSSVEIKWGPLT  SRNF+TI L
Subjt:  DPNVCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVL

Query:  LCFSGIVFGASKFILC
        L FS IVF ASKFILC
Subjt:  LCFSGIVFGASKFILC

TrEMBL top hits | e value | % identity
A0A0A0L2U7 Uncharacterized protein | 3.4e-94 | 83.26
Query:  ETKDSGAAAHVVEIPAE----NQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDP
        ETKDS  AAH+VEIP E     QN+MISVI+ HPLRQISESSGHLLLLKLWQR+EHLFGLRIGRRETK+ESLKQQIFQLCC+FFLFHALSLTLL+TSSDP
Subjt:  ETKDSGAAAHVVEIPAE----NQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDP

Query:  NVCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLC
         VC KWWVP V +GATSGVFVIVVQLKLW+YWKA  QLQ+EK+ENRALTRCVQELRMKGSCFNLSKEPQIG+RMKSSSVEIKWGPLT  SRNF+TI LL 
Subjt:  NVCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLC

Query:  FSGIVFGASKFILCG
        FS I+F  SKFILCG
Subjt:  FSGIVFGASKFILCG

A0A1S3BL81 uncharacterized protein LOC103491049 | 2.0e-94 | 83.72
Query:  ETKDSGAAAHVVEIPAE----NQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDP
        +TKDS  AAH+VEIP E     QN+MISVI+ HPLRQISESSGHLLLLKLWQR+EHLFGLRIGRRETK+ESLKQQIFQLCC+FFLFHALSLTLL+TSSDP
Subjt:  ETKDSGAAAHVVEIPAE----NQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDP

Query:  NVCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLC
         VC KWWVP V +GATSGVFVIVVQLKLW+YWKA  QLQREKSENRALTRC QELRMKGSCFNLSKEPQIG+RMKSSSVEIKWGPLT  SRNF+ I LL 
Subjt:  NVCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLC

Query:  FSGIVFGASKFILCG
        FS IVF ASKFILCG
Subjt:  FSGIVFGASKFILCG

A0A5A7VJ68 Uncharacterized protein | 1.5e-65 | 86.39
Query:  LESLKQQIFQLCCYFFLFHALSLTLLFTSSDPNVCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEP
        +ESLKQQIFQLCC+FFLFHALSLTLL+TSSDP VC KWWVP V +GATSGVFVIVVQLKLW+YWKA  QLQREKSENRALTRCVQELRMKGSCFNLSKEP
Subjt:  LESLKQQIFQLCCYFFLFHALSLTLLFTSSDPNVCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEP

Query:  QIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCFSGIVFGASKFILCG
        QIG+RMKSSSVEIKWGPLT  SRNF+ I LL FS IVF ASKFILCG
Subjt:  QIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCFSGIVFGASKFILCG

A0A6J1EWB5 uncharacterized protein LOC111438709 | 1.4e-116 | 100
Query:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
        MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
Subjt:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN

Query:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF
        VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF
Subjt:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF

Query:  SGIVFGASKFILCGL
        SGIVFGASKFILCGL
Subjt:  SGIVFGASKFILCGL

A0A6J1HN07 uncharacterized protein LOC111465114 | 1.9e-113 | 97.21
Query:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
        MATETKDSGAA H+VEIPAENQN+MISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN
Subjt:  MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN

Query:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF
        VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTR VQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLT LSRNFVTI LLCF
Subjt:  VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCF

Query:  SGIVFGASKFILCGL
        SGIVFGASKFILCGL
Subjt:  SGIVFGASKFILCGL

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT3G12870.1 unknown protein | 2.8e-24 | 36.13
Query:  QHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLF-------TSSDPNVCHKWWVPGVAMGATSGVFVIVV
        +HPL QI+++  H LLLK W +EE L   R+  +E++++S++++I QL  +FFLFH++SL LLF       +S+  + C + W+P +    +S   +  V
Subjt:  QHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLF-------TSSDPNVCHKWWVPGVAMGATSGVFVIVV

Query:  QLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLS-RNFVTIVLLCFSGIVFGASKFILC
        + K  +       L+REK + + L +CV+EL+ KG  F+L KE     R KS  VE K  P+   S R+FVT+     S +V    + ILC
Subjt:  QLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLS-RNFVTIVLLCFSGIVFGASKFILC

AT5G56120.1 unknown protein | 4.2e-60 | 52.12
Query:  TKDSGAAAHVVEIPAENQNV-----------------MISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHA
        TKDS    HVVEIP + ++                  ++ VI+QHPL +ISES GHLLLLKLWQREE LF  R+  +E++LES+K++IFQLCC+F +FH 
Subjt:  TKDSGAAAHVVEIPAENQNV-----------------MISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHA

Query:  LSLTLLFTSSDPN---------VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSV
           TL+++SS  +         VC KWW+P     ATS V V +VQ KL+++WK    + RE+++NR LTRCV ELRMKGS F+LSKEP  G RMKSSSV
Subjt:  LSLTLLFTSSDPN---------VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSV

Query:  EIKWGPLTLLSRNFVTIVLLCFSGIVFGASKFILCG
        EIKW P+T  S+  +TIVLLC +G+ F  SKFILCG
Subjt:  EIKWGPLTLLSRNFVTIVLLCFSGIVFGASKFILCG
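
The % identity values in the hit tables can be cross-checked against the middle line of each alignment. The sketch below assumes the usual BLAST convention for that line (a letter marks an identical residue, while "+" or a space marks a non-identical column) and assumes the line's whitespace was preserved when the page was copied. The function name percent_identity is illustrative and not part of CuGenDB. The three segments are the middle lines of the first GenBank hit, KAG7015562.1.

```python
def percent_identity(midline_segments):
    """Return identical columns / alignment length * 100.

    Assumes a BLAST-style middle line: letters are identities,
    '+' and ' ' are non-identical columns.
    """
    midline = "".join(midline_segments)
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# Middle lines of the KAG7015562.1 alignment, copied from the record above.
kag7015562_midline = [
    "MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPN",
    "VCHKWWVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLT LSRNFVTI LLCF",
    "SGIVFGASKFILCGL",
]

# 213 of the 215 columns are letters, which rounds to the 99.07 listed for this hit.
print(f"{percent_identity(kag7015562_midline):.2f}")
```

The same count applied to the other hits is only meaningful if leading and trailing spaces in the middle line survived extraction, since each stripped space shortens the apparent alignment length.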


Sequences
CDS sequence
ATGGCGACCGAAACGAAAGATTCCGGTGCCGCCGCGCACGTAGTCGAAATCCCAGCAGAGAATCAGAACGTTATGATCTCTGTAATAGAACAACACCCATTGAGG
CAAATCTCCGAAAGCTCTGGCCATTTATTGCTTTTAAAGCTCTGGCAACGAGAGGAGCATCTGTTCGGCCTACGAATTGGGCGGCGAGAGACCAAACTGGAGTCT
CTAAAGCAACAGATCTTCCAACTCTGCTGCTACTTCTTCCTGTTCCACGCCCTGTCTCTGACTCTCTTGTTCACTTCGTCCGATCCCAATGTGTGCCACAAATGG
TGGGTTCCGGGAGTGGCAATGGGGGCGACGTCGGGCGTGTTTGTGATCGTGGTGCAGCTGAAATTGTGGCTGTATTGGAAGGCATCAGCGCAGCTGCAAAGGGAG
AAGAGTGAGAACAGAGCACTTACAAGATGTGTGCAAGAGCTGAGGATGAAAGGGTCGTGTTTTAATTTGTCGAAAGAGCCTCAAATTGGGCATAGGATGAAGAGC
TCTAGTGTGGAGATTAAATGGGGGCCTCTTACTTTGTTGTCTAGGAATTTCGTCACCATTGTTCTTCTGTGCTTTTCTGGCATTGTTTTTGGTGCTTCCAAGTTC
ATACTCTGTGGCCTTTGA
mRNA sequence
ATGGCGACCGAAACGAAAGATTCCGGTGCCGCCGCGCACGTAGTCGAAATCCCAGCAGAGAATCAGAACGTTATGATCTCTGTAATAGAACAACACCCATTGAGG
CAAATCTCCGAAAGCTCTGGCCATTTATTGCTTTTAAAGCTCTGGCAACGAGAGGAGCATCTGTTCGGCCTACGAATTGGGCGGCGAGAGACCAAACTGGAGTCT
CTAAAGCAACAGATCTTCCAACTCTGCTGCTACTTCTTCCTGTTCCACGCCCTGTCTCTGACTCTCTTGTTCACTTCGTCCGATCCCAATGTGTGCCACAAATGG
TGGGTTCCGGGAGTGGCAATGGGGGCGACGTCGGGCGTGTTTGTGATCGTGGTGCAGCTGAAATTGTGGCTGTATTGGAAGGCATCAGCGCAGCTGCAAAGGGAG
AAGAGTGAGAACAGAGCACTTACAAGATGTGTGCAAGAGCTGAGGATGAAAGGGTCGTGTTTTAATTTGTCGAAAGAGCCTCAAATTGGGCATAGGATGAAGAGC
TCTAGTGTGGAGATTAAATGGGGGCCTCTTACTTTGTTGTCTAGGAATTTCGTCACCATTGTTCTTCTGTGCTTTTCTGGCATTGTTTTTGGTGCTTCCAAGTTC
ATACTCTGTGGCCTTTGA
Protein sequence
MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLRQISESSGHLLLLKLWQREEHLFGLRIGRRETKLESLKQQIFQLCCYFFLFHALSLTLLFTSSDPNVCHKW
WVPGVAMGATSGVFVIVVQLKLWLYWKASAQLQREKSENRALTRCVQELRMKGSCFNLSKEPQIGHRMKSSSVEIKWGPLTLLSRNFVTIVLLCFSGIVFGASKF
ILCGL
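
The protein sequence can be checked against the CDS by translating it with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: the names CODON_TABLE and translate are illustrative, and only the first line of the CDS is translated here. It also computes the length of the genome span, which at 648 bp is consistent with 215 codons plus a stop codon under the assumption that the gene has a single exon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# T, C, A, G at each position and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the CDS sequence from this record (105 nt, 35 codons).
first_cds_line = (
    "ATGGCGACCGAAACGAAAGATTCCGGTGCCGCCGCGCACGTAGTCGAAATCCCAGCAGAG"
    "AATCAGAACGTTATGATCTCTGTAATAGAACAACACCCATTGAGG"
)
print(translate(first_cds_line))  # MATETKDSGAAAHVVEIPAENQNVMISVIEQHPLR

# Genome location Cmo_Chr16:6762346..6762993 is an inclusive range.
gene_length = 6762993 - 6762346 + 1
print(gene_length, gene_length // 3 - 1)  # 648 215
```

Passing the full CDS (all lines joined) to translate should reproduce the 215-residue protein listed above, provided the line breaks are removed first.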