CuGenDBv2

CmoCh14G011340 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh14G011340
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: Cmo_Chr14:8000253..8002259
RNA-Seq Expression: CmoCh14G011340
Synteny: CmoCh14G011340
Gene Ontology terms:
GO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
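The genome location field uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below parses that string and computes the length of the gene span. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Cmo_Chr14:8000253..8002259")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Cmo_Chr14 8000253 8002259 2007
```

Under that assumption the gene spans 2007 bp on Cmo_Chr14.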


Homology
GenBank top hits | e value | % identity | Alignment
CAN73071.1 hypothetical protein VITISV_032383 [Vitis vinifera] | 1.1e-21 | 49.65
Query:  NGAWFL---DIGTSLVGSSLPPSTFDSTSIEPFANPMITQVKADIFKTCHPTSLGYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAF
        NG W L    + T++VGS     T      + F +  + ++KA +       + GYTQVPGLDY  TFSPVVKATTVR+VLS+A+TNKW LRQLDVKNAF
Subjt:  NGAWFL---DIGTSLVGSSLPPSTFDSTSIEPFANPMITQVKADIFKTCHPTSLGYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAF

Query:  LNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR
        LN TL +HV      G           +LKKALY LKQ PR
Subjt:  LNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR

RVW43615.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 4.3e-21 | 49.65
Query:  NGAWFL---DIGTSLVGSSLPPSTFDSTSIEPFANPMITQVKADIFKTCHPTSLGYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAF
        NG W L    + T++VGS     T      +   +  + ++KA +       + GYTQVPGLDY  TFSPVVKATTVR+VLS+AITNKW LRQLDVKNAF
Subjt:  NGAWFL---DIGTSLVGSSLPPSTFDSTSIEPFANPMITQVKADIFKTCHPTSLGYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAF

Query:  LNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR
        LN TL +HV      G           +LKKALY LKQ PR
Subjt:  LNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR

RVW73295.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 1.7e-22 | 39.41
Query:  LDIGTSLVGSSLPPSTFDSTSIEPFA--------NPMITQVKADIFKTCHPTSL----------------------------------------------
        + + TSL GSSLPP      SIE  A        +PMIT+ KA IFKT HP +L                                              
Subjt:  LDIGTSLVGSSLPPSTFDSTSIEPFA--------NPMITQVKADIFKTCHPTSL----------------------------------------------

Query:  ---------------------------------------GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLL
                                               GYTQVPGLDY  TFSPVVKATTVR+VLS+A+TNKW LRQLDVKNAFLNDTL++  IWNNLL
Subjt:  ---------------------------------------GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLL

Query:  GIL
        GIL
Subjt:  GIL

RVW76295.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 8.6e-22 | 45.03
Query:  FLDIGTSLVGSSLPPSTFDSTSIEPF--------ANPMITQVKADIFKTCHPTSLGYTQ-----------------------------VPGLDYIGTFSP
        FL + TSL GS LPP      SIE          ++PMIT+ KA IFK  HP +LG                                VPGLDY  TFS 
Subjt:  FLDIGTSLVGSSLPPSTFDSTSIEPF--------ANPMITQVKADIFKTCHPTSLGYTQ-----------------------------VPGLDYIGTFSP

Query:  VVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR
        VVKATTVR+VL +A+TNKW LRQLDVKN FLN TL +HV      G           +LKKALY LKQ PR
Subjt:  VVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR

RVW87709.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 2.1e-23 | 39.71
Query:  MTADSSILNQSKNYTSKDSVIVGNGAWFLDIGTSLVG-------------------------------------------SSLPPSTFD---STSIEPFA
        MT D S L+++ NYT KD V+V NGA  L I T +                                             +S PP   D   + ++   +
Subjt:  MTADSSILNQSKNYTSKDSVIVGNGAWFLDIGTSLVG-------------------------------------------SSLPPSTFD---STSIEPFA

Query:  NPMITQVKADIFKTCHPTSL--------------------GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNL
        + M+T+ KA IFKT HP +L                    GYTQVPGLDY  TF+PV+K T VR+VLS+A+TNKW LRQLDVKNAFLN TL ++VIWNNL
Subjt:  NPMITQVKADIFKTCHPTSL--------------------GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNL

Query:  LGIL
        L  L
Subjt:  LGIL

TrEMBL top hits | e value | % identity | Alignment
A0A438EBA0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.1e-21 | 48.94
Query:  NGAWFL---DIGTSLVGSSLPPSTFDSTSIEPFANPMITQVKADIFKTCHPTSLGYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAF
        NG W L    + T++VGS     T      + F +  + ++KA +       + GYTQVPGLDY  TFSPVVKATTVR+VLS+A+TNKW LRQLDV NAF
Subjt:  NGAWFL---DIGTSLVGSSLPPSTFDSTSIEPFANPMITQVKADIFKTCHPTSLGYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAF

Query:  LNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR
        LN TL +HV      G           +LKKALY LKQ PR
Subjt:  LNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR

A0A438GM61 Retrovirus-related Pol polyprotein from transposon RE1 | 8.4e-23 | 39.41
Query:  LDIGTSLVGSSLPPSTFDSTSIEPFA--------NPMITQVKADIFKTCHPTSL----------------------------------------------
        + + TSL GSSLPP      SIE  A        +PMIT+ KA IFKT HP +L                                              
Subjt:  LDIGTSLVGSSLPPSTFDSTSIEPFA--------NPMITQVKADIFKTCHPTSL----------------------------------------------

Query:  ---------------------------------------GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLL
                                               GYTQVPGLDY  TFSPVVKATTVR+VLS+A+TNKW LRQLDVKNAFLNDTL++  IWNNLL
Subjt:  ---------------------------------------GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLL

Query:  GIL
        GIL
Subjt:  GIL

A0A438GVS0 Retrovirus-related Pol polyprotein from transposon RE1 | 4.2e-22 | 45.03
Query:  FLDIGTSLVGSSLPPSTFDSTSIEPF--------ANPMITQVKADIFKTCHPTSLGYTQ-----------------------------VPGLDYIGTFSP
        FL + TSL GS LPP      SIE          ++PMIT+ KA IFK  HP +LG                                VPGLDY  TFS 
Subjt:  FLDIGTSLVGSSLPPSTFDSTSIEPF--------ANPMITQVKADIFKTCHPTSLGYTQ-----------------------------VPGLDYIGTFSP

Query:  VVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR
        VVKATTVR+VL +A+TNKW LRQLDVKN FLN TL +HV      G           +LKKALY LKQ PR
Subjt:  VVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR

A0A438HTE5 Retrovirus-related Pol polyprotein from transposon RE1 | 9.9e-24 | 39.71
Query:  MTADSSILNQSKNYTSKDSVIVGNGAWFLDIGTSLVG-------------------------------------------SSLPPSTFD---STSIEPFA
        MT D S L+++ NYT KD V+V NGA  L I T +                                             +S PP   D   + ++   +
Subjt:  MTADSSILNQSKNYTSKDSVIVGNGAWFLDIGTSLVG-------------------------------------------SSLPPSTFD---STSIEPFA

Query:  NPMITQVKADIFKTCHPTSL--------------------GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNL
        + M+T+ KA IFKT HP +L                    GYTQVPGLDY  TF+PV+K T VR+VLS+A+TNKW LRQLDVKNAFLN TL ++VIWNNL
Subjt:  NPMITQVKADIFKTCHPTSL--------------------GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNL

Query:  LGIL
        L  L
Subjt:  LGIL

A5C5R8 Integrase catalytic domain-containing protein | 5.5e-22 | 49.65
Query:  NGAWFL---DIGTSLVGSSLPPSTFDSTSIEPFANPMITQVKADIFKTCHPTSLGYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAF
        NG W L    + T++VGS     T      + F +  + ++KA +       + GYTQVPGLDY  TFSPVVKATTVR+VLS+A+TNKW LRQLDVKNAF
Subjt:  NGAWFL---DIGTSLVGSSLPPSTFDSTSIEPFANPMITQVKADIFKTCHPTSLGYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAF

Query:  LNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR
        LN TL +HV      G           +LKKALY LKQ PR
Subjt:  LNDTLIDHVIWNNLLG-----------ILKKALYELKQDPR

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 3.7e-07 | 38.82
Query:  GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLLGI---------LKKALYELKQDPR
        G+TQ   +DY  TF+PV + ++ R +LS+ I     + Q+DVK AFLN TL + +      GI         L KA+Y LKQ  R
Subjt:  GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLLGI---------LKKALYELKQDPR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.3e-07 | 39.08
Query:  GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLLGI-----------LKKALYELKQDPR
        G+ Q  G+D+   FSPVVK T++R +LS+A +    + QLDVK AFL+  L + +      G            L K+LY LKQ PR
Subjt:  GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLLGI-----------LKKALYELKQDPR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 5.6e-16 | 51.72
Query:  GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWN-----------NLLGILKKALYELKQDPR
        GY Q PGLDY  TFSPV+K+T++RIVL +A+   W +RQLDV NAFL  TL D V  +           N +  L+KALY LKQ PR
Subjt:  GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWN-----------NLLGILKKALYELKQDPR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 7.3e-16 | 50.57
Query:  GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLLGI-----------LKKALYELKQDPR
        GY Q PGLDY  TFSPV+K+T++RIVL +A+   W +RQLDV NAFL  TL D V  +   G            L+KA+Y LKQ PR
Subjt:  GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLLGI-----------LKKALYELKQDPR

Arabidopsis top hits | e value | % identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 4.3e-11 | 39.56
Query:  GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWN---------------NLLGILKKALYELKQDPR
        GYTQ  G+D+I TFSPV K T+V+++L+I+    ++L QLD+ NAFLN  L + +                  N +  LKK++Y LKQ  R
Subjt:  GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWN---------------NLLGILKKALYELKQDPR
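The % identity values in the hit tables can be cross-checked against the alignments. In BLAST-style pairwise output, the middle line shows a letter where query and subject residues are identical, `+` for a conservative substitution, and a space otherwise. The sketch below counts the letters on the middle line of the P04146 (Copia protein) alignment and divides by the alignment length, gaps included. The function name `percent_identity` is illustrative only, and using the gapped query line as the alignment length is an assumption about how the page computes the value.

```python
# Query and middle lines copied from the P04146 (Copia protein) alignment above.
QUERY = "GYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWSLRQLDVKNAFLNDTLIDHVIWNNLLGI---------LKKALYELKQDPR"
MIDLINE = "G+TQ   +DY  TF+PV + ++ R +LS+ I     + Q+DVK AFLN TL + +      GI         L KA+Y LKQ  R"


def percent_identity(query: str, midline: str) -> float:
    """Identities (letters on the middle line) over alignment length, as a percentage."""
    identities = sum(ch.isalpha() for ch in midline)
    # Assumption: alignment length is the length of the gapped query line.
    return 100.0 * identities / len(query)


identities = sum(ch.isalpha() for ch in MIDLINE)
print(identities, len(QUERY))                      # 33 85
print(round(percent_identity(QUERY, MIDLINE), 2))  # 38.82
```

The result, 33 identities over 85 aligned columns, agrees with the 38.82 reported for this hit.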


Sequences
CDS sequence
ATGACCGCCGACTCATCTATTTTGAATCAGTCTAAAAATTACACGAGTAAGGACTCTGTCATCGTAGGAAACGGTGCATGGTTTTTGGACATTGGTACTTCTCTTGTAGG
TTCCTCTTTGCCACCCTCGACTTTTGATTCGACCTCTATTGAACCTTTTGCTAATCCTATGATCACACAAGTCAAAGCTGATATCTTCAAGACTTGTCATCCAACAAGTC
TGGGTTATACTCAGGTTCCTGGTCTCGACTACATTGGCACTTTCAGTCCAGTTGTCAAAGCTACCACTGTCCGTATTGTGCTTTCTATTGCGATCACAAATAAATGGTCT
CTTCGGCAACTTGATGTCAAGAATGCTTTCCTCAATGACACTCTTATTGATCATGTCATATGGAACAACCTCCTGGGTATATTAAAGAAAGCCCTCTATGAGTTAAAGCA
AGATCCTCGTGTCTAG
mRNA sequence
ATGACCGCCGACTCATCTATTTTGAATCAGTCTAAAAATTACACGAGTAAGGACTCTGTCATCGTAGGAAACGGTGCATGGTTTTTGGACATTGGTACTTCTCTTGTAGG
TTCCTCTTTGCCACCCTCGACTTTTGATTCGACCTCTATTGAACCTTTTGCTAATCCTATGATCACACAAGTCAAAGCTGATATCTTCAAGACTTGTCATCCAACAAGTC
TGGGTTATACTCAGGTTCCTGGTCTCGACTACATTGGCACTTTCAGTCCAGTTGTCAAAGCTACCACTGTCCGTATTGTGCTTTCTATTGCGATCACAAATAAATGGTCT
CTTCGGCAACTTGATGTCAAGAATGCTTTCCTCAATGACACTCTTATTGATCATGTCATATGGAACAACCTCCTGGGTATATTAAAGAAAGCCCTCTATGAGTTAAAGCA
AGATCCTCGTGTCTAG
Protein sequence
MTADSSILNQSKNYTSKDSVIVGNGAWFLDIGTSLVGSSLPPSTFDSTSIEPFANPMITQVKADIFKTCHPTSLGYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWS
LRQLDVKNAFLNDTLIDHVIWNNLLGILKKALYELKQDPRV
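As a consistency check between the CDS and protein sequences listed above, the CDS can be translated codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the page itself does not state which table was used, and `translate` is a hypothetical helper rather than a CuGenDB function.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; a stop codon is rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped listing
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


CDS = """
ATGACCGCCGACTCATCTATTTTGAATCAGTCTAAAAATTACACGAGTAAGGACTCTGTCATCGTAGGAAACGGTGCATGGTTTTTGGACATTGGTACTTCTCTTGTAGG
TTCCTCTTTGCCACCCTCGACTTTTGATTCGACCTCTATTGAACCTTTTGCTAATCCTATGATCACACAAGTCAAAGCTGATATCTTCAAGACTTGTCATCCAACAAGTC
TGGGTTATACTCAGGTTCCTGGTCTCGACTACATTGGCACTTTCAGTCCAGTTGTCAAAGCTACCACTGTCCGTATTGTGCTTTCTATTGCGATCACAAATAAATGGTCT
CTTCGGCAACTTGATGTCAAGAATGCTTTCCTCAATGACACTCTTATTGATCATGTCATATGGAACAACCTCCTGGGTATATTAAAGAAAGCCCTCTATGAGTTAAAGCA
AGATCCTCGTGTCTAG
"""

PROTEIN = """
MTADSSILNQSKNYTSKDSVIVGNGAWFLDIGTSLVGSSLPPSTFDSTSIEPFANPMITQVKADIFKTCHPTSLGYTQVPGLDYIGTFSPVVKATTVRIVLSIAITNKWS
LRQLDVKNAFLNDTLIDHVIWNNLLGILKKALYELKQDPRV
"""

protein = "".join(PROTEIN.split())
translated = translate(CDS)

# The last codon (TAG) is a stop, so compare everything before it with the listed protein.
print(translated[:-1] == protein)
```

If the comparison prints `True`, the listed protein is the frame-1 translation of the listed CDS up to the terminal stop codon.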