CuGenDBv2

CmoCh12G007100 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh12G007100
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Cmo_Chr12:5405534..5409810
RNA-Seq Expression: CmoCh12G007100
Synteny: CmoCh12G007100
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0042227.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]; e value: 4.2e-15; % identity: 44.44
Query:  AALWARLLGLPLTSALAACPLPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLTA
        AALW+ LL LPLT ALAACP       + +    R +  + +      + EGV+ET R+SRCE                                    A
Subjt:  AALWARLLGLPLTSALAACPLPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLTA

Query:  ATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTL
        ATREILI+FLPV LPARGRKVQW+FL G EKVKLWAGC+VS  L
Subjt:  ATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTL

KAA0064098.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]; e value: 9.3e-15; % identity: 45.52
Query:  AALWARLLGLPLTSALAACPLPS-PSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLT
        AALW+ LL LPLT ALAACP  +    GE +       D   I      + EGV+ET R+SRCE                                    
Subjt:  AALWARLLGLPLTSALAACPLPS-PSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLT

Query:  AATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTL
        AATREILI+FLPV LP RGRKVQW+FL G EKVKLWAGC+VS  L
Subjt:  AATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTL

KAG6573513.1 hypothetical protein SDJN03_27400, partial [Cucurbita argyrosperma subsp. sororia]; e value: 5.8e-57; % identity: 67
Query:  MRSKPRGPCLNLSPYSQPSLTRLLALLNAPPRSKLSFIQRTRKWLNESKEERVVGQLYAQGQSKSPSPYLNVATCNHRAAAALWARLLGLPLTSALAACP
        MRSKPRGPCLNLSPY+QPSLTRLL LLNAPPRSKLSF+  T                    QSK PS YLN+ATCNH  AAALWA LLGLPLTSALAACP
Subjt:  MRSKPRGPCLNLSPYSQPSLTRLLALLNAPPRSKLSFIQRTRKWLNESKEERVVGQLYAQGQSKSPSPYLNVATCNHRAAAALWARLLGLPLTSALAACP

Query:  LPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLTAATREILIVFLPVLLPARGRK
        LPSPSFGEY FHNT SSDSYSIAMLYSF++EGVVETLRESRCE  G + P +                 NLLD ELV TAAT EILIVFLPVLL   GR+
Subjt:  LPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLTAATREILIVFLPVLLPARGRK

KAG6585891.1 hypothetical protein SDJN03_18624, partial [Cucurbita argyrosperma subsp. sororia]; e value: 2.0e-73; % identity: 99.31
Query:  MRSKPRGPCLNLSPYSQPSLTRLLALLNAPPRSKLSFIQRTRKWLNESKEERVVGQLYAQGQSKSPSPYLNVATCNHRAAAALWARLLGLPLTSALAACP
        MRSKPRGPCLNLSPYSQPSLTRLLALLNAPPRSKLSFIQRTRKWLNESKEERVVGQLYAQGQSKSPSPYLNVATCNHRAAAALWARLLGLPLTSALAACP
Subjt:  MRSKPRGPCLNLSPYSQPSLTRLLALLNAPPRSKLSFIQRTRKWLNESKEERVVGQLYAQGQSKSPSPYLNVATCNHRAAAALWARLLGLPLTSALAACP

Query:  LPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEG
        LPSPSFGEYSFHNTRSSDSYSIAMLYSF+NEGVVETLRESRCEG
Subjt:  LPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEG

KAG6590352.1 hypothetical protein SDJN03_15775, partial [Cucurbita argyrosperma subsp. sororia]; e value: 3.0e-53; % identity: 74.1
Query:  MRSKPRGPCLNLSPYSQPSLTRLLALLNAPPRSKLSFIQRT--------RKW-----LNES---------KEERVVGQLYAQGQSKSPSPYLNVATCNHR
        MRSKPRGPCLNLSPYSQPSLTRLLALLNAPPRSKLSF+  T        RKW     L ES         + E   G+L     SKSPSPYLN+ATCNH 
Subjt:  MRSKPRGPCLNLSPYSQPSLTRLLALLNAPPRSKLSFIQRT--------RKW-----LNES---------KEERVVGQLYAQGQSKSPSPYLNVATCNHR

Query:  AAAALWARLLGLPLTSALAACPLPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEG
        AAAALWARLLGLPLTSALAACPLPSPSFGEY FHNT SSDSYSIAMLYSF++EGVVETLRESRCEG
Subjt:  AAAALWARLLGLPLTSALAACPLPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEG

TrEMBL top hits (e value, % identity, alignment)
A0A0D2PZN2 Uncharacterized protein; e value: 9.4e-05; % identity: 31.78
Query:  QSKSPSPYLNVA---TCNHRAAAALWARLLGLPLTSALAACPLPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEG----VVETLRESRCEGWGLKDPTTFR
        Q+KSP P  + +   T NH AAAAL A LL LP+T ALAA                    S ++  LYS + EG        + +SR    GL+DPT FR
Subjt:  QSKSPSPYLNVA---TCNHRAAAALWARLLGLPLTSALAACPLPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEG----VVETLRESRCEGWGLKDPTTFR

Query:  PQGLKLSFLFSSFTSNLLDFELVLTAATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTLCSAGAILYEALDIGVVPSSSTISNSISHSG
        P+GL+L     SFT                       +LLPA GRKV +    G  +     G    + L S+ A     L     P    + ++   SG
Subjt:  PQGLKLSFLFSSFTSNLLDFELVLTAATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTLCSAGAILYEALDIGVVPSSSTISNSISHSG

Query:  DNLYPSGS-TSTLA
         N+ P  S  ST+A
Subjt:  DNLYPSGS-TSTLA

A0A5A7TKL6 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 2.0e-15; % identity: 44.44
Query:  AALWARLLGLPLTSALAACPLPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLTA
        AALW+ LL LPLT ALAACP       + +    R +  + +      + EGV+ET R+SRCE                                    A
Subjt:  AALWARLLGLPLTSALAACPLPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLTA

Query:  ATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTL
        ATREILI+FLPV LPARGRKVQW+FL G EKVKLWAGC+VS  L
Subjt:  ATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTL

A0A5A7URS5 Uncharacterized protein; e value: 1.1e-13; % identity: 43.75
Query:  AALWARLLGLPLTSALAACPLPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLTA
        AALW+ LL LPLT AL AC   +      +       D   I      + EGV+ET R+SRCE                                    A
Subjt:  AALWARLLGLPLTSALAACPLPSPSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLTA

Query:  ATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTL
        ATREILI+FLPV LPARGRKVQW+FL G EKVKLWAGC+VS  L
Subjt:  ATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTL

A0A5A7V903 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 4.5e-15; % identity: 45.52
Query:  AALWARLLGLPLTSALAACPLPS-PSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLT
        AALW+ LL LPLT ALAACP  +    GE +       D   I      + EGV+ET R+SRCE                                    
Subjt:  AALWARLLGLPLTSALAACPLPS-PSFGEYSFHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLT

Query:  AATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTL
        AATREILI+FLPV LP RGRKVQW+FL G EKVKLWAGC+VS  L
Subjt:  AATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQVSSTL

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAGAAGCAAGCCTAGAGGTCCCTGCCTTAACCTGTCCCCATACAGCCAGCCCTCACTCACCAGGCTGCTGGCATTGCTAAATGCTCCGCCACGAAGCAAGCTCTCTTT
CATACAACGGACTCGTAAGTGGCTAAATGAGTCGAAGGAAGAAAGAGTAGTTGGGCAACTTTATGCACAAGGGCAGTCGAAGTCCCCGTCCCCCTATCTCAATGTGGCGA
CGTGTAACCATAGAGCTGCCGCTGCCCTTTGGGCACGCCTCCTAGGCTTACCACTCACCTCTGCTCTTGCAGCATGCCCTCTTCCTTCCCCTTCCTTCGGGGAATACAGC
TTTCATAATACGCGAAGTAGTGATAGCTATTCTATCGCTATGCTATATTCTTTCAATAACGAAGGCGTGGTTGAGACTCTAAGAGAAAGTCGATGCGAAGGGTGGGGCTT
GAAGGACCCTACCACATTCCGACCCCAGGGCCTGAAATTGTCTTTTCTTTTCTCCTCTTTCACCAGTAACCTCTTGGATTTTGAGTTAGTTCTAACAGCTGCCACACGTG
AGATCCTGATCGTCTTCCTCCCTGTGTTGTTGCCCGCTAGGGGAAGGAAAGTTCAGTGGGTTTTCCTTCCAGGACCGGAGAAGGTCAAACTCTGGGCGGGCTGCCAGGTT
TCATCTACGCTATGTTCAGCAGGGGCCATTCTGTACGAGGCTCTTGATATAGGGGTTGTTCCTAGCTCAAGTACAATTTCGAATTCCATCTCACACTCTGGGGACAACCT
GTACCCTAGTGGTTCAACTTCTACTCTTGCGTGTTTTACAAGATTAGTGGAGATAAAAGAGTGGGTTGACCCTGAATCAAATAAGACCCTGCGCGACAACGTGGCCTTCT
TGGTGATAGTTGAAACATACCCTGTTCTTAAACTGACACTGTCTTGGGTGATGTCTCCCACAATTCGAGCAACGCGCTCTTCCATTGAGACCATTAATGGGGCGAAACGG
GACAGACGGGTGAACTCTCGCTCATACTCCTCTACTGACCTGTTTCCCTGCTCTATGCTTAGGAATTCGCATTGCTTCTTATATCGCACCACTGATTTGAAGCATTTTCG
AAGAAACGCTTTTCTAAACTGTGTCCAGGTGATTTCGTCCTCGTCTGCAGCAATCGCCCGAGAGGTTGATTGCCACCACAGTTTAGCATCTTCGCGTAACAAATAA
mRNA sequence
ATGAGAAGCAAGCCTAGAGGTCCCTGCCTTAACCTGTCCCCATACAGCCAGCCCTCACTCACCAGGCTGCTGGCATTGCTAAATGCTCCGCCACGAAGCAAGCTCTCTTT
CATACAACGGACTCGTAAGTGGCTAAATGAGTCGAAGGAAGAAAGAGTAGTTGGGCAACTTTATGCACAAGGGCAGTCGAAGTCCCCGTCCCCCTATCTCAATGTGGCGA
CGTGTAACCATAGAGCTGCCGCTGCCCTTTGGGCACGCCTCCTAGGCTTACCACTCACCTCTGCTCTTGCAGCATGCCCTCTTCCTTCCCCTTCCTTCGGGGAATACAGC
TTTCATAATACGCGAAGTAGTGATAGCTATTCTATCGCTATGCTATATTCTTTCAATAACGAAGGCGTGGTTGAGACTCTAAGAGAAAGTCGATGCGAAGGGTGGGGCTT
GAAGGACCCTACCACATTCCGACCCCAGGGCCTGAAATTGTCTTTTCTTTTCTCCTCTTTCACCAGTAACCTCTTGGATTTTGAGTTAGTTCTAACAGCTGCCACACGTG
AGATCCTGATCGTCTTCCTCCCTGTGTTGTTGCCCGCTAGGGGAAGGAAAGTTCAGTGGGTTTTCCTTCCAGGACCGGAGAAGGTCAAACTCTGGGCGGGCTGCCAGGTT
TCATCTACGCTATGTTCAGCAGGGGCCATTCTGTACGAGGCTCTTGATATAGGGGTTGTTCCTAGCTCAAGTACAATTTCGAATTCCATCTCACACTCTGGGGACAACCT
GTACCCTAGTGGTTCAACTTCTACTCTTGCGTGTTTTACAAGATTAGTGGAGATAAAAGAGTGGGTTGACCCTGAATCAAATAAGACCCTGCGCGACAACGTGGCCTTCT
TGGTGATAGTTGAAACATACCCTGTTCTTAAACTGACACTGTCTTGGGTGATGTCTCCCACAATTCGAGCAACGCGCTCTTCCATTGAGACCATTAATGGGGCGAAACGG
GACAGACGGGTGAACTCTCGCTCATACTCCTCTACTGACCTGTTTCCCTGCTCTATGCTTAGGAATTCGCATTGCTTCTTATATCGCACCACTGATTTGAAGCATTTTCG
AAGAAACGCTTTTCTAAACTGTGTCCAGGTGATTTCGTCCTCGTCTGCAGCAATCGCCCGAGAGGTTGATTGCCACCACAGTTTAGCATCTTCGCGTAACAAATAA
Protein sequence
MRSKPRGPCLNLSPYSQPSLTRLLALLNAPPRSKLSFIQRTRKWLNESKEERVVGQLYAQGQSKSPSPYLNVATCNHRAAAALWARLLGLPLTSALAACPLPSPSFGEYS
FHNTRSSDSYSIAMLYSFNNEGVVETLRESRCEGWGLKDPTTFRPQGLKLSFLFSSFTSNLLDFELVLTAATREILIVFLPVLLPARGRKVQWVFLPGPEKVKLWAGCQV
SSTLCSAGAILYEALDIGVVPSSSTISNSISHSGDNLYPSGSTSTLACFTRLVEIKEWVDPESNKTLRDNVAFLVIVETYPVLKLTLSWVMSPTIRATRSSIETINGAKR
DRRVNSRSYSSTDLFPCSMLRNSHCFLYRTTDLKHFRRNAFLNCVQVISSSSAAIAREVDCHHSLASSRNK