; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0003668 (gene) of Chayote v1 genome

Gene IDSed0003668
OrganismSechium edule (Chayote v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationLG10:27799074..27800684
RNA-Seq ExpressionSed0003668
SyntenySed0003668
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]5.0e-2149.32Show/hide
Query:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS
        +AR+M+ KSKLEN+K     +++Y  K+K LVDSLAA G+K+ +EDHIMHIL GL SE++ TVSVIS + +T +LQ+VYS LL+ E R  RN +IN DG+
Subjt:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS

Query:  TVSINLTQKTGMSSNGQ-VSSSSNSSNSNRGKNKG----KKHWNNN
          S+NLTQ+T  S++ Q +        +NR KN G    +++WN+N
Subjt:  TVSINLTQKTGMSSNGQ-VSSSSNSSNSNRGKNKG----KKHWNNN

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia]5.0e-2149.32Show/hide
Query:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS
        +AR+M+ KSKLEN+K     +++Y  K+K LVDSLAA G+K+ +EDHIMHIL GL SE++ TVSVIS + +T +LQ+VYS LL+ E R  RN +IN DG+
Subjt:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS

Query:  TVSINLTQKTGMSSNGQ-VSSSSNSSNSNRGKNKG----KKHWNNN
          S+NLTQ+T  S++ Q +        +NR KN G    +++WN+N
Subjt:  TVSINLTQKTGMSSNGQ-VSSSSNSSNSNRGKNKG----KKHWNNN

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]3.4e-1747.33Show/hide
Query:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS
        +AR+M+ K KLEN K     +++Y LKIKNLVDSLA  G+K++ EDHIMHILAGLG E+D  +SVI+ +    +LQ+V S LL QE R  RNL IN DGS
Subjt:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS

Query:  TVSINLTQKTGMSSNGQVSS------SSNSSNSNRGKN---KGKKHWNNN
          S+NLT       N    S       SN S   RG N     +++W  N
Subjt:  TVSINLTQKTGMSSNGQVSS------SSNSSNSNRGKN---KGKKHWNNN

XP_022156747.1 uncharacterized protein LOC111023586 [Momordica charantia]4.3e-1255.84Show/hide
Query:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQK
        +AR+M+ KSKLENMK     ++ Y LKIKNLVDSLA  G+++  +DHIMHILA LG E+D  VSVIS ++   S+Q+
Subjt:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQK

XP_022158089.1 uncharacterized protein LOC111024658 [Momordica charantia]3.6e-1138.22Show/hide
Query:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS
        + ++M+ K++L+N++     ++EY  +IKNLVDSL A G+ +  EDHIMHIL+GLGSEY+ TVSVI+ K+   ++Q V + LL+ + R+ + ++   D +
Subjt:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS

Query:  --TVSINLT-QKTGMSS-NGQVSSSSN---------SSNSNRGKNK----GKKHWNN
          + SINL  Q+ G ++ N  VS +SN         S+++NRG+ +    G + WN+
Subjt:  --TVSINLT-QKTGMSS-NGQVSSSSN---------SSNSNRGKNK----GKKHWNN

TrEMBL top hitse value%identityAlignment
A0A6J1C6N9 dr1-associated corepressor homolog isoform X12.4e-2149.32Show/hide
Query:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS
        +AR+M+ KSKLEN+K     +++Y  K+K LVDSLAA G+K+ +EDHIMHIL GL SE++ TVSVIS + +T +LQ+VYS LL+ E R  RN +IN DG+
Subjt:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS

Query:  TVSINLTQKTGMSSNGQ-VSSSSNSSNSNRGKNKG----KKHWNNN
          S+NLTQ+T  S++ Q +        +NR KN G    +++WN+N
Subjt:  TVSINLTQKTGMSSNGQ-VSSSSNSSNSNRGKNKG----KKHWNNN

A0A6J1C8R2 dr1-associated corepressor homolog isoform X22.4e-2149.32Show/hide
Query:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS
        +AR+M+ KSKLEN+K     +++Y  K+K LVDSLAA G+K+ +EDHIMHIL GL SE++ TVSVIS + +T +LQ+VYS LL+ E R  RN +IN DG+
Subjt:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS

Query:  TVSINLTQKTGMSSNGQ-VSSSSNSSNSNRGKNKG----KKHWNNN
          S+NLTQ+T  S++ Q +        +NR KN G    +++WN+N
Subjt:  TVSINLTQKTGMSSNGQ-VSSSSNSSNSNRGKNKG----KKHWNNN

A0A6J1DLT9 uncharacterized protein LOC1110217571.6e-1747.33Show/hide
Query:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS
        +AR+M+ K KLEN K     +++Y LKIKNLVDSLA  G+K++ EDHIMHILAGLG E+D  +SVI+ +    +LQ+V S LL QE R  RNL IN DGS
Subjt:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS

Query:  TVSINLTQKTGMSSNGQVSS------SSNSSNSNRGKN---KGKKHWNNN
          S+NLT       N    S       SN S   RG N     +++W  N
Subjt:  TVSINLTQKTGMSSNGQVSS------SSNSSNSNRGKN---KGKKHWNNN

A0A6J1DYD5 uncharacterized protein LOC1110246581.8e-1138.22Show/hide
Query:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS
        + ++M+ K++L+N++     ++EY  +IKNLVDSL A G+ +  EDHIMHIL+GLGSEY+ TVSVI+ K+   ++Q V + LL+ + R+ + ++   D +
Subjt:  MARIMERKSKLENMK-----IEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGS

Query:  --TVSINLT-QKTGMSS-NGQVSSSSN---------SSNSNRGKNK----GKKHWNN
          + SINL  Q+ G ++ N  VS +SN         S+++NRG+ +    G + WN+
Subjt:  --TVSINLT-QKTGMSS-NGQVSSSSN---------SSNSNRGKNK----GKKHWNN

A0A803QGZ4 Uncharacterized protein1.8e-1141.28Show/hide
Query:  ARIMERKSKLE-----NMKIEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGST
        AR+++ KS+L      N+ I +YC K+K L DSL+  G  +   D IMH+L GLG EYDP V  ++   +  SL+ + S LL  ESRL R+ T++D  S 
Subjt:  ARIMERKSKLE-----NMKIEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGST

Query:  VSINLTQKT
        ++ NL++ T
Subjt:  VSINLTQKT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)8.3e-0630.08Show/hide
Query:  RKSKLENMKIEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGSTVSINLTQKTG
        R + ++++ + EYC K+K+L D L  V   I+    +MH+L GL  +YD  ++VI  K   PS  +  S LL +ESRL        + S  S++ T    
Subjt:  RKSKLENMKIEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGSTVSINLTQKTG

Query:  MSS--------NGQVSSSSNSSNSNRGKNKGKK
        +S+          +     +++NSN G+ + KK
Subjt:  MSS--------NGQVSSSSNSSNSNRGKNKGKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACGAATTATGGAACGCAAATCAAAACTAGAGAATATGAAGATAGAAGAGTATTGTCTCAAGATTAAGAATCTTGTTGATTCTTTAGCTGCGGTAGGAAGGAAGAT
AGCTCTCGAAGATCACATAATGCATATTCTTGCTGGACTAGGATCTGAATATGATCCTACTGTGTCTGTTATCTCTGATAAAGAAGAAACGCCGTCACTGCAAAAAGTCT
ATTCGGCTTTACTAGCTCAAGAAAGTCGATTATTAAGGAACTTAACGATAAATGATGATGGTTCTACTGTTTCCATTAATCTTACTCAGAAAACTGGTATGAGTTCGAAT
GGTCAGGTTTCTTCATCATCTAATAGTTCCAACTCAAATCGAGGAAAGAACAAAGGAAAGAAGCATTGGAATAACAATTAA
mRNA sequenceShow/hide mRNA sequence
TTTCTCTCCAATGTGTCCTTGAAGATATACTTTCTTCTAAAACTTGACTTGGTATCAGAGCTTGAAAGGGTCTAATCGACTGAAGCGATTCTGAGAATTAGGGCACACAA
CCATGCAGAACTCCTCTCAAGAAAGAGGAACTTCCGATGTTGTCAACTCCGGCTCAAGTGCGAAAGTGATAAATCTAGGGAATAAGATTTCAACGGTGAAGCTCGACGAG
GAAAACTTTCTTCTCTGGAAGTTACAGGTAACCACTGCACTTAGAGGCCATGGATTGTTACAATACATCAAGGAAGATTGTGAAGTTCCGACGAAATTTCTTGAATCTGG
AAGATCTTCATCGTCTGATCTGGATCAAGAAATCAATCCGTTGTATGATGCTTGGATTCGTCAAGATAGTCTCGTTACGGCCTGGCTGCTTGGCTCCATGCCAAATTCAT
TGCTCGCCGAAATGCTGGATTGTGAAACAGCAAAAGAGGTATAGAAAATAGTGAATTCAAGATTTCATCTAGAAATATGGCACGAATTATGGAACGCAAATCAAAACTAG
AGAATATGAAGATAGAAGAGTATTGTCTCAAGATTAAGAATCTTGTTGATTCTTTAGCTGCGGTAGGAAGGAAGATAGCTCTCGAAGATCACATAATGCATATTCTTGCT
GGACTAGGATCTGAATATGATCCTACTGTGTCTGTTATCTCTGATAAAGAAGAAACGCCGTCACTGCAAAAAGTCTATTCGGCTTTACTAGCTCAAGAAAGTCGATTATT
AAGGAACTTAACGATAAATGATGATGGTTCTACTGTTTCCATTAATCTTACTCAGAAAACTGGTATGAGTTCGAATGGTCAGGTTTCTTCATCATCTAATAGTTCCAACT
CAAATCGAGGAAAGAACAAAGGAAAGAAGCATTGGAATAACAATTAACCCCAATGTCAAATATGTGGAAAATTTGGTCATACTGCTGTGAAATGTTATTTTCAATTTGAA
AGAGGATCTCAAGGATGTAGCAGTGAGGGAAGTTCGAGTAGTTCAAGTTCTACGGGATCACAGGCTAATGTGTTTATGGTTCAACAAGATATGAACCAGGATAATCATTG
GTATCCAGATTCCGGAGCTTCAAATCATGTTACAAATGATCTCTCGAGTCTGACTATCTCGTCTGAATATCAAGGAGATGGTAAAGTGCACATTGGCAATGGACAACGCT
AGTGGTAAGGTTCTTCTGCAAGGACGATTAGTTGATGGGCTATATCAATTCTGTCTGGAGAAAGCTGATTCCCAAAATTCTACTACTCAAGTCAAGTTTATCAATCCTTC
AAAGCCCAGTGTGTTCAATACGAGTGCTTTTAATACAAATGTATCCCCTATTGAATGTAATAAAACTGATCTGAATATGTTTGATATTTGGCATAATAGGCTAGGG
Protein sequenceShow/hide protein sequence
MARIMERKSKLENMKIEEYCLKIKNLVDSLAAVGRKIALEDHIMHILAGLGSEYDPTVSVISDKEETPSLQKVYSALLAQESRLLRNLTINDDGSTVSINLTQKTGMSSN
GQVSSSSNSSNSNRGKNKGKKHWNNN