; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0020888 (gene) of Chayote v1 genome

Gene IDSed0020888
OrganismSechium edule (Chayote v1)
DescriptionReverse transcriptase Ty1/copia-type domain-containing protein
Genome locationLG03:16910163..16914994
RNA-Seq ExpressionSed0020888
SyntenySed0020888
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAA22288.1 polyprotein [Oryza australiensis]1.3e-3571.43Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        MDVKTAFLN  L EDVYM  P+GF DP +PG++ +LQKSIYGLK+ SRSWN+RFDE  K F F+KNEEE CVYKK+SG ++ FL+LYVDDILLIGND  M
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVK
        LESVK
Subjt:  LESVK

KAF5757832.1 putative RNA-directed DNA polymerase [Helianthus annuus]3.1e-3470.48Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        MDVKTAFLN  L EDVYM  PEGF DP NP +V +L KSIYGLK+ SR+WNLRFD+  KQF F+KNE+E CVYKK SG S+TFL+LYVDDILLIGN+  M
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVK
        L+ VK
Subjt:  LESVK

KAF5783325.1 putative RNA-directed DNA polymerase [Helianthus annuus]6.2e-3572.38Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        MDVKTAFLN  L EDVYM  PEGF DP NP +V +L KSIYGLK+ SRSWNLRFD+  KQF F+KNE+E CVYKK SG S+TFL+LYVDDILLIGND  M
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVK
        L  VK
Subjt:  LESVK

KAF5788164.1 putative RNA-directed DNA polymerase [Helianthus annuus]6.2e-3572.38Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        MDVKTAFLN  L EDVYM  PEGF DP NP +V +L KSIYGLK+ SRSWNLRFD+  KQF F+KNE+E CVYKK SG S+TFL+LYVDDILLIGND  M
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVK
        L+ VK
Subjt:  LESVK

KAG7543183.1 Reverse transcriptase RNA-dependent DNA polymerase [Arabidopsis thaliana x Arabidopsis arenosa]1.6e-3573.33Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        MDVKTAFLN  L+EDVYMT PEGF  P N G+V +LQ+SIYGL+K SRSWNLRFDEA K+F F++NEEE CVYKK SG +V FLVLYVDDILLIGND  +
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVK
        L+SVK
Subjt:  LESVK

TrEMBL top hitse value%identityAlignment
A0A6D2HJE2 CCHC-type domain-containing protein3.3e-3470.48Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        MDVKTAFLN  L+EDVYM  PEGF  P N G+V +LQ++IYGLK+ SRSWNLRFDEA K+F F++NEEE CVYKK SG +V FLVLYVDDI LIGND  +
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVK
        L+SVK
Subjt:  LESVK

D3IVT9 Putative retrotransposon protein9.7e-3470.48Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        MDVKTAFLN  L EDVYMT PEGF DP N  +V +LQKSIYGLK+ SRSWN+RFDE  K+F F+KN+EE CVY K+SG ++  L+LYVDDILLIGND  M
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVK
        LESVK
Subjt:  LESVK

D3IVU0 Putative retrotransposon protein1.3e-3369.52Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        MDVKTAFLN  L EDVYMT PEGF DP N  +V +LQKSIYGLK+ SRSWN+RFDE  K+F F+KN+EE CVY K+SG ++  L+LYVDDILL+GND  M
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVK
        LESVK
Subjt:  LESVK

O23864 Polyprotein6.1e-3671.43Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        MDVKTAFLN  L EDVYM  P+GF DP +PG++ +LQKSIYGLK+ SRSWN+RFDE  K F F+KNEEE CVYKK+SG ++ FL+LYVDDILLIGND  M
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVK
        LESVK
Subjt:  LESVK

Q2QQ28 Retrotransposon protein, putative, Ty1-copia subclass9.7e-3469.52Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        MDVKT FLN  L EDVYMT P+GF DP +  ++ +LQKSIYGLK+ SRSWN+RFDE  K  RF+KNEEE CVYKK+SG ++ FL+LYVDDILLIGND  M
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVK
        LESVK
Subjt:  LESVK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-1439.25Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVY--KKLSGISVTFLVLYVDDILLIGNDR
        MDVKTAFLN  LKE++YM  P+G     N   V +L K+IYGLK+ +R W   F++A K+  F+ +  + C+Y   K +     +++LYVDD+++   D 
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVY--KKLSGISVTFLVLYVDDILLIGNDR

Query:  TMLESVK
        T + + K
Subjt:  TMLESVK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.0e-1942.45Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVY-KKLSGISVTFLVLYVDDILLIGNDRT
        +DVKTAFL+  L+E++YM  PEGF+       V +L KS+YGLK+  R W ++FD   K   +LK   + CVY K+ S  +   L+LYVDD+L++G D+ 
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVY-KKLSGISVTFLVLYVDDILLIGNDRT

Query:  MLESVK
        ++  +K
Subjt:  MLESVK

P25600 Putative transposon Ty5-1 protein YCL074W2.0e-1233.33Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        MDV TAFLN+ + E +Y+  P GF +  NP  V+ L   +YGLK+    WN   +   K+  F ++E E  +Y + +     ++ +YVDD+L+      +
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVK
         + VK
Subjt:  LESVK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.8e-1636.79Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        +DV  AFL   L +DVYM+ P GF D   P  V +L+K++YGLK+  R+W +          F+ +  +  ++    G S+ ++++YVDDIL+ GND T+
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVKD
        L +  D
Subjt:  LESVKD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.4e-1536.79Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM
        +DV  AFL   L ++VYM+ P GF D   P  V RL+K+IYGLK+  R+W +          F+ +  +  ++    G S+ ++++YVDDIL+ GND  +
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTM

Query:  LESVKD
        L+   D
Subjt:  LESVKD

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.0e-1433.94Show/hide
Query:  MDVKTAFLNAFLKEDVYMTHPEGFQ----DPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGN
        +D+  AFLN  L E++YM  P G+     D   P  V  L+KSIYGLK+ SR W L+F      F F+++  +   + K++      +++YVDDI++  N
Subjt:  MDVKTAFLNAFLKEDVYMTHPEGFQ----DPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGN

Query:  DRTMLESVK
        +   ++ +K
Subjt:  DRTMLESVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTTAAAACCGCTTTCTTGAATGCGTTCTTGAAAGAGGATGTGTATATGACACACCCTGAAGGTTTTCAAGATCCAGCCAATCCTGGGAGAGTATACAGGCTTCA
AAAATCTATTTATGGATTGAAAAAAGTATCTAGGAGTTGGAACCTCAGATTTGATGAGGCATTCAAACAGTTTAGGTTCCTTAAAAATGAAGAGGAATTTTGTGTATACA
AGAAGTTAAGTGGGATCAGTGTTACCTTCCTTGTTCTGTATGTAGATGACATACTACTCATAGGAAACGATAGAACCATGCTTGAATCAGTCAAAGATTGA
mRNA sequenceShow/hide mRNA sequence
CGAACTTGAAATAATGCAATTTTATATATTATTTTCTAAAAAATCTATCCTAAAGAGAAATGAAACATTGTGCTTTCCAACTTCAAGGTCTCTCTCTGCTTCATTAACCC
TACCTCATCAAATCCTCTTCTTGAATGTCAAGGAAACAACCTCCGGTATCACAACGGTAGGGGAATTGATGGAATCTTCGACTAGGAGTTCGCTGCCCACTATTGTATTC
GACCTTTAAATTTCCTCCCTCGCATCCCGACCCATTTCACTCTTTTTTACCTTCATTTTCATTCATCGCGATTTTACACACATTTCACCCTTTTCTCTCCAACTCCAACC
CCCTTCATCGCCATTGCCACTACCGAAGCACCTAAAGTCCTCATCATCTCGATCACCGCCAGCTTCATCTGCCTGTCGCAACCTATATTTGAAAGGGAATATGGAAAGGG
AAATAAAGAGGCTAGGGAAAAGACCTATGTTGTAGGCATTTAAATTTTAATTAATTCTTCACATTTCCTTGTGATTGAAAGTAGTAACGAAAATAGTAAGAGATACCAAA
CAAGCTCAAGGGGTCTAAATCTAAGTATTATAATAAGCAAAAATTGTTGAACATTGAGGTGTATCTTCTAACAATTTGTTGGTTGTTACAAAGCTGTGGGCCATGTTACT
TTATTTAGGCAAGTTTCTTTTTGGTGTGGTTGTATGGAATAAACAACCCCTAAGGATGGCACAATGGTTGAAGACTTGGTTTTTTAGGGTATGCTCCCCTCAAGGTCTCA
GGTTCGAGACTAAGTTGTGACATTACTTCTTCGATGTAACTTAAGTGCAACTTCGCACAGGGGATAGTATATCATGCTTTTTTGTTATATTGTGTAAACTAACTAGTTTT
CACACCCCTCTTAAACATTTTAAGAATTATAAGTTTATTGATATTGATTTTTTCTTTTTGGGCAAGTGTAATGTACAAATTATTTAGTCATATATGAAATAATATTTGTT
GAATAAGACTTGTGGGCGCATAATATAAATGTTTCTTGACAGGTAAAGACAAACAAAACTGGTTATAAATAGGAAGCCCACAGTAGCAATGCTAAAATCCATAAGGATAA
GCTCCTTGCGATTGTTGCCTATCATGATTATGAAATATGGCAGATGGATGTTAAAACCGCTTTCTTGAATGCGTTCTTGAAAGAGGATGTGTATATGACACACCCTGAAG
GTTTTCAAGATCCAGCCAATCCTGGGAGAGTATACAGGCTTCAAAAATCTATTTATGGATTGAAAAAAGTATCTAGGAGTTGGAACCTCAGATTTGATGAGGCATTCAAA
CAGTTTAGGTTCCTTAAAAATGAAGAGGAATTTTGTGTATACAAGAAGTTAAGTGGGATCAGTGTTACCTTCCTTGTTCTGTATGTAGATGACATACTACTCATAGGAAA
CGATAGAACCATGCTTGAATCAGTCAAAGATTGACTTAAAAATTGTTCCTCTATGAAAGACATTGGAGAGGCTGAGTACATTCTAGGAATAAGAATCTATAGAGATAGAT
CCAAAAGAATGATTGGACTTAGTCAGGAAACTTATATTGATAAGGTTCTTACTAAGTTCAATATGGAAACTCTAAGAGAGGTTTCATTCCCATGCAACATGGCATATCGA
TTAGCAAGACTCAATGTCCTACAAGTCCTATTGAGGCTAAGCGTATGAGTATTGTTCCTTATGCTTTGGCAATAGGTTCAATCATGTATGCCATGATTTGTACTCGACCG
GACGATGTGCTCATGCTTTTAGCATATGTAACAGATACCAGTCTAATCCTAGTGATACACACTGGATAGTAGTGAAAAATATTCTTAAGTACTTGAGAAAAACGAAGGAT
AATTTTATGGTTTATGGTGGAGTTAATGAGTTGGTTGTTACTGGATAAACTGATGCAAACATTGCAGATCCACTGACTAAACCATTACCGCAACCCAAACATGAGAGTCA
TACTAGGACTATGAGTATTAGAC
Protein sequenceShow/hide protein sequence
MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVKD