CuGenDBv2

Sed0002870 (gene) of Chayote v1 genome

Gene ID: Sed0002870
Organism: Sechium edule (Chayote v1)
Description: Integrase catalytic domain-containing protein
Genome location: LG10:6788386..6790035
RNA-Seq Expression: Sed0002870
Synteny: Sed0002870
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR029472 - Retrotransposon Copia-like, N-terminal
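The genome location field packs a sequence name and a coordinate range into one string. A minimal Python sketch of splitting it apart follows; the names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the record).
        return self.end - self.start + 1


# seqid, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'LG10:6788386..6790035'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("LG10:6788386..6790035")
print(loc.seqid, loc.start, loc.end, loc.span)
```

The span covers the whole gene region on the linkage group, so it need not equal the length of the mRNA or CDS listed further down.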


Homology
GenBank top hits (columns: e-value, % identity, alignment)
GFY98609.1 haloacid dehalogenase-like hydrolase (HAD) superfamily protein [Actinidia rufa] | e-value: 2.1e-38 | identity: 47.8%
Query:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK
        V Q L  DNY +W+ +M++ALS++NK+GF+DG++ K  G+ T+ L+SW+ NNN+VI W+LNSVSK IS+SI+F+  A  IW DLK+RFQ+ + PR+FQL+
Subjt:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK

Query:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNEFSFIF
         +L    QDQ  V+ YF+KLK++W+E   Y+  CSCG C+  G + +N+  + E+   F
Subjt:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNEFSFIF

KAA8536734.1 hypothetical protein F0562_029212 [Nyssa sinensis] | e-value: 4.2e-39 | identity: 48.43%
Query:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK
        V Q L  +NY  WS +ML+ALS++NK+GFVDG +P+  G+  + L SW+ NNNIVI W+LNS+SK IS+SI+F   A+ IW DL++RFQ+++ PR+FQLK
Subjt:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK

Query:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNEFSFIF
         +L  L+Q+Q SV+ YF+K+K++W+E   Y+  CSCG C   G + +N + + E+   F
Subjt:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNEFSFIF

XP_008457013.1 PREDICTED: uncharacterized protein LOC103496792 [Cucumis melo] | e-value: 5.5e-39 | identity: 51.95%
Query:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK
        V + + ++NYV WS SM+LA+SI+NK+GF+D  + K  G L   L  W+ NNN+VI W+LNS SK I SSILFT  A+  W DL++ FQ+++ PR+FQLK
Subjt:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK

Query:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNE
        C L+TLKQDQ+SVT YF+ +KSLWDEY +Y   C+CG C+  G +++  F+  E
Subjt:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNE

XP_022154973.1 uncharacterized protein LOC111022117 [Momordica charantia] | e-value: 7.7e-41 | identity: 53.9%
Query:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK
        V +PL   NYV+WS SM +ALSI+NK+GF++G+LPK  G L   L  W+ N ++VI W LNSVSK IS+S++FT+    IW DLK+RFQ ++ P++FQL+
Subjt:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK

Query:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNE
         DLATL QDQ SVT Y++KLK+LWDEY +Y+  C+CG CS  G+R +  F++ E
Subjt:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNE

XP_038895765.1 uncharacterized protein LOC120083929 [Benincasa hispida] | e-value: 2.4e-42 | identity: 54.09%
Query:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK
        V + L +DNYV+WS SM+L L I+NK+GF+DG+LP+ TG L H    W+HNNN+V+ W+L SVSKSISSSILFT+ AQ IW DL++ FQR++ PR+F LK
Subjt:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK

Query:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNEFSFIF
         +L++LKQDQ SVT YF+K+KS  DEY +Y+  C+CG C+  G +++  F++ E+   F
Subjt:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNEFSFIF

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A1S3C5T4 uncharacterized protein LOC103496792 | e-value: 2.7e-39 | identity: 51.95%
Query:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK
        V + + ++NYV WS SM+LA+SI+NK+GF+D  + K  G L   L  W+ NNN+VI W+LNS SK I SSILFT  A+  W DL++ FQ+++ PR+FQLK
Subjt:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK

Query:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNE
        C L+TLKQDQ+SVT YF+ +KSLWDEY +Y   C+CG C+  G +++  F+  E
Subjt:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNE

A0A5J5B2C5 Uncharacterized protein | e-value: 2.0e-39 | identity: 48.43%
Query:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK
        V Q L  +NY  WS +ML+ALS++NK+GFVDG +P+  G+  + L SW+ NNNIVI W+LNS+SK IS+SI+F   A+ IW DL++RFQ+++ PR+FQLK
Subjt:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK

Query:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNEFSFIF
         +L  L+Q+Q SV+ YF+K+K++W+E   Y+  CSCG C   G + +N + + E+   F
Subjt:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNEFSFIF

A0A6J1DKR8 uncharacterized protein LOC111021831 | e-value: 1.3e-38 | identity: 57.86%
Query:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK
        V QPL E+NY +WS SML+ALSI+NK+GF+DG++ +  G L   L +W+HNN++VI W+LNSVSK ISSSILF++ A+ IW DLK RF++ + PR+FQLK
Subjt:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK

Query:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCS
         DLA L Q+QQSV+ YF+KLK++WDE   Y+  CSC   S
Subjt:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCS

A0A6J1DLQ9 uncharacterized protein LOC111022117 | e-value: 3.7e-41 | identity: 53.9%
Query:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK
        V +PL   NYV+WS SM +ALSI+NK+GF++G+LPK  G L   L  W+ N ++VI W LNSVSK IS+S++FT+    IW DLK+RFQ ++ P++FQL+
Subjt:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK

Query:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNE
         DLATL QDQ SVT Y++KLK+LWDEY +Y+  C+CG CS  G+R +  F++ E
Subjt:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNE

A0A7J0FKC9 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | e-value: 1.0e-38 | identity: 47.8%
Query:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK
        V Q L  DNY +W+ +M++ALS++NK+GF+DG++ K  G+ T+ L+SW+ NNN+VI W+LNSVSK IS+SI+F+  A  IW DLK+RFQ+ + PR+FQL+
Subjt:  VHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLK

Query:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNEFSFIF
         +L    QDQ  V+ YF+KLK++W+E   Y+  CSCG C+  G + +N+  + E+   F
Subjt:  CDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNEFSFIF

SwissProt top hits (columns: e-value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e-value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e-value: 3.2e-21 | identity: 38.24%
Query:  EDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLKCDLATL
        EDNYV W       L +  K GF+DGTLPK     +     W   N +V+ WL+NS++  +  S+++ + A  +WEDL+  F    D +++QL+  LATL
Subjt:  EDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNRFQRKSDPRVFQLKCDLATL

Query:  KQDQQSVTTYFSKLKSLWDEYRTYQ--SECSCGLCS
        +Q   SV  YF KL  +W E   Y    EC CG C+
Subjt:  KQDQQSVTTYFSKLKSLWDEYRTYQ--SECSCGLCS
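In the alignments above, the line between Query and Subjt marks each column in the usual BLAST style: a letter is an identical residue, "+" is a conservative substitution, and a blank is a mismatch or gap. A minimal Python sketch, assuming that convention, counts identities and positives from that middle line for the last Arabidopsis block. The function name `midline_stats` is hypothetical, and a per-block figure need not equal the %identity reported for the whole alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    Assumes the BLAST midline convention: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    # Trailing spaces are often stripped from the midline; pad it back
    # so that every alignment column is represented.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": 100.0 * identities / length,
    }


# Last block of the AT1G21280.1 alignment, with the 8-character
# "Query:  " prefix (and the matching midline indent) removed.
query_row = "KQDQQSVTTYFSKLKSLWDEYRTYQ--SECSCGLCS"
midline_row = "+Q   SV  YF KL  +W E   Y    EC CG C+"

stats = midline_stats(query_row, midline_row)
print(stats)
```

Gap columns ("-" in the query row) are counted in the block length here, which is one common way to define percent identity; other definitions exclude them.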


Sequences
CDS sequence
ATGATGTCAACCGAAGTTCTGTCTCAAACGAACGATTCGCCTCTTCCGGAAGTTCAGACATCTTCAGAAGTTCATCAACCCCTTGCTGAAGACAACTATGTGACTTGGAG
TCACTCGATGTTGTTGGCCCTTTCCATCCGAAACAAGGTAGGTTTCGTTGATGGCACCTTGCCGAAGTCCACAGGTTCTTTAACCCATTCTTTATCTTCCTGGATGCACA
ATAACAACATTGTCATCGGTTGGCTTCTTAATTCTGTGTCGAAGTCGATTTCTTCAAGCATTCTTTTCACGGATTTGGCACAAACAATCTGGGAAGATCTCAAGAATCGT
TTTCAACGAAAGAGTGACCCTCGTGTCTTCCAATTGAAGTGCGATTTAGCAACATTGAAACAAGATCAGCAATCTGTGACTACCTATTTCTCCAAATTGAAATCTCTTTG
GGATGAGTATCGAACTTATCAATCAGAGTGTTCTTGTGGTTTGTGTTCTTATAGAGGTCACAGGGCTATCAATGCTTTTATAAAAAATGAGTTCTCATTTATTTTTTAA
mRNA sequence
GAAGACTTCCAAGCGTTTAGCCCAGTGGGTACAAATAAGTTGGGTGGGTTATTTTAACTCGGTGCGTGGATTGCACTCCAAGAAATAACCTGGCGAGTAACTCTCCCTCT
TCTCGGTGGTTTTCGGCGCATTTCTTCTCTCGATTCTTTCACGATCAATGATTTCTCATATTTCTCCATGTTGATTCGGCTAGTAGGACTTGCAGGAGATTTGGTGCGAG
AGCTGTATTTTGTATATTGTTGGAGAAATTCACCTTATGGAAATTGTCCCACATTGGAAATGTGAGGATTCTACAACACCTTTATAAGGAATATGAGTTACTCCCCTCAT
TGCCAAATAGTTTTGAGGTGGAACCTCATATTCTCTAATATGGTATCAGAGCTCTTATCGCTTTCATCTTCTTGATTCGTTCTTTGAATCCTTTTTTCATTTCATCTTCA
TGATGTCAACCGAAGTTCTGTCTCAAACGAACGATTCGCCTCTTCCGGAAGTTCAGACATCTTCAGAAGTTCATCAACCCCTTGCTGAAGACAACTATGTGACTTGGAGT
CACTCGATGTTGTTGGCCCTTTCCATCCGAAACAAGGTAGGTTTCGTTGATGGCACCTTGCCGAAGTCCACAGGTTCTTTAACCCATTCTTTATCTTCCTGGATGCACAA
TAACAACATTGTCATCGGTTGGCTTCTTAATTCTGTGTCGAAGTCGATTTCTTCAAGCATTCTTTTCACGGATTTGGCACAAACAATCTGGGAAGATCTCAAGAATCGTT
TTCAACGAAAGAGTGACCCTCGTGTCTTCCAATTGAAGTGCGATTTAGCAACATTGAAACAAGATCAGCAATCTGTGACTACCTATTTCTCCAAATTGAAATCTCTTTGG
GATGAGTATCGAACTTATCAATCAGAGTGTTCTTGTGGTTTGTGTTCTTATAGAGGTCACAGGGCTATCAATGCTTTTATAAAAAATGAGTTCTCATTTATTTTTTAATG
GGACTCAATGAATCTTTCGATCATGCTCGATCTTAAATCTTGCTCATGGATCGCCAACCTGAAACTTCCAAGGCGTTTTCTCTTATATCTCAAGAAAAGCAACAACGTCA
ATTACCTCTTCTTGCTTAACCACCTGCTATTTCTCTTGTTGTTGCTCAGGGTAATGGTTTTCGCCATGATAGACCCTTTAGCACCAATTGCGATCGTCAAGGACACACCA
TAGACAAGTTCTATCGTCTGCACGAGTTTCCACCGAGTTATCGCAATTGCAATAACCCATCAGTGAACAATATGACTGCCCCATTTTTTGTTGCAAACCCATCAGTGAAC
AATGTGGCTACAACTCAAGCCTGTGATATCTCTACTCCTACAAAAACTATCATTGACAACAAATGATTGAGCCCAATATCAAAATATCTACATGCTAAGTTGAATGCAAC
CAAGATTTAACCAAAAATTTCTTCTACGCATCTTGTAGGTAAGAGCAATTATCTTATGACTCTTTCCTCTAAATCCATTAAATCATCCCCTACAAGTTGGATATTAGATT
CAGGTGCTGTTGCTCATATCTGTTTTCATCGCCC
Protein sequence
MMSTEVLSQTNDSPLPEVQTSSEVHQPLAEDNYVTWSHSMLLALSIRNKVGFVDGTLPKSTGSLTHSLSSWMHNNNIVIGWLLNSVSKSISSSILFTDLAQTIWEDLKNR
FQRKSDPRVFQLKCDLATLKQDQQSVTTYFSKLKSLWDEYRTYQSECSCGLCSYRGHRAINAFIKNEFSFIF
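The protein sequence is the conceptual translation of the CDS. A minimal Python sketch of that translation follows, assuming the standard nuclear genetic code (the record does not name a translation table); the function name `translate` is hypothetical. To keep the example short it uses only the first 30 and last 27 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Unknown or ambiguous codons are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGATGTCAACCGAAGTTCTGTCTCAAACG"  # first 30 nt of the CDS
cds_end = "AAAAATGAGTTCTCATTTATTTTTTAA"      # last 27 nt, ending in the TAA stop

print(translate(cds_start))  # start of the protein sequence
print(translate(cds_end))    # end of the protein sequence
```

Running the same function over the full CDS should reproduce the protein sequence listed above, provided the CDS lines are joined without line breaks.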