CuGenDBv2

Sed0006613 (gene) of Chayote v1 genome

Gene ID: Sed0006613
Organism: Sechium edule (Chayote v1)
Description: Integrase catalytic domain-containing protein
Genome location: LG14:14990314..14992134
RNA-Seq Expression: Sed0006613
Synteny: Sed0006613
Gene Ontology terms:
  GO:0044237 - cellular metabolic process (biological process)
  GO:0005488 - binding (molecular function)
InterPro domains: NA
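
The genome location above follows a scaffold:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the string can be split into its parts and the span of the locus computed. The helper names parse_location and span_length are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval in base pairs.

    Assumption: coordinates are 1-based and inclusive, which the
    record itself does not confirm.
    """
    return abs(end - start) + 1


scaffold, start, end = parse_location("LG14:14990314..14992134")
print(scaffold, start, end, span_length(start, end))
```

Under that assumption the locus spans 1821 bp on LG14, which is longer than the mRNA listed further down and would be consistent with the gene containing introns.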


Homology

GenBank top hits

KAA0042148.1 putative retroelement pol polyprotein [Cucumis melo var. makuwa] (e-value: 5.4e-23; % identity: 38.33)
Query:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPPG
        M L+++   AR+Q+LL++P+PSVS+AF+L++QE  Q+++    + + + ALA+S ++N      + Q     +DRP+CTH  + GHT D+CYK+HGY P 
Subjt:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPPG

Query:  YKPK-----NARDVTTTGNSSNSPATIALTATPSSQEIRD--SAIAQCHNILVVLQTTLATTQPASDFTSFHVTGQVHYE
        YKPK     N+   T++ NS+NS  T+    +P+   I +   A+ QC N+L  LQ   A +   S   + H+ GQVH E
Subjt:  YKPK-----NARDVTTTGNSSNSPATIALTATPSSQEIRD--SAIAQCHNILVVLQTTLATTQPASDFTSFHVTGQVHYE

XP_022141216.1 uncharacterized protein LOC111011669 [Momordica charantia] (e-value: 1.4e-23; % identity: 37.37)
Query:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSC-----RDRPVCTHYGLIGHTIDRCYKIH
        M LN++F   R+Q+LLM+P  ++++AF+L+ QE  QR+   L T SS+ A ++   + L +T   S ++R+      ++RP CTH  L GHT+DRCYK+H
Subjt:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSC-----RDRPVCTHYGLIGHTIDRCYKIH

Query:  GYPPGY----KPKNARDVTTTGNSSNSPATIALTATPSSQEIRDSAIAQCHNILVVLQTTLATTQPASD---FTSFHVTGQVHYEDDWQG
        GYPPG+    K  N+   +    ++ S  T ++ ++ ++  + +    Q   +L  LQ+ LA+ +P SD    +S HV GQV +EDDWQG
Subjt:  GYPPGY----KPKNARDVTTTGNSSNSPATIALTATPSSQEIRDSAIAQCHNILVVLQTTLATTQPASD---FTSFHVTGQVHYEDDWQG

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia] (e-value: 5.4e-23; % identity: 36.68)
Query:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPPG
        M LN+++   R+QILLMDP+P ++K F+L+IQEE QR++  +  P  S+A+AV++ S       + +      +R  CTH GL GH ID+CYK+HGYPPG
Subjt:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPPG

Query:  YKPKN--AR--DVTTTGNSSNSPATIA---------LTATPSSQEIRDSAIA--------QCHNILVVLQTTLATTQPASDFTSFHVTGQVHYEDDWQG
        Y+  N  AR   +     +S+S   +A         +T++P+ Q   +S+ A        Q   ++ +LQ+ L   +P +     HV GQV  E+DWQG
Subjt:  YKPKN--AR--DVTTTGNSSNSPATIA---------LTATPSSQEIRDSAIA--------QCHNILVVLQTTLATTQPASDFTSFHVTGQVHYEDDWQG

XP_022154973.1 uncharacterized protein LOC111022117 [Momordica charantia] (e-value: 2.0e-30; % identity: 42.42)
Query:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQT-SNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPP
        M LN++F   R+QILLMDP PS+ KAF+LI QEE QR +PL  TPS ++ LAV+Q+ S+     G  Q + SC   P CT+ G+ GHT+D+CY++HG+P 
Subjt:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQT-SNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPP

Query:  GYKPK------------NARDVTTTGNSSNSPAT-----IALTATPSS--QEIRDSAIAQCHNILVVLQTTLATTQPASDFTSFHVTGQVHYEDDWQG
        GY+ K            +    T++  +SNSP+T     I+ T+  SS    +   A +QCHNIL +LQ+ L   +  S+  S ++ G+VH++DDWQG
Subjt:  GYKPK------------NARDVTTTGNSSNSPAT-----IALTATPSS--QEIRDSAIAQCHNILVVLQTTLATTQPASDFTSFHVTGQVHYEDDWQG

XP_022158788.1 uncharacterized protein LOC111025254 [Momordica charantia] (e-value: 7.0e-23; % identity: 36.36)
Query:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRS---CRDRPVCTHYGLIGHTIDRCYKIHGY
        M LND+F    +Q+LLM+P PS+++  +L+ QE  QR++  L  PS +  L  + +    ++P  + SS S    +D+PVCTH G+IGHT D+CY++HGY
Subjt:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRS---CRDRPVCTHYGLIGHTIDRCYKIHGY

Query:  PPGYKPKNARDVTTTGNSSNSPATIALTATPSSQEIRDSAIA----QCHNILVVLQTTLATTQPASDFTSF--HVTGQVHYEDDWQG
        PPG++    +   +  ++S S +    +A+  S    DS  +    QC  +L +L + L+T Q  +D  S   HV G V +E+ WQG
Subjt:  PPGYKPKNARDVTTTGNSSNSPATIALTATPSSQEIRDSAIA----QCHNILVVLQTTLATTQPASDFTSF--HVTGQVHYEDDWQG

TrEMBL top hits

A0A5D3DT02 Putative retroelement pol polyprotein (e-value: 2.6e-23; % identity: 38.33)
Query:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPPG
        M L+++   AR+Q+LL++P+PSVS+AF+L++QE  Q+++    + + + ALA+S ++N      + Q     +DRP+CTH  + GHT D+CYK+HGY P 
Subjt:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPPG

Query:  YKPK-----NARDVTTTGNSSNSPATIALTATPSSQEIRD--SAIAQCHNILVVLQTTLATTQPASDFTSFHVTGQVHYE
        YKPK     N+   T++ NS+NS  T+    +P+   I +   A+ QC N+L  LQ   A +   S   + H+ GQVH E
Subjt:  YKPK-----NARDVTTTGNSSNSPATIALTATPSSQEIRD--SAIAQCHNILVVLQTTLATTQPASDFTSFHVTGQVHYE

A0A6J1CIG1 uncharacterized protein LOC111011669 (e-value: 6.9e-24; % identity: 37.37)
Query:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSC-----RDRPVCTHYGLIGHTIDRCYKIH
        M LN++F   R+Q+LLM+P  ++++AF+L+ QE  QR+   L T SS+ A ++   + L +T   S ++R+      ++RP CTH  L GHT+DRCYK+H
Subjt:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSC-----RDRPVCTHYGLIGHTIDRCYKIH

Query:  GYPPGY----KPKNARDVTTTGNSSNSPATIALTATPSSQEIRDSAIAQCHNILVVLQTTLATTQPASD---FTSFHVTGQVHYEDDWQG
        GYPPG+    K  N+   +    ++ S  T ++ ++ ++  + +    Q   +L  LQ+ LA+ +P SD    +S HV GQV +EDDWQG
Subjt:  GYPPGY----KPKNARDVTTTGNSSNSPATIALTATPSSQEIRDSAIAQCHNILVVLQTTLATTQPASD---FTSFHVTGQVHYEDDWQG

A0A6J1CXR2 uncharacterized protein LOC111015239 (e-value: 2.6e-23; % identity: 36.68)
Query:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPPG
        M LN+++   R+QILLMDP+P ++K F+L+IQEE QR++  +  P  S+A+AV++ S       + +      +R  CTH GL GH ID+CYK+HGYPPG
Subjt:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPPG

Query:  YKPKN--AR--DVTTTGNSSNSPATIA---------LTATPSSQEIRDSAIA--------QCHNILVVLQTTLATTQPASDFTSFHVTGQVHYEDDWQG
        Y+  N  AR   +     +S+S   +A         +T++P+ Q   +S+ A        Q   ++ +LQ+ L   +P +     HV GQV  E+DWQG
Subjt:  YKPKN--AR--DVTTTGNSSNSPATIA---------LTATPSSQEIRDSAIA--------QCHNILVVLQTTLATTQPASDFTSFHVTGQVHYEDDWQG

A0A6J1DLQ9 uncharacterized protein LOC111022117 (e-value: 9.9e-31; % identity: 42.42)
Query:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQT-SNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPP
        M LN++F   R+QILLMDP PS+ KAF+LI QEE QR +PL  TPS ++ LAV+Q+ S+     G  Q + SC   P CT+ G+ GHT+D+CY++HG+P 
Subjt:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQT-SNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPP

Query:  GYKPK------------NARDVTTTGNSSNSPAT-----IALTATPSS--QEIRDSAIAQCHNILVVLQTTLATTQPASDFTSFHVTGQVHYEDDWQG
        GY+ K            +    T++  +SNSP+T     I+ T+  SS    +   A +QCHNIL +LQ+ L   +  S+  S ++ G+VH++DDWQG
Subjt:  GYKPK------------NARDVTTTGNSSNSPAT-----IALTATPSS--QEIRDSAIAQCHNILVVLQTTLATTQPASDFTSFHVTGQVHYEDDWQG

A0A6J1DX32 uncharacterized protein LOC111025254 (e-value: 3.4e-23; % identity: 36.36)
Query:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRS---CRDRPVCTHYGLIGHTIDRCYKIHGY
        M LND+F    +Q+LLM+P PS+++  +L+ QE  QR++  L  PS +  L  + +    ++P  + SS S    +D+PVCTH G+IGHT D+CY++HGY
Subjt:  MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRS---CRDRPVCTHYGLIGHTIDRCYKIHGY

Query:  PPGYKPKNARDVTTTGNSSNSPATIALTATPSSQEIRDSAIA----QCHNILVVLQTTLATTQPASDFTSF--HVTGQVHYEDDWQG
        PPG++    +   +  ++S S +    +A+  S    DS  +    QC  +L +L + L+T Q  +D  S   HV G V +E+ WQG
Subjt:  PPGYKPKNARDVTTTGNSSNSPATIALTATPSSQEIRDSAIA----QCHNILVVLQTTLATTQPASDFTSF--HVTGQVHYEDDWQG

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
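
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts identities and positives from such a midline. The function name midline_stats is hypothetical, and the input is only the first 16 columns of the KAA0042148.1 alignment.

```python
def midline_stats(midline):
    """Count identities and positives in a BLAST-style midline string.

    Letters are identical residues; '+' marks a conservative
    substitution, so positives = identities + number of '+' symbols.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


# First 16 alignment columns of the KAA0042148.1 hit.
query = "MRLNDTFGPARSQILL"
midline = "M L+++   AR+Q+LL"

identities, positives = midline_stats(midline)
print(identities, positives, len(query))
```

A percentage taken from a fragment like this is not comparable to the % identity values listed for each hit, which are presumably computed over the full alignment length including gap columns.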

Sequences

CDS sequence
ATGAGGCTCAATGATACTTTTGGACCGGCTCGTTCCCAAATTTTATTGATGGATCCACTGCCTTCGGTGAGTAAGGCATTTGCCCTCATAATCCAAGAAGAGCATCAACG
CTCAATGCCTCTTCTCCCCACTCCATCTTCGTCTATCGCTCTGGCTGTTTCTCAGACTTCCAATTTGATGAAGACCCCTGGCTATTCTCAGTCTTCTCGGTCATGCAGAG
ATCGCCCAGTTTGCACACACTATGGATTGATCGGACATACCATCGATCGTTGCTATAAGATCCATGGTTATCCCCCTGGTTATAAACCCAAGAATGCTCGAGATGTGACT
ACTACAGGAAATTCTTCAAATTCTCCCGCCACCATAGCATTGACTGCTACACCTTCATCCCAAGAAATTCGTGATTCTGCCATTGCACAGTGCCACAACATTCTTGTTGT
GTTGCAAACCACTCTTGCAACAACTCAACCTGCTTCTGATTTTACTTCCTTCCATGTTACAGGACAAGTGCACTATGAGGATGATTGGCAAGGGTAG
mRNA sequence
AATCATCGCGTGCTTTTCTCTTCTTCTTCCCCAAAAATTAGGGTTCGTCTTCTTCGCTCGATCCTCCATGGATACCACTGAGTTCGTTCTGGTTCAAGCTCCAGTTCTTC
TACATCTTCTGATTCATCCGCCTCCAAAATCATTCAAAGCCCGTATTCCTTCACTTCCCATGATACTTCCAATCTCGTTCTGGTTTCATATCTCCTCAATGAAGACAATT
ATCTCACATGGACGCGATCTATGCGACTAGGCTTATCGATCAGGAATAAACTCATAATGGTTCGATTGAAAATCCTACCGGTCCTCTACTTCAATCTTGGATCCGAAGTA
ATAACATCGTAATCGCTTGGATTCTCAACTCGGTTTCGAAGGGCATTTCTTCGAGTATTTTGTTTTCTGAAGATGCTCGTGCTATTTGGTTGGATTTGAAGGAACGATTT
CAACGCAAACATGGGCTTCACATATAGCAATTGAAACGAGATCTTGCCGTTCTCACACAAGGTCAGCAATCTGTATCTGTTTATTTTTCCAAGCTCAAGACTATTTGGGA
TGAACTGGATACATATCGTCCATCTTATTCTTGTAATCTCTGTTCTTGTGGAGGCAATAAGGCAATCACAAATTTCTTCCACAATGAGTATCTGCTCTGTTTCTTGATGA
GGCTCAATGATACTTTTGGACCGGCTCGTTCCCAAATTTTATTGATGGATCCACTGCCTTCGGTGAGTAAGGCATTTGCCCTCATAATCCAAGAAGAGCATCAACGCTCA
ATGCCTCTTCTCCCCACTCCATCTTCGTCTATCGCTCTGGCTGTTTCTCAGACTTCCAATTTGATGAAGACCCCTGGCTATTCTCAGTCTTCTCGGTCATGCAGAGATCG
CCCAGTTTGCACACACTATGGATTGATCGGACATACCATCGATCGTTGCTATAAGATCCATGGTTATCCCCCTGGTTATAAACCCAAGAATGCTCGAGATGTGACTACTA
CAGGAAATTCTTCAAATTCTCCCGCCACCATAGCATTGACTGCTACACCTTCATCCCAAGAAATTCGTGATTCTGCCATTGCACAGTGCCACAACATTCTTGTTGTGTTG
CAAACCACTCTTGCAACAACTCAACCTGCTTCTGATTTTACTTCCTTCCATGTTACAGGACAAGTGCACTATGAGGATGATTGGCAAGGGTAGTCTACATGATGGTTGCT
ACGTACTGCACTACTCTCCCTCCACTATAGTTGCTTCTGTAAAGAAAGTATCAGCAACAACCTGGCATGCAAGACTTGGTCACCCTTCCTTTTCTCGTTTAAATGTACAT
AAGGATAGTCTTTGCTTAAATTTCCCTAAATCTCTACATGATATACCATGTGAGATATGTCCTTTGTCCAAGCAAAAGAAACTATCATTTGAATGCAATAACAATTTGTC
TTTGAATATTTTTTATCTTATTCATGCTGATACTTGGGGCCCATTCTCGGTTGCCTCTACTAACGGATACAGATACTTTTTAAC
Protein sequence
MRLNDTFGPARSQILLMDPLPSVSKAFALIIQEEHQRSMPLLPTPSSSIALAVSQTSNLMKTPGYSQSSRSCRDRPVCTHYGLIGHTIDRCYKIHGYPPGYKPKNARDVT
TTGNSSNSPATIALTATPSSQEIRDSAIAQCHNILVVLQTTLATTQPASDFTSFHVTGQVHYEDDWQG