; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027111 (gene) of Chayote v1 genome

Gene IDSed0027111
OrganismSechium edule (Chayote v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationLG06:34679945..34682316
RNA-Seq ExpressionSed0027111
SyntenySed0027111
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]1.3e-4959.41Show/hide
Query:  YAAQVSTDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRK
        +    S+  TT       AT G  SS TP R+VN L+E WVT D L+LGWLYNSMT +VA Q+MG+   +DLWDA Q  FG+ SRAEED+LRQ+ QTTRK
Subjt:  YAAQVSTDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRK

Query:  GNSKMSEYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEK
        GN+KM EYL VMK + DNLGQ+GS +  RALISQVLLGLDE YN V+  +QGKPD+ W+++Q++LL+FEK
Subjt:  GNSKMSEYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEK

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia]1.7e-4966.45Show/hide
Query:  ESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVM
        E+S T  +    SS      +NPLYESWVT DQL+LGWLYNSMT EVATQVMGYE + DLW A+Q LFG+ S+AEEDYLRQ+FQ TRKG+ KM+++LRVM
Subjt:  ESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVM

Query:  KCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNE
        K HADNLGQ GS +  R+LISQVLLGLDEEYNPVVAT+QGK  + W  +Q E
Subjt:  KCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNE

XP_038904321.1 uncharacterized protein LOC120090675 [Benincasa hispida]5.0e-4960.51Show/hide
Query:  SSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVMKC
        S+ G++  G SS++ T  VNP Y +W+ +DQL+LGWLYNSMT ++A QVMG+E ++DLW  +Q LFGI SRAEEDYLR +FQTTRKGN KM +YLR MK 
Subjt:  SSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVMKC

Query:  HADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEKR
        + DNL Q GS +  RAL+ QVLLGLDEEYN +VAT+QG+ DM W+++Q++LLL+E+R
Subjt:  HADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEKR

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida]1.6e-4759.04Show/hide
Query:  STDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKM
        ST       S  G++  G SS+     VNP YESW+ +DQL+LGWLYNSMT EVA QVMG E +KDLW ++  LFG+ SR EEDYLR +FQTTRKGN KM
Subjt:  STDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKM

Query:  SEYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEKR
         EYL+ MK + DNL Q GS +  R L+SQVLLGLDEEYN +VA +QG+ DM W+++Q+ELLL+E+R
Subjt:  SEYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEKR

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida]1.6e-4759.04Show/hide
Query:  STDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKM
        ST       S  G++  G SS+     VNP YESW+ +DQL+LGWLYNSMT EVA QVMG E +KDLW ++  LFG+ SR EEDYLR +FQTTRKGN KM
Subjt:  STDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKM

Query:  SEYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEKR
         EYL+ MK + DNL Q GS +  R L+SQVLLGLDEEYN +VA +QG+ DM W+++Q+ELLL+E+R
Subjt:  SEYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEKR

TrEMBL top hitse value%identityAlignment
A0A5A7SIT7 Uncharacterized protein6.3e-5059.41Show/hide
Query:  YAAQVSTDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRK
        +    S+  TT       AT G  SS TP R+VN L+E WVT D L+LGWLYNSMT +VA Q+MG+   +DLWDA Q  FG+ SRAEED+LRQ+ QTTRK
Subjt:  YAAQVSTDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRK

Query:  GNSKMSEYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEK
        GN+KM EYL VMK + DNLGQ+GS +  RALISQVLLGLDE YN V+  +QGKPD+ W+++Q++LL+FEK
Subjt:  GNSKMSEYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEK

A0A5A7UY76 Integrase, catalytic core2.7e-4064.23Show/hide
Query:  EESSSTG---ATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEY
        +ES S G   ATDG  S +T T+ VNP ++ WVT D L+LGW+YNSMT EVA Q+MG+ T+KDL +A+Q LFG+ SR EED+LR  FQTTRKGNSKM +Y
Subjt:  EESSSTG---ATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEY

Query:  LRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNP
        LR+MK +A+NLGQ GS I  R+LISQVLLGLDE YNP
Subjt:  LRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNP

A0A5D3E3L7 Uncharacterized protein1.9e-4658.28Show/hide
Query:  TDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMS
        T G    S  +  T    S++  +++VNP YE WVT D L+LG +YNSM  +VA Q+MG+ T+KDLW+A+Q LFGI SRAEE +LR  FQTTR+GN KM 
Subjt:  TDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMS

Query:  EYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFE
        +YLR+MK +ADNLGQ GS +  R LISQVLLGLDE YNPV A +QGKPD+ W+++Q+ELL+FE
Subjt:  EYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFE

A0A6J1D5J0 uncharacterized protein LOC1110175018.3e-5066.45Show/hide
Query:  ESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVM
        E+S T  +    SS      +NPLYESWVT DQL+LGWLYNSMT EVATQVMGYE + DLW A+Q LFG+ S+AEEDYLRQ+FQ TRKG+ KM+++LRVM
Subjt:  ESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVM

Query:  KCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNE
        K HADNLGQ GS +  R+LISQVLLGLDEEYNPVVAT+QGK  + W  +Q E
Subjt:  KCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNE

A0A6J1DCW4 uncharacterized protein LOC1110195985.7e-4355.69Show/hide
Query:  VSTDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSK
        V TD  T    ST       S ++PT  +NP YE+W+ +D+L+LGWLYNSM  +VA QVMG+ TS++LW AVQ LFG+ SRAE DYL+Q+FQ T KG+ +
Subjt:  VSTDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSK

Query:  MSEYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEKR
        M EYL++MK HADNL   GS +S R L+SQVL GLDEEYNP+V  VQGK ++ W  +  ELL +EKR
Subjt:  MSEYLRVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEKR

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.0e-1448.72Show/hide
Query:  RTFGCACFPNLRPYQKHKFDFHSAKCVYLGPSTNHKGFKCLH-PDGRMIITRHVTFCENEFPFATTFAAPKVEPPQQQ
        R FGCAC+P LRPY +HK D  S +CV+LG S     + CLH    R+ I+RHV F EN FPF+   A   + P Q+Q
Subjt:  RTFGCACFPNLRPYQKHKFDFHSAKCVYLGPSTNHKGFKCLH-PDGRMIITRHVTFCENEFPFATTFAAPKVEPPQQQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.5e-1348.57Show/hide
Query:  FEK-RTFGCACFPNLRPYQKHKFDFHSAKCVYLGPSTNHKGFKCLH-PDGRMIITRHVTFCENEFPFATT
        +EK + FGCAC+P LRPY +HK +  S +C ++G S     + CLH P GR+  +RHV F E  FPF+TT
Subjt:  FEK-RTFGCACFPNLRPYQKHKFDFHSAKCVYLGPSTNHKGFKCLH-PDGRMIITRHVTFCENEFPFATT

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).4.2e-0627.62Show/hide
Query:  NPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVMKCHADNLGQIGSKISDRALIS
        +PLY+ W   + +V+ WL NSMT ++   VM  ET+  +W+ ++ +F      +   LR+   T R+G   + EY          L ++  ++S+ A I 
Subjt:  NPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVMKCHADNLGQIGSKISDRALIS

Query:  QVLLG
        +   G
Subjt:  QVLLG

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.2e-0525.44Show/hide
Query:  SWVTIDQLVLGWLYNSMT-QEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVMKCHADNLGQIGSKISDRALISQVLL
        +W   D +V   LY ++T ++     +   TS+D+W  ++  F  +  A    L    +T   G+ ++++Y R MK  AD+L  +   ++DR L+  VL 
Subjt:  SWVTIDQLVLGWLYNSMT-QEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVMKCHADNLGQIGSKISDRALISQVLL

Query:  GLDEEYNPVVATVQ
        GL+ +++ ++  ++
Subjt:  GLDEEYNPVVATVQ

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.7e-1027.92Show/hide
Query:  GATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYE-TSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVMKCHA
        G  DG   S+TPT +     + W   D LV  W+Y ++T  +   ++    T++DLW +++ LF  +  A         +TT   +  + EY + +K  +
Subjt:  GATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYE-TSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYLRVMKCHA

Query:  DNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEK
        D L  + S ISDR L+  +L GL E+Y+ ++  ++ K           +LL E+
Subjt:  DNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGCTGCTCAAGTTTCCACAGATGGTACAACAGAAGAAAGCTCCAGTACCGGTGCAACTGATGGTGGCAAGTCTAGTGCAACACCTACTCGAGTGGTGAATCCTCT
TTACGAGTCGTGGGTCACCATTGATCAATTGGTACTTGGGTGGTTATACAACTCAATGACACAAGAGGTGGCTACACAAGTCATGGGGTATGAAACCTCCAAGGATTTAT
GGGATGCTGTCCAAACCTTGTTTGGAATCCACTCAAGAGCAGAAGAAGACTATCTACGTCAACTCTTTCAAACGACGAGAAAAGGTAACTCGAAAATGAGTGAATATTTA
CGTGTTATGAAATGTCATGCAGACAATCTTGGCCAGATTGGAAGCAAAATCTCAGATCGAGCATTGATCTCCCAAGTTCTCTTGGGACTTGATGAGGAATACAATCCCGT
CGTAGCCACGGTGCAAGGAAAACCTGATATGCGTTGGATCAACCTGCAAAATGAACTTCTTCTCTTTGAAAAAAGAACGTTTGGATGTGCCTGCTTCCCCAATTTACGTC
CTTATCAAAAACATAAGTTTGATTTTCATTCTGCTAAATGTGTTTACTTAGGACCAAGCACTAATCATAAAGGTTTCAAATGCCTCCATCCTGATGGGAGGATGATCATC
ACACGACATGTCACCTTCTGTGAAAATGAATTTCCCTTTGCGACTACATTCGCTGCTCCAAAAGTTGAGCCACCACAACAACAGGTTGATGTCACTCGCAACTGCCTCTT
TTGCATTGGTTTCCAGTACCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATGCTGCTCAAGTTTCCACAGATGGTACAACAGAAGAAAGCTCCAGTACCGGTGCAACTGATGGTGGCAAGTCTAGTGCAACACCTACTCGAGTGGTGAATCCTCT
TTACGAGTCGTGGGTCACCATTGATCAATTGGTACTTGGGTGGTTATACAACTCAATGACACAAGAGGTGGCTACACAAGTCATGGGGTATGAAACCTCCAAGGATTTAT
GGGATGCTGTCCAAACCTTGTTTGGAATCCACTCAAGAGCAGAAGAAGACTATCTACGTCAACTCTTTCAAACGACGAGAAAAGGTAACTCGAAAATGAGTGAATATTTA
CGTGTTATGAAATGTCATGCAGACAATCTTGGCCAGATTGGAAGCAAAATCTCAGATCGAGCATTGATCTCCCAAGTTCTCTTGGGACTTGATGAGGAATACAATCCCGT
CGTAGCCACGGTGCAAGGAAAACCTGATATGCGTTGGATCAACCTGCAAAATGAACTTCTTCTCTTTGAAAAAAGAACGTTTGGATGTGCCTGCTTCCCCAATTTACGTC
CTTATCAAAAACATAAGTTTGATTTTCATTCTGCTAAATGTGTTTACTTAGGACCAAGCACTAATCATAAAGGTTTCAAATGCCTCCATCCTGATGGGAGGATGATCATC
ACACGACATGTCACCTTCTGTGAAAATGAATTTCCCTTTGCGACTACATTCGCTGCTCCAAAAGTTGAGCCACCACAACAACAGGTTGATGTCACTCGCAACTGCCTCTT
TTGCATTGGTTTCCAGTACCTTTAA
Protein sequenceShow/hide protein sequence
MYAAQVSTDGTTEESSSTGATDGGKSSATPTRVVNPLYESWVTIDQLVLGWLYNSMTQEVATQVMGYETSKDLWDAVQTLFGIHSRAEEDYLRQLFQTTRKGNSKMSEYL
RVMKCHADNLGQIGSKISDRALISQVLLGLDEEYNPVVATVQGKPDMRWINLQNELLLFEKRTFGCACFPNLRPYQKHKFDFHSAKCVYLGPSTNHKGFKCLHPDGRMII
TRHVTFCENEFPFATTFAAPKVEPPQQQVDVTRNCLFCIGFQYL