CuGenDBv2

Sed0011912 (gene) of Chayote v1 genome

Gene ID: Sed0011912
Organism: Sechium edule (Chayote v1)
Description: Integrase catalytic domain-containing protein
Genome location: LG09:10392893..10393784
RNA-Seq Expression: Sed0011912
Synteny: Sed0011912
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal
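
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits such a string and computes the length of the genomic span. The helper name `parse_location` is hypothetical, and the `+ 1` assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("LG09:10392893..10393784")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```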


Homology
GenBank top hits
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa] (e value: 3.0e-49; identity: 47.14%)
Query:  DTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLTNPKLHSSWKCNNDVLTSWILNSVSKEIAST------------
        D QLNP+ +HHS   T+ +VTQ LT A NY SW RAML+A+SG+NK  F+ G I K ++  L  +W CNND+L SWILNSVSKEIA++            
Subjt:  DTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLTNPKLHSSWKCNNDVLTSWILNSVSKEIAST------------

Query:  ----------------------------MTVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLMDPIPTISK
                                    +T+E Y+TKLKTIWQ+LN++R  ++C+CGGLK F+  L+ EY+M FLMGLN+SYA++R QILLM P+P+I+ 
Subjt:  ----------------------------MTVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLMDPIPTISK

Query:  VFGLIIQEERQRTACNSSVSSLEPIAL
        VF L+IQEE+QR+A       ++P+AL
Subjt:  VFGLIIQEERQRTACNSSVSSLEPIAL

KAA8537887.1 hypothetical protein F0562_027533 [Nyssa sinensis] (e value: 4.5e-37; identity: 41.63%)
Query:  IDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLT--NPKLHSSW-KCNNDVLTSWILNSVSKEIASTM-------
        ID   +P+ +HH  +  +VLV+Q LT   NY +W R+M +ALS KNK  F+ G+I + +    KL  +W +CNN VL SWILNSVSK++A+++       
Subjt:  IDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLT--NPKLHSSW-KCNNDVLTSWILNSVSKEIASTM-------

Query:  ---------------------------------TVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLMDPIP
                                         +V  YFT+LK +W +LN+FRP+  C+CG  K  LK+   E VM FLMGLNESY+ +R Q+LLMDP+P
Subjt:  ---------------------------------TVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLMDPIP

Query:  TISKVFGLIIQEERQRTACNSSVSSLEPIALMA
         I+KVF LI+QEERQR   + +  ++EP AL++
Subjt:  TISKVFGLIIQEERQRTACNSSVSSLEPIALMA

QHO44758.1 uncharacterized protein DS421_6g173460 [Arachis hypogaea] (e value: 5.3e-38; identity: 47.19%)
Query:  NPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVK---LTNPKLHSSWKCNNDVLTSWILNSVSKEIASTMTVEVYFTKLKTI
        N + +H S     +LV+Q L    NY SW R+M +ALSGK K  F+ G++ K     NP L  +W+C ND++T W+LNS+SK+IA++     YFTKLK +
Subjt:  NPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVK---LTNPKLHSSWKCNNDVLTSWILNSVSKEIASTMTVEVYFTKLKTI

Query:  WQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLMDPIPTISKVFGLIIQEERQRTACNSSV
        W++LN F+ +  CSCGG+K+   +LD EYVM+FLMGLN++ A++R+QILL DP+P I KVF L++Q+E+Q+   +S +
Subjt:  WQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLMDPIPTISKVFGLIIQEERQRTACNSSV

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia] (e value: 2.2e-52; identity: 51.07%)
Query:  MAEQE------EFSVETVATIDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLTNPKLHSSWKCNNDVLTSWILN
        MA+QE        S     TI++QLNP+L+HHS   T++LVTQQL  A+NY SW R+MLIALSGKNK  F+ GTI K  N  L ++WKCNND++TSWI+N
Subjt:  MAEQE------EFSVETVATIDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLTNPKLHSSWKCNNDVLTSWILN

Query:  SVSKEIAS----------------------------------------TMTVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLN
        SVSKEIA+                                        T+++E Y+TKLKT+WQ+L D+RP  +C+C GLK   +F   EYVM FLMGLN
Subjt:  SVSKEIAS----------------------------------------TMTVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLN

Query:  ESYASIRTQILLMDPIPTISKVFGLIIQEERQR
        ESYA IR QILLMDPIP ++KVF L+IQEERQR
Subjt:  ESYASIRTQILLMDPIPTISKVFGLIIQEERQR

XP_022883901.1 uncharacterized protein LOC111400747 [Olea europaea var. sylvestris] (e value: 1.7e-36; identity: 41.04%)
Query:  AEQEEFSVETV--ATIDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLT--NPKLHSSWKCNNDVLTSWILNSVS
        ++Q E S  +V  ++ID   +P+ ++HS +   VLV+QQL V  NY S  RAM+IALS KNK  F+ G+I++    +P+L ++W  NN+++ SWILNSVS
Subjt:  AEQEEFSVETV--ATIDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLT--NPKLHSSWKCNNDVLTSWILNSVS

Query:  KEIASTM----------------------------------------TVEVYFTKLKTIWQDLNDFRPI---DECSCGGLKLFLKFLDFEYVMVFLMGLN
        KEI++++                                        TV VYFTKLKTIW++L+++RP+    +C+C G+K   K+ +  Y M FLMGLN
Subjt:  KEIASTM----------------------------------------TVEVYFTKLKTIWQDLNDFRPI---DECSCGGLKLFLKFLDFEYVMVFLMGLN

Query:  ESYASIRTQILLMDPIPTISKVFGLIIQEERQRTACNSSVSSLEPIALMAF
        +SY+ IR QILLMDP+P I+KVF LI QEE QR       S  +P   +AF
Subjt:  ESYASIRTQILLMDPIPTISKVFGLIIQEERQRTACNSSVSSLEPIALMAF

TrEMBL top hits
A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 8 (e value: 1.4e-49; identity: 47.14%)
Query:  DTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLTNPKLHSSWKCNNDVLTSWILNSVSKEIAST------------
        D QLNP+ +HHS   T+ +VTQ LT A NY SW RAML+A+SG+NK  F+ G I K ++  L  +W CNND+L SWILNSVSKEIA++            
Subjt:  DTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLTNPKLHSSWKCNNDVLTSWILNSVSKEIAST------------

Query:  ----------------------------MTVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLMDPIPTISK
                                    +T+E Y+TKLKTIWQ+LN++R  ++C+CGGLK F+  L+ EY+M FLMGLN+SYA++R QILLM P+P+I+ 
Subjt:  ----------------------------MTVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLMDPIPTISK

Query:  VFGLIIQEERQRTACNSSVSSLEPIAL
        VF L+IQEE+QR+A       ++P+AL
Subjt:  VFGLIIQEERQRTACNSSVSSLEPIAL

A0A5J5B7Z9 Uncharacterized protein (e value: 2.2e-37; identity: 41.63%)
Query:  IDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLT--NPKLHSSW-KCNNDVLTSWILNSVSKEIASTM-------
        ID   +P+ +HH  +  +VLV+Q LT   NY +W R+M +ALS KNK  F+ G+I + +    KL  +W +CNN VL SWILNSVSK++A+++       
Subjt:  IDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLT--NPKLHSSW-KCNNDVLTSWILNSVSKEIASTM-------

Query:  ---------------------------------TVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLMDPIP
                                         +V  YFT+LK +W +LN+FRP+  C+CG  K  LK+   E VM FLMGLNESY+ +R Q+LLMDP+P
Subjt:  ---------------------------------TVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLMDPIP

Query:  TISKVFGLIIQEERQRTACNSSVSSLEPIALMA
         I+KVF LI+QEERQR   + +  ++EP AL++
Subjt:  TISKVFGLIIQEERQRTACNSSVSSLEPIALMA

A0A5J5BKC2 Uncharacterized protein (e value: 1.1e-36; identity: 41.6%)
Query:  ATIDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKL--TNPKLHSSWKCNNDVLTSWILNSVSKEIASTM------
        + I+   NP+ +HHS +   +LV+QQLT   NY +W RAMLIALS KNK  FV G+I++   T   L +SW  NN+++ SWILNSVSKEI++++      
Subjt:  ATIDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKL--TNPKLHSSWKCNNDVLTSWILNSVSKEIASTM------

Query:  ----------------------------------TVEVYFTKLKTIWQDLNDFR---PIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLM
                                          +V +YFTKLKTIW++L+++R      +CSCGG+K        EY+M FLMGL++S++ +R Q+LLM
Subjt:  ----------------------------------TVEVYFTKLKTIWQDLNDFR---PIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLM

Query:  DPIPTISKVFGLIIQEERQRTACNSSVSSLEPIALMAF
        DP+P I++VF LI+QEE+QR   NSS  S      MAF
Subjt:  DPIPTISKVFGLIIQEERQRTACNSSVSSLEPIALMAF

A0A6J1CXR2 uncharacterized protein LOC111015239 (e value: 1.1e-52; identity: 51.07%)
Query:  MAEQE------EFSVETVATIDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLTNPKLHSSWKCNNDVLTSWILN
        MA+QE        S     TI++QLNP+L+HHS   T++LVTQQL  A+NY SW R+MLIALSGKNK  F+ GTI K  N  L ++WKCNND++TSWI+N
Subjt:  MAEQE------EFSVETVATIDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLTNPKLHSSWKCNNDVLTSWILN

Query:  SVSKEIAS----------------------------------------TMTVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLN
        SVSKEIA+                                        T+++E Y+TKLKT+WQ+L D+RP  +C+C GLK   +F   EYVM FLMGLN
Subjt:  SVSKEIAS----------------------------------------TMTVEVYFTKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLN

Query:  ESYASIRTQILLMDPIPTISKVFGLIIQEERQR
        ESYA IR QILLMDPIP ++KVF L+IQEERQR
Subjt:  ESYASIRTQILLMDPIPTISKVFGLIIQEERQR

A0A6J1DIP8 uncharacterized protein LOC111020399 (e value: 3.1e-36; identity: 40.83%)
Query:  SVETVATIDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLTNPKLHSSWKCNNDVLTSWILNSVSKEIAST----
        ++  +  I+   NP+ +HHS  ++ VLV+  LT   NY SW R+MLIAL+ KNK  FV G+IV+ T   LHS   CNN V+ SWILNS+SKEI+++    
Subjt:  SVETVATIDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLTNPKLHSSWKCNNDVLTSWILNSVSKEIAST----

Query:  ------------------------------------MTVEVYFTKLKTIWQDLNDFRP---IDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQI
                                            ++V  YFT LKT+W +LN + P      CSCGG+K  + F   E+VM FLMGLNES++ +R Q+
Subjt:  ------------------------------------MTVEVYFTKLKTIWQDLNDFRP---IDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQI

Query:  LLMDPIPTISKVFGLIIQEERQRTACNSSVSSLEPIALMA
        LLM+P PTI++VF L+ QE +QR    S+     P ALMA
Subjt:  LLMDPIPTISKVFGLIIQEERQRTACNSSVSSLEPIALMA

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 6.5e-10; identity: 24.21%)
Query:  NYLSWKRAMLIALSGKNKEEFVKGTIVKLTNP--KLHSSWKCNNDVLTSWILNSVSKEIASTM-------------------------------------
        NY++WK      L    K  F+ GT+ K  +P   L+  W+  N ++  W++NS++ ++  ++                                     
Subjt:  NYLSWKRAMLIALSGKNKEEFVKGTIVKLTNP--KLHSSWKCNNDVLTSWILNSVSKEIASTM-------------------------------------

Query:  ---TVEVYFTKLKTIWQDLNDFRPIDECSCGG-----LKLFLKFLDFEYVMVFLMG--LNESYASIRTQILLMDPIPTISKVFGLIIQEE
           +VE YF KL  +W +L+++ PI EC CGG      K   +  + E    FLMG  LN+ + ++ T+I+   P P++ + F ++   E
Subjt:  ---TVEVYFTKLKTIWQDLNDFRPIDECSCGG-----LKLFLKFLDFEYVMVFLMG--LNESYASIRTQILLMDPIPTISKVFGLIIQEE


Sequences
CDS sequence
ATGGCTGAACAAGAAGAGTTCAGTGTCGAGACAGTAGCAACCATTGACACGCAGTTGAATCCGTTTCTTATGCATCATTCATTCACGTCTACCTCTGTTTTGGTTACGCA
ACAACTGACTGTAGCAGCAAATTACCTCTCATGGAAGAGAGCGATGTTGATAGCCCTATCAGGCAAAAACAAAGAGGAATTTGTGAAAGGAACTATTGTTAAACTCACAA
ATCCCAAGCTTCATTCCTCATGGAAGTGCAATAACGACGTTCTGACCTCTTGGATCTTGAATTCGGTTTCGAAAGAAATTGCTTCGACCATGACTGTCGAGGTATACTTT
ACCAAGTTGAAGACGATTTGGCAAGATCTGAACGATTTTCGTCCCATCGATGAATGCAGCTGTGGTGGTCTCAAACTATTCCTCAAATTTCTTGATTTTGAGTATGTGAT
GGTGTTTCTCATGGGACTAAACGAATCATACGCAAGCATTCGAACCCAAATCCTTCTGATGGATCCAATTCCAACTATCAGTAAAGTATTTGGACTAATAATCCAAGAAG
AAAGACAAAGAACTGCATGCAATTCCTCTGTTTCTTCTTTAGAGCCGATAGCCTTAATGGCTTTCGATTAA
mRNA sequence
GCTTCTCTTTCTCCTTCTTCGAAACAAAAATTTCTCAGCTTCTCTTTCTCTTTCTCTTTCTCTTTCTCGAATAATGGCTTTACTTTCACATGGTATCAGAGCACTAAGAT
CAATCAGCTTCGTTTTTTCGTCGTTGCCCTAAAATTCTCCCATGGCTGAACAAGAAGAGTTCAGTGTCGAGACAGTAGCAACCATTGACACGCAGTTGAATCCGTTTCTT
ATGCATCATTCATTCACGTCTACCTCTGTTTTGGTTACGCAACAACTGACTGTAGCAGCAAATTACCTCTCATGGAAGAGAGCGATGTTGATAGCCCTATCAGGCAAAAA
CAAAGAGGAATTTGTGAAAGGAACTATTGTTAAACTCACAAATCCCAAGCTTCATTCCTCATGGAAGTGCAATAACGACGTTCTGACCTCTTGGATCTTGAATTCGGTTT
CGAAAGAAATTGCTTCGACCATGACTGTCGAGGTATACTTTACCAAGTTGAAGACGATTTGGCAAGATCTGAACGATTTTCGTCCCATCGATGAATGCAGCTGTGGTGGT
CTCAAACTATTCCTCAAATTTCTTGATTTTGAGTATGTGATGGTGTTTCTCATGGGACTAAACGAATCATACGCAAGCATTCGAACCCAAATCCTTCTGATGGATCCAAT
TCCAACTATCAGTAAAGTATTTGGACTAATAATCCAAGAAGAAAGACAAAGAACTGCATGCAATTCCTCTGTTTCTTCTTTAGAGCCGATAGCCTTAATGGCTTTCGATT
AA
Protein sequence
MAEQEEFSVETVATIDTQLNPFLMHHSFTSTSVLVTQQLTVAANYLSWKRAMLIALSGKNKEEFVKGTIVKLTNPKLHSSWKCNNDVLTSWILNSVSKEIASTMTVEVYF
TKLKTIWQDLNDFRPIDECSCGGLKLFLKFLDFEYVMVFLMGLNESYASIRTQILLMDPIPTISKVFGLIIQEERQRTACNSSVSSLEPIALMAFD
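
The relationship between the CDS and the protein sequence above can be checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard genetic code (the page does not state which translation table was used); the `translate` helper is hypothetical and is applied here only to the first 33 nucleotides and the last 12 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 33 nt and last 12 nt of the CDS listed above.
cds_start = "ATGGCTGAACAAGAAGAGTTCAGTGTCGAGACA"
cds_end = "GCTTTCGATTAA"
print(translate(cds_start))  # MAEQEEFSVET
print(translate(cds_end))    # AFD
```

Both fragments agree with the start (`MAEQEEFSVET...`) and end (`...AFD`) of the protein sequence listed above; translating the full CDS would require joining its lines into a single string first.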