; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017438 (gene) of Chayote v1 genome

Gene IDSed0017438
OrganismSechium edule (Chayote v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationLG01:68515839..68517637
RNA-Seq ExpressionSed0017438
SyntenySed0017438
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG99318.1 transposable element gene [Prunus dulcis]6.0e-3144.9Show/hide
Query:  NKVSSNIWHDRLGHPCDVVFKQVVKNLQIPV---FVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWL
        NK     WH RLGHP   VF+QV+ +  +P    F  +  C  C  G  S+  F  S S T  PLEL+HSDVWGP+P +S++G+KYYV  +DDF+++ W+
Subjt:  NKVSSNIWHDRLGHPCDVVFKQVVKNLQIPV---FVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWL

Query:  FPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
        +P+ YKSDV      F + +ENLLD K+K  RSD GGE+++ + + F
Subjt:  FPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

KAF8394586.1 hypothetical protein HHK36_020800 [Tetracentron sinense]1.4e-3244.3Show/hide
Query:  AYTGNKVSSNIWHDRLGHPCDVVFKQVVKNLQI-PVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFT
        A+T + VSS++WH RLGHP     +QV  ++ +     H   C  C  G  SR  F  S+S +  PLEL+H+DVWGP+   SING K+YV+ IDDFSR+ 
Subjt:  AYTGNKVSSNIWHDRLGHPCDVVFKQVVKNLQI-PVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFT

Query:  WLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
        W+FP+ +KS V    + F S++E + DRK+K  ++DGGGEY++     F
Subjt:  WLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

PKU80502.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]9.3e-3244.3Show/hide
Query:  AYTGNKVSSNIWHDRLGHPCDVVFKQVV-KNLQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFT
        A +   +SS +WH RLGHP   V K +   N ++ +  ++  C  CL     +  F  S + TH PLEL+HSDVWGP+P  S  G +YY+ L+DD+SRF 
Subjt:  AYTGNKVSSNIWHDRLGHPCDVVFKQVV-KNLQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFT

Query:  WLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
        WLFP+  KSDV +T   F++ IE   +RK+K  R+DGGGE+VN   ++F
Subjt:  WLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

PKU87026.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum]2.7e-3148.23Show/hide
Query:  SNIWHDRLGHPCDVVFKQVVK-NLQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLFPICYK
        SNIWH RLGHP     + + K N  + + V    C  C A    + +F  S + ++  LELIHSDVWGP+P  S    +YYV  +DDFSRFTWLFP+ +K
Subjt:  SNIWHDRLGHPCDVVFKQVVK-NLQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLFPICYK

Query:  SDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
        S+V++  I+F + IENL   K+KC R+DGG EYVNH L+ F
Subjt:  SDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

TQE01220.1 hypothetical protein C1H46_013127 [Malus baccata]7.6e-3440.67Show/hide
Query:  AYTGNKVSSNIWHDRLGHPCDVVFKQVVKNLQIPVFVHSKH--CPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRF
        AY G  V S+IWH RLGHP + +   +++N  IPV + S+H  C  C+ G +SR  FP     +    E +H+D+WGP+P  S+ GH+YYV+++D+++R+
Subjt:  AYTGNKVSSNIWHDRLGHPCDVVFKQVVKNLQIPVFVHSKH--CPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRF

Query:  TWLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
         W+FP+C KSDV    ++F + + N     +K  +SDGGGEY +H+ + F
Subjt:  TWLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

TrEMBL top hitse value%identityAlignment
A0A2N9E5R8 Integrase catalytic domain-containing protein6.7e-3650.7Show/hide
Query:  SSNIWHDRLGHPCDVVFKQVV-KNLQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLFPICY
        SS +WH RLGHP  +V + V+ K+L +PV  ++  C HCLAG + +  FP S S T  PLE++HSDVWGPAP  S NG +YYV+ +D+F+RFTW FP+ +
Subjt:  SSNIWHDRLGHPCDVVFKQVV-KNLQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLFPICY

Query:  KSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
        K  V S+ +HF S +ENLL  K+K  R+D GGEY  H  ++F
Subjt:  KSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

A0A2N9G7E3 Integrase catalytic domain-containing protein6.7e-3651.63Show/hide
Query:  AYTGNKVSSNIWHDRLGHPCDVVFKQVVKNLQI-PVFVHSKH--CPHCLAGTISRQIFPTSNSSTHV--PLELIHSDVWGPAPENSINGHKYYVSLIDDF
        AYT  KVSS+ WH RLGHP   + + V K+L   P+   S +  C HC  G +S+   P S+S TH   PL+L+HSDVWGPAP  SING +YYVS IDDF
Subjt:  AYTGNKVSSNIWHDRLGHPCDVVFKQVVKNLQI-PVFVHSKH--CPHCLAGTISRQIFPTSNSSTHV--PLELIHSDVWGPAPENSINGHKYYVSLIDDF

Query:  SRFTWLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
        S+FTW FP+ +KS V ST +HF S +ENLL+ K+K  R+D GGEY +   + +
Subjt:  SRFTWLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

A0A2N9GWG5 Uncharacterized protein7.4e-3551.41Show/hide
Query:  SSNIWHDRLGHPCDVVFKQVVKN-LQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLFPICY
        SS +WH+RLGHP   V K V++N L++PV   +  C HCL G + +  FP S S T  PLE++HSDVWGPAP  S N  +YYV+ +DDF+RFTW FP+  
Subjt:  SSNIWHDRLGHPCDVVFKQVVKN-LQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLFPICY

Query:  KSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
        KS V S+ +HF S +ENLL  K+K  R+D GGEY  H  ++F
Subjt:  KSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

A0A2N9HBZ5 Uncharacterized protein2.3e-3650.7Show/hide
Query:  SSNIWHDRLGHPCDVVFKQVV-KNLQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLFPICY
        SS +WH RLGHP  +V + V+ K+L +PV  ++  C HCLAG + +  FP S S T  PLE++HSDVWGPAP  S+NG +YYV+ +D+F+RFTW FP+ +
Subjt:  SSNIWHDRLGHPCDVVFKQVV-KNLQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLFPICY

Query:  KSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
        K  V S+ +HF S +ENLL  K+K  R+D GGEY  H  ++F
Subjt:  KSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

A0A2N9HKM9 Uncharacterized protein7.4e-3551.41Show/hide
Query:  SSNIWHDRLGHPCDVVFKQVVKN-LQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLFPICY
        SS +WH+RLGHP   V K V++N L++PV   +  C HCL G + +  FP S S T  PLE++HSDVWGPAP  S N  +YYV+ +DDF+RFTW FP+  
Subjt:  SSNIWHDRLGHPCDVVFKQVVKN-LQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLFPICY

Query:  KSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
        KS V S+ +HF S +ENLL  K+K  R+D GGEY  H  ++F
Subjt:  KSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.2e-1529.76Show/hide
Query:  MNLQAYTGNKVSSN---IWHDRLGHPCDVVFKQV-----------VKNLQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHV--PLELIHSDVWGPAPEN
        +N QAY+ N    N   +WH+R GH  D    ++           + NL++   +    C  CL G  +R  F      TH+  PL ++HSDV GP    
Subjt:  MNLQAYTGNKVSSN---IWHDRLGHPCDVVFKQV-----------VKNLQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHV--PLELIHSDVWGPAPEN

Query:  SINGHKYYVSLIDDFSRFTWLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
        +++   Y+V  +D F+ +   + I YKSDV S    F++  E   + KV     D G EY+++ +  F
Subjt:  SINGHKYYVSLIDDFSRFTWLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.5e-1934.25Show/hide
Query:  NKVSSNIWHDRLGHPCDVVFKQVVKNLQIPVFVHS--KHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLF
        +++S ++WH R+GH  +   + + K   I     +  K C +CL G   R  F TS+      L+L++SDV GP    S+ G+KY+V+ IDD SR  W++
Subjt:  NKVSSNIWHDRLGHPCDVVFKQVVKNLQIPVFVHS--KHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLF

Query:  PICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF
         +  K  V      F +++E    RK+K  RSD GGEY +   E +
Subjt:  PICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETF

Q12491 Transposon Ty2-B Gag-Pol polyprotein7.2e-1124.54Show/hide
Query:  NKVSSNIWHDRLGHPCDVVFKQVVKNLQIPVFVHSK------------HCPHCLAGTISRQIFPTSN----SSTHVPLELIHSDVWGPAPENSINGHKYY
        NK    + H  LGH     F+ + K+L+     + K             CP CL G  ++      +      ++ P + +H+D++GP      +   Y+
Subjt:  NKVSSNIWHDRLGHPCDVVFKQVVKNLQIPVFVHSK------------HCPHCLAGTISRQIFPTSN----SSTHVPLELIHSDVWGPAPENSINGHKYY

Query:  VSLIDDFSRFTWLFPICYKSDVS--STVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETFF
        +S  D+ +RF W++P+  + + S  +     ++ I+N  + +V   + D G EY N TL  FF
Subjt:  VSLIDDFSRFTWLFPICYKSDVS--STVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETFF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.7e-2336.49Show/hide
Query:  MNLQAYTGNKVSSNIWHDRLGHPCDVVFKQVVKNLQIPVFVHSK---HCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLID
        ++L A   +K + + WH RLGHP   +   V+ N  + V   S     C  CL    ++  F  S  ++  PLE I+SDVW  +P  S + ++YYV  +D
Subjt:  MNLQAYTGNKVSSNIWHDRLGHPCDVVFKQVVKNLQIPVFVHSK---HCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLID

Query:  DFSRFTWLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYV
         F+R+TWL+P+  KS V  T I F +++EN    ++  F SD GGE+V
Subjt:  DFSRFTWLFPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.8e-2337.86Show/hide
Query:  NKVSSNIWHDRLGHPCDVVFKQVVKNLQIPVFVHSK---HCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWL
        +K + + WH RLGHP   +   V+ N  +PV   S     C  C      +  F  S  ++  PLE I+SDVW  +P  SI+ ++YYV  +D F+R+TWL
Subjt:  NKVSSNIWHDRLGHPCDVVFKQVVKNLQIPVFVHSK---HCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWL

Query:  FPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYV
        +P+  KS V  T I F S++EN    ++    SD GGE+V
Subjt:  FPICYKSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYV

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein7.2e-0633.33Show/hide
Query:  KVSSNIWHDRLGHPCDVVFKQVVKN--LQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWG
        K  + +WH RL H      + +VK   L        K C  C+ G   R  F T   +T  PL+ +HSD+WG
Subjt:  KVSSNIWHDRLGHPCDVVFKQVVKN--LQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCTTCAAGCTTATACTGGGAATAAAGTTTCAAGCAATATTTGGCATGATAGACTAGGCCATCCTTGTGATGTTGTTTTTAAACAAGTTGTTAAAAACCTACAAAT
TCCTGTATTTGTTCATTCTAAACATTGTCCACACTGTTTAGCTGGGACAATATCTAGACAAATTTTTCCTACTTCTAATAGTTCTACTCATGTACCTTTAGAACTCATTC
ATAGTGATGTTTGGGGCCCTGCTCCAGAGAATTCAATAAATGGGCATAAGTACTATGTTTCCTTAATTGATGATTTCTCTAGATTTACATGGTTGTTTCCTATATGCTAC
AAATCTGATGTTTCATCTACTGTTATTCATTTTATCTCCATGATTGAAAATTTATTAGATAGAAAGGTAAAATGTTTTCGTAGTGATGGAGGAGGCGAATATGTGAATCA
TACTTTAGAAACTTTTTTTTGA
mRNA sequenceShow/hide mRNA sequence
TTTTTTTTTTCTCTCTCTGACCTACTTTACCAATTCTTCACCCTTCGTCTACAGTTGCTTTCTTCAACCTAATTTTCCATTTCTTCTCTATAACTAAAATCCATGGCGAC
TATCGATCCAACACCTATTTCTTCCATTTATCTTTTAACGAATATTTGCAATCGATCTTTATATTCAAAGAATTCAATCTATTGTTCATCGTCTTTCTGCTGTTTCTGTT
CAAATTGACCCTGAAGATCTTGCAGTATACACCATCAATGGGCTCCCCTCCGCCTACAATGTCTTTCGAACATCACTGAGAACCAGATCTCAGTCTATTACCTTTGATGA
GTTACAAACTCTGTTAAAAACAAAAGAAGCCGAAATTGAAAGACAGTCCAAGATCGATGAAGCATCAACTGTGTCCACATTAGCCATGATGGCAAATTCTAATGGCTCTA
AACGAGGTTCTTGGAGAGGTTCAAACCGTGGGCGAGGTCGCAACAATGGAGGAAGGGGACGTGGTTTTTATAACCCAAGATTCTTTTATGGGAACGACTACAATCAAGGC
TCATTTATGGGAACAATACTAATCCGGGATCTTTCTCCAGAATGACCAATATGTCTACTCCTCTGCCAATCGACTCAATTACATCAAACACCCCACTTCCATGTCAAATC
TGCAAGAAACTTGGGCATGATGCTTTAGACTGTTATCACATGATGAATTTCTCTTACCAAGGACGCCATCCACCTGCTAAACTCGCTGCAATGGCTCAATACAGAGAAAA
TAACACTAGACCTTCACAGCCTAGACACAACCAAGAAGCTATAGTATGGCTTACTGATAGTGGTTGCAACGCACATCTTACCAATGATGTCTCTAATCTCAACTCCTCCC
TTCCTAATCCTGTTGATGACTCTGTGATCATAGGAAATGGACAAGGTATTCAAATCTCCCATCAAGGTTAAGGACCTAGTGTGAATGGATTGTACCCTATACAAACTAAT
CCTTCATCATCTATGAATCTTCAAGCTTATACTGGGAATAAAGTTTCAAGCAATATTTGGCATGATAGACTAGGCCATCCTTGTGATGTTGTTTTTAAACAAGTTGTTAA
AAACCTACAAATTCCTGTATTTGTTCATTCTAAACATTGTCCACACTGTTTAGCTGGGACAATATCTAGACAAATTTTTCCTACTTCTAATAGTTCTACTCATGTACCTT
TAGAACTCATTCATAGTGATGTTTGGGGCCCTGCTCCAGAGAATTCAATAAATGGGCATAAGTACTATGTTTCCTTAATTGATGATTTCTCTAGATTTACATGGTTGTTT
CCTATATGCTACAAATCTGATGTTTCATCTACTGTTATTCATTTTATCTCCATGATTGAAAATTTATTAGATAGAAAGGTAAAATGTTTTCGTAGTGATGGAGGAGGCGA
ATATGTGAATCATACTTTAGAAACTTTTTTTTGATAACAAAGGAATTTTGCACAAAAAATCTTGTCCTCACATACCACAACAAAATGGCATTGCCGAGAGGAAACACAAA
CATTTAGTGAACACTGCTATCTCCTTAATGTCCCGTTCTTATATTCCTCTTAAATATTGGTCTTTTGCTCTCTCTACTGCAGC
Protein sequenceShow/hide protein sequence
MNLQAYTGNKVSSNIWHDRLGHPCDVVFKQVVKNLQIPVFVHSKHCPHCLAGTISRQIFPTSNSSTHVPLELIHSDVWGPAPENSINGHKYYVSLIDDFSRFTWLFPICY
KSDVSSTVIHFISMIENLLDRKVKCFRSDGGGEYVNHTLETFF