CuGenDBv2

Sed0026271 (gene) of Chayote v1 genome

Gene ID: Sed0026271
Organism: Sechium edule (Chayote v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG01:42451962..42456291
RNA-Seq Expression: Sed0026271
Synteny: Sed0026271
Gene Ontology terms:
  GO:0006281 - DNA repair (biological process)
  GO:0004518 - nuclease activity (molecular function)
InterPro domains:
  IPR004808 - AP endonuclease 1
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
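
The genome location value in this record (LG01:42451962..42456291) follows a `sequence:start..end` pattern. As a minimal sketch, the helper below splits such a string into its parts and computes the span length. The function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'SEQID:START..END' into (seqid, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return seqid, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("LG01:42451962..42456291"))
```

Under the inclusive-coordinate assumption, the span covers the whole gene model on LG01, which can be longer than the CDS listed further down if the gene contains introns or untranslated regions.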


Homology
GenBank top hits (e value, % identity, alignment)
RVW46690.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 3.6e-27; identity: 37.13%)
Query:  DKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TMADGFLFRITGIYGPSN
        DKR ++KS++ KH P LV LQETK               R++ W SLDA G++G +++MW+       + + G+FS++C      +GF++  +G+YGPS 
Subjt:  DKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TMADGFLFRITGIYGPSN

Query:  SNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG
               WEEL  +  +CN PW + GDFNV R+  E S+ R +S  M+ F+ FI+++ L+D  L  G
Subjt:  SNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG

RVX14364.1 hypothetical protein CK203_017180 [Vitis vinifera] (e value: 7.3e-28; identity: 24.75%)
Query:  EVFNLIN-SINCLGGKDKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TM
        E + L++ ++  +   DK  ++K ++ KH P LV  QETK               R++ W SLDA GS+G +++MW+       +++ G+FS +C     
Subjt:  EVFNLIN-SINCLGGKDKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TM

Query:  ADGFLFRITGIYGPSNSNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG--CILESYI---NKTEL
         +GF+  + G+Y P         WEEL  +  + N PW +   FNV R+  E S+ R +        EF+ K  +       G     E  I   +K  +
Subjt:  ADGFLFRITGIYGPSNSNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG--CILESYI---NKTEL

Query:  LGINLDDSITMNLSTKIGSWPTSYLDGGLGIYSIKERNNALLSKWIWRYAINTHDKLQRRLPLLHISPQWCSLCQKQAETQEHLLINCSFATSFWNLIRT
        +G+      T+   +KIG+W  S  D    ++  KE         +W                       CSLC++  E+  H+LI+C      W L+ +
Subjt:  LGINLDDSITMNLSTKIGSWPTSYLDGGLGIYSIKERNNALLSKWIWRYAINTHDKLQRRLPLLHISPQWCSLCQKQAETQEHLLINCSFATSFWNLIRT

Query:  TF--YWIFPLPNKPSTLLMNVLTAHPFDRQSFIKTKLILWENFMRAFCWHIWKERNNRIFQDKANAFDRFMDNVTNTVVSWIKIYPAFDIYSFDSLINNW
        +F   W+F      S L+ N+L    +  +   K +  +W        W IW ERN R+F ++  +     +    +++ W + +   D  SF +   +W
Subjt:  TF--YWIFPLPNKPSTLLMNVLTAHPFDRQSFIKTKLILWENFMRAFCWHIWKERNNRIFQDKANAFDRFMDNVTNTVVSWIKIYPAFDIYSFDSLINNW

RVX17759.1 hypothetical protein CK203_004386 [Vitis vinifera] (e value: 3.1e-26; identity: 37.13%)
Query:  DKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TMADGFLFRITGIYGPSN
        DKR ++KS++ KH P LV LQETK               R++ W SLDA G++G +++MW+       + + G+FS++C      +GF++  +G+YGPS 
Subjt:  DKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TMADGFLFRITGIYGPSN

Query:  SNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG
               WEEL  +  + N PW + GDFNV R+  E S+ R +S  M+ F+ FIE++ L+D  L  G
Subjt:  SNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia] (e value: 1.4e-26; identity: 38.51%)
Query:  SINCLGGKDKRAMVKSLIIKHNPTLVILQETKSR--------------DVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSITMADGFLFRIT
        ++  LG   KRA +K  I    P +VIL ETKS                +AW SLDA G+SG II++W+  + S  ++  G FS++    +AD F + +T
Subjt:  SINCLGGKDKRAMVKSLIIKHNPTLVILQETKSR--------------DVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSITMADGFLFRIT

Query:  GIYGPSNSNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG
        G+Y P       LFW+EL  L  +C   W+L  DFN+ RW+HE SS  P    M  FN FI+   LID  ++NG
Subjt:  GIYGPSNSNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia] (e value: 1.5e-33; identity: 44.85%)
Query:  KRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSITMADGFLFRITGIYGPSNSN
        K A++K  I + NP +VILQETK              +  + W++LDA G +  I+I+WND      ++ +G FSLT +  ++DGFLF ++GIYGPS + 
Subjt:  KRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSITMADGFLFRITGIYGPSNSN

Query:  PSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG
           LFW+EL  L+D+C   W+L GDFNVTRW+ E+S+ RP++  M  FN FIE  SLID+ L+NG
Subjt:  PSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG

TrEMBL top hits (e value, % identity, alignment)
A0A438EG68 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.8e-27; identity: 37.13%)
Query:  DKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TMADGFLFRITGIYGPSN
        DKR ++KS++ KH P LV LQETK               R++ W SLDA G++G +++MW+       + + G+FS++C      +GF++  +G+YGPS 
Subjt:  DKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TMADGFLFRITGIYGPSN

Query:  SNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG
               WEEL  +  +CN PW + GDFNV R+  E S+ R +S  M+ F+ FI+++ L+D  L  G
Subjt:  SNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG

A0A438K974 Endo/exonuclease/phosphatase domain-containing protein (e value: 1.5e-26; identity: 37.13%)
Query:  DKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TMADGFLFRITGIYGPSN
        DKR ++KS++ KH P LV LQETK               R++ W SLDA G++G +++MW+       + + G+FS++C      +GF++  +G+YGPS 
Subjt:  DKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TMADGFLFRITGIYGPSN

Query:  SNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG
               WEEL  +  + N PW + GDFNV R+  E S+ R +S  M+ F+ FIE++ L+D  L  G
Subjt:  SNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG

A0A438KG84 Transposon TX1 uncharacterized 149 kDa protein (e value: 3.3e-26; identity: 36.53%)
Query:  DKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TMADGFLFRITGIYGPSN
        DKR ++KS++ KH P LV LQETK               R++ W SLDA G++G +++MW+       + + G+FS++C      +GF++  +G+YGPS 
Subjt:  DKRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSI-TMADGFLFRITGIYGPSN

Query:  SNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG
               WEEL  +  + N PW + GDFNV R+  E S+ R +S  M+ F+ FI+++ L+D  L  G
Subjt:  SNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG

A0A6J1CVN2 uncharacterized protein LOC111014657 (e value: 6.7e-27; identity: 38.51%)
Query:  SINCLGGKDKRAMVKSLIIKHNPTLVILQETKSR--------------DVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSITMADGFLFRIT
        ++  LG   KRA +K  I    P +VIL ETKS                +AW SLDA G+SG II++W+  + S  ++  G FS++    +AD F + +T
Subjt:  SINCLGGKDKRAMVKSLIIKHNPTLVILQETKSR--------------DVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSITMADGFLFRIT

Query:  GIYGPSNSNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG
        G+Y P       LFW+EL  L  +C   W+L  DFN+ RW+HE SS  P    M  FN FI+   LID  ++NG
Subjt:  GIYGPSNSNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG

A0A6J1E2G6 uncharacterized protein LOC111025405 (e value: 7.4e-34; identity: 44.85%)
Query:  KRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSITMADGFLFRITGIYGPSNSN
        K A++K  I + NP +VILQETK              +  + W++LDA G +  I+I+WND      ++ +G FSLT +  ++DGFLF ++GIYGPS + 
Subjt:  KRAMVKSLIIKHNPTLVILQETK--------------SRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCSITMADGFLFRITGIYGPSNSN

Query:  PSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG
           LFW+EL  L+D+C   W+L GDFNVTRW+ E+S+ RP++  M  FN FIE  SLID+ L+NG
Subjt:  PSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G10000.1 Ribonuclease H-like superfamily protein (e value: 2.4e-05; identity: 32.76%)
Query:  HISPQW-CSLCQKQAETQEHLLINCSFATSFWNLIRTTFYWIFPLPN---KPSTLLMNVLTAHPFDRQSFIKTKLILWENFMRAFCWHIWKERNNRIFQD
        HIS  W C+ C    ET  H+L +C FA   WNL       I P+     +   LL   +   P    +     L  W       CWHIWK RN  IFQ 
Subjt:  HISPQW-CSLCQKQAETQEHLLINCSFATSFWNLIRTTFYWIFPLPN---KPSTLLMNVLTAHPFDRQSFIKTKLILWENFMRAFCWHIWKERNNRIFQD

Query:  KANAFDRFMDNVTNTV
          N+    ++ VT  V
Subjt:  KANAFDRFMDNVTNTV

AT3G25270.1 Ribonuclease H-like superfamily protein (e value: 2.9e-06; identity: 27.7%)
Query:  LSKWIWRY---AINTHDKLQRRLPLLHISPQWCSLCQKQAETQEHLLINCSFATSFWNL-------IRTTFYWIFPLPNKPSTLLMNVLTAHPFDRQSFI
        +  ++W+    A+ T D L+RR    H  PQ C  C ++ ET +HL  +C +A   W         +RTT      +  K   LL + L     +RQ  +
Subjt:  LSKWIWRY---AINTHDKLQRRLPLLHISPQWCSLCQKQAETQEHLLINCSFATSFWNL-------IRTTFYWIFPLPNKPSTLLMNVLTAHPFDRQSFI

Query:  KTKLILWENFMRAFCWHIWKERNNRIFQDKANAFDRFMDNVTNTVVSW
           L +W        W +WK RN  +FQ K+ ++   +    N V  W
Subjt:  KTKLILWENFMRAFCWHIWKERNNRIFQDKANAFDRFMDNVTNTVVSW

AT4G10613.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 2.4e-05; identity: 31.19%)
Query:  INTHDKLQRRLPL----LHISPQWCSLCQKQAETQEHLLINCSFATSFWNLIRTTFYWIFPLPNKPSTLLMNVLTAHPFDRQSFIKTKLILWENFMRAFC
        I+  D+L  R  L    L ISP  C LC +  ET++HL++ C F++S WN+++     + P+  +    L++ +        S I+ KL+      +A  
Subjt:  INTHDKLQRRLPL----LHISPQWCSLCQKQAETQEHLLINCSFATSFWNLIRTTFYWIFPLPNKPSTLLMNVLTAHPFDRQSFIKTKLILWENFMRAFC

Query:  WHIWKERNN
          IWK+RNN
Subjt:  WHIWKERNN


Sequences
CDS sequence
ATGGCTTCTCACCCCAGTCACATCACAAAAAACCAAAATCTCACCATTACCATCAACGAGAAACCCGCTTTTTTTATCCCTGGGACCAAATCATCTACTAACCTCCCCTC
TCATCTGTCTTATGATACAGAAAACCATTTATCCTCCCCATATCCTTCCACACCAAACACACCCAAGCCATACCTGACCGAACCAGCACCCCTCCAAATATCATACCCAC
CTTCTTCAAATTTTGCGGATGCCACCAGATCTTTTTATCGTAAGAAACCACTCAAGTTGGACCCCAGTATACATCCATCGGAGCTACTTGGAGTGATGATAACGTGGCTG
AGACCATTGGGCTTGGGCATCTTGCCCCTTCCTACGAAGGTATCTAAACAAATGAAGGAGTCACGCAAGAAAAACAACCAATATCGAGAAGTATTTAATCTGATAAATTC
CATCAACTGTCTTGGTGGCAAGGATAAGAGAGCTATGGTTAAAAGTTTAATTATTAAACACAATCCCACCTTAGTCATTCTTCAAGAAACTAAATCTCGAGATGTTGCCT
GGACTTCCCTTGATGCAGTGGGTTCATCTGGAAGCATCATCATTATGTGGAACGATCATGCATTCTCGGTTACAGACATAAAGAAAGGTACTTTCTCCCTCACCTGCTCT
ATTACCATGGCTGATGGGTTTCTTTTCAGAATAACAGGAATTTATGGTCCATCCAATTCGAATCCCTCTCCTCTTTTCTGGGAAGAACTTGAGAAACTTGCTGATATCTG
TAATGGACCATGGGTCTTAGAAGGTGACTTCAACGTTACTAGATGGACACATGAAAGATCTTCTCATAGACCGGTGTCTCCTGATATGAAAAGCTTCAATGAATTCATTG
AAAAATATTCCCTGATTGATCTTTCTTTATCTAACGGATGTATACTTGAATCATATATTAATAAAACTGAATTATTGGGTATCAACTTGGATGATTCTATTACCATGAAC
CTGTCTACCAAGATTGGCTCATGGCCAACATCTTATCTGGATGGTGGTCTCGGCATTTATAGCATAAAAGAGCGTAACAATGCTCTTCTTTCCAAATGGATTTGGAGATA
TGCTATTAACACACACGATAAGCTGCAAAGAAGACTGCCTCTTCTTCACATCTCCCCTCAATGGTGTTCCCTTTGCCAAAAACAAGCGGAGACTCAAGAGCACCTGCTGA
TCAATTGCTCATTTGCAACTTCCTTTTGGAACTTAATCCGAACAACATTTTATTGGATTTTCCCCTTGCCGAACAAGCCCTCTACTTTGCTGATGAATGTTCTTACGGCT
CATCCTTTTGATAGACAAAGTTTCATAAAGACAAAGTTGATCTTATGGGAAAACTTTATGAGGGCTTTTTGTTGGCATATTTGGAAAGAGAGGAACAATCGTATCTTCCA
GGATAAAGCTAATGCGTTCGATAGATTTATGGACAATGTTACTAACACGGTGGTCTCATGGATCAAAATTTACCCCGCTTTTGACATTTATAGCTTTGACTCTCTAATTA
ACAATTGGAGAGCATATTTCTGA
mRNA sequence
ATGGCTTCTCACCCCAGTCACATCACAAAAAACCAAAATCTCACCATTACCATCAACGAGAAACCCGCTTTTTTTATCCCTGGGACCAAATCATCTACTAACCTCCCCTC
TCATCTGTCTTATGATACAGAAAACCATTTATCCTCCCCATATCCTTCCACACCAAACACACCCAAGCCATACCTGACCGAACCAGCACCCCTCCAAATATCATACCCAC
CTTCTTCAAATTTTGCGGATGCCACCAGATCTTTTTATCGTAAGAAACCACTCAAGTTGGACCCCAGTATACATCCATCGGAGCTACTTGGAGTGATGATAACGTGGCTG
AGACCATTGGGCTTGGGCATCTTGCCCCTTCCTACGAAGGTATCTAAACAAATGAAGGAGTCACGCAAGAAAAACAACCAATATCGAGAAGTATTTAATCTGATAAATTC
CATCAACTGTCTTGGTGGCAAGGATAAGAGAGCTATGGTTAAAAGTTTAATTATTAAACACAATCCCACCTTAGTCATTCTTCAAGAAACTAAATCTCGAGATGTTGCCT
GGACTTCCCTTGATGCAGTGGGTTCATCTGGAAGCATCATCATTATGTGGAACGATCATGCATTCTCGGTTACAGACATAAAGAAAGGTACTTTCTCCCTCACCTGCTCT
ATTACCATGGCTGATGGGTTTCTTTTCAGAATAACAGGAATTTATGGTCCATCCAATTCGAATCCCTCTCCTCTTTTCTGGGAAGAACTTGAGAAACTTGCTGATATCTG
TAATGGACCATGGGTCTTAGAAGGTGACTTCAACGTTACTAGATGGACACATGAAAGATCTTCTCATAGACCGGTGTCTCCTGATATGAAAAGCTTCAATGAATTCATTG
AAAAATATTCCCTGATTGATCTTTCTTTATCTAACGGATGTATACTTGAATCATATATTAATAAAACTGAATTATTGGGTATCAACTTGGATGATTCTATTACCATGAAC
CTGTCTACCAAGATTGGCTCATGGCCAACATCTTATCTGGATGGTGGTCTCGGCATTTATAGCATAAAAGAGCGTAACAATGCTCTTCTTTCCAAATGGATTTGGAGATA
TGCTATTAACACACACGATAAGCTGCAAAGAAGACTGCCTCTTCTTCACATCTCCCCTCAATGGTGTTCCCTTTGCCAAAAACAAGCGGAGACTCAAGAGCACCTGCTGA
TCAATTGCTCATTTGCAACTTCCTTTTGGAACTTAATCCGAACAACATTTTATTGGATTTTCCCCTTGCCGAACAAGCCCTCTACTTTGCTGATGAATGTTCTTACGGCT
CATCCTTTTGATAGACAAAGTTTCATAAAGACAAAGTTGATCTTATGGGAAAACTTTATGAGGGCTTTTTGTTGGCATATTTGGAAAGAGAGGAACAATCGTATCTTCCA
GGATAAAGCTAATGCGTTCGATAGATTTATGGACAATGTTACTAACACGGTGGTCTCATGGATCAAAATTTACCCCGCTTTTGACATTTATAGCTTTGACTCTCTAATTA
ACAATTGGAGAGCATATTTCTGA
Protein sequence
MASHPSHITKNQNLTITINEKPAFFIPGTKSSTNLPSHLSYDTENHLSSPYPSTPNTPKPYLTEPAPLQISYPPSSNFADATRSFYRKKPLKLDPSIHPSELLGVMITWL
RPLGLGILPLPTKVSKQMKESRKKNNQYREVFNLINSINCLGGKDKRAMVKSLIIKHNPTLVILQETKSRDVAWTSLDAVGSSGSIIIMWNDHAFSVTDIKKGTFSLTCS
ITMADGFLFRITGIYGPSNSNPSPLFWEELEKLADICNGPWVLEGDFNVTRWTHERSSHRPVSPDMKSFNEFIEKYSLIDLSLSNGCILESYINKTELLGINLDDSITMN
LSTKIGSWPTSYLDGGLGIYSIKERNNALLSKWIWRYAINTHDKLQRRLPLLHISPQWCSLCQKQAETQEHLLINCSFATSFWNLIRTTFYWIFPLPNKPSTLLMNVLTA
HPFDRQSFIKTKLILWENFMRAFCWHIWKERNNRIFQDKANAFDRFMDNVTNTVVSWIKIYPAFDIYSFDSLINNWRAYF
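
The CDS and protein sequences above can be checked against each other by translating the CDS codon by codon. The sketch below is one way to do that, assuming the standard nuclear genetic code applies to this gene. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, laid out in the conventional TCAG order
# (third base varies fastest). Assumption: the standard nuclear
# code is the right table for this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into a protein string; '*' marks a stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be pasted in as they appear. Codons containing
    characters other than A, C, G, T are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


if __name__ == "__main__":
    # First eight codons of the CDS shown in this record.
    print(translate("ATGGCTTCTCACCCCAGTCACATC"))
```

Translating the full CDS and dropping the trailing `*` should reproduce the listed protein sequence if both were derived from the same gene model. A mismatch would point to a different genetic code or a frame problem rather than to an error in the sketch's codon table.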