; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027751 (gene) of Chayote v1 genome

Gene IDSed0027751
OrganismSechium edule (Chayote v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationLG02:43045835..43047332
RNA-Seq ExpressionSed0027751
SyntenySed0027751
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]3.9e-4763.89Show/hide
Query:  MYESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQV
        ++E WV  D L+LGWLYNSMT +VA Q+MG+ N +DLWDA Q  FG+QSRAEED+LRQ+ QTTRK N KM EYLL MK + DNLGQ G+ +P R L+SQV
Subjt:  MYESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQV

Query:  LLGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQR
        LLGLDE YN V+  IQGKPD+ W D+Q++LL+FEK L++QNTQ+
Subjt:  LLGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQR

XP_038887140.1 uncharacterized protein LOC120077331 [Benincasa hispida]5.1e-4758.38Show/hide
Query:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL
        YE+W++VDQL+LGWLYNSMT EV  QVMG   +KDLW +IQ LF +QSR EEDYLR +FQ TRK NLKM +YL +MK +ADNL + G+ +P RTLVSQVL
Subjt:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL

Query:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRTTTSF----------SHNTYVHMTNRSTSS
        LGLDEEYN++VATIQG+ DM W D+Q ELL++E+RLE+Q+ Q+ T  F          ++  +V+  N+S SS
Subjt:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRTTTSF----------SHNTYVHMTNRSTSS

XP_038904321.1 uncharacterized protein LOC120090675 [Benincasa hispida]1.1e-4665.03Show/hide
Query:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL
        Y +W+ VDQL+LGWLYNSMT ++A QVMG+E ++DLW  IQ LFGIQSRAEEDYLR +FQTTRK NLKM +YL TMK + DNL Q G+ +P R LV QVL
Subjt:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL

Query:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQR
        LGLDEEYN++VATIQG+ DM W D+Q++LL++E+RLE+Q+ ++
Subjt:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQR

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida]2.0e-5166.88Show/hide
Query:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL
        YESW+ VDQL+LGWLYNSMT EVA QVMG E +KDLW +I  LFG+QSR EEDYLR +FQTTRK NLKM EYL TMK + DNL Q G+ +P RTLVSQVL
Subjt:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL

Query:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRTTTSFSH--NTYVHMTN
        LGLDEEYN++VA IQG+ DM W D+Q+ELL++E+RLE+Q+ Q+TT  F+   N  V+MTN
Subjt:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRTTTSFSH--NTYVHMTN

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida]2.0e-5166.88Show/hide
Query:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL
        YESW+ VDQL+LGWLYNSMT EVA QVMG E +KDLW +I  LFG+QSR EEDYLR +FQTTRK NLKM EYL TMK + DNL Q G+ +P RTLVSQVL
Subjt:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL

Query:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRTTTSFSH--NTYVHMTN
        LGLDEEYN++VA IQG+ DM W D+Q+ELL++E+RLE+Q+ Q+TT  F+   N  V+MTN
Subjt:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRTTTSFSH--NTYVHMTN

TrEMBL top hitse value%identityAlignment
A0A0A0LXB7 Uncharacterized protein1.6e-3852.6Show/hide
Query:  SWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVLLG
        SW ++    +  LYNS+T EV  Q++G+ N+KD+W+A    FG++SRAEED+LRQ FQTTRK N  M +YL  MK +ADNLGQ  + IP R L+SQVLLG
Subjt:  SWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVLLG

Query:  LDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRTTTSFSHNTYVHM
        LDE YN V+  IQGKP++ W D+Q++LL+FEKRL++QN+Q+   +   N  ++M
Subjt:  LDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRTTTSFSHNTYVHM

A0A5A7SIT7 Uncharacterized protein1.9e-4763.89Show/hide
Query:  MYESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQV
        ++E WV  D L+LGWLYNSMT +VA Q+MG+ N +DLWDA Q  FG+QSRAEED+LRQ+ QTTRK N KM EYLL MK + DNLGQ G+ +P R L+SQV
Subjt:  MYESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQV

Query:  LLGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQR
        LLGLDE YN V+  IQGKPD+ W D+Q++LL+FEK L++QNTQ+
Subjt:  LLGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQR

A0A5D3E3L7 Uncharacterized protein3.5e-4162.04Show/hide
Query:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL
        YE WV  D L+LG +YNSM  +VA Q+MG+  +KDLW+AIQ LFGI+SRAEE +LR  FQTTR+ N KM +YL  MK +ADNLGQ G+ +P+R L+SQVL
Subjt:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL

Query:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLE
        LGLDE YN V A IQGKPD+ W D+Q+ELL+FE  +E
Subjt:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLE

A0A6J1D5J0 uncharacterized protein LOC1110175011.8e-4563.82Show/hide
Query:  MYESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQV
        +YESWV  DQL+LGWLYNSMT EVATQVMGYEN+ DLW AIQ LFG+QS+AEEDYLRQ+FQ TRK +LKM ++L  MK HADNLGQ G+ +P R+L+SQV
Subjt:  MYESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQV

Query:  LLGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRTTTSFSHN
        LLGLDEEYN VVATIQGK  + W ++Q E        + QN Q +   F++N
Subjt:  LLGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRTTTSFSHN

A0A6J1DCW4 uncharacterized protein LOC1110195983.7e-4358.33Show/hide
Query:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL
        YE+W++VD+L+LGWLYNSM ++VA QVMG+  S++LW A+Q LFG+QSRAE DYL+Q+FQ T K +L+M+EYL  MK HADNL   G+ +  R LVSQVL
Subjt:  YESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVL

Query:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRT
         GLDEEYN +V  +QGK ++ W+++  ELL +EKRLEYQN+ ++
Subjt:  LGLDEEYNSVVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGAATCTTGGGTCATAGTTGATCAATTGGTGCTCGGATGGCTCTATAACTCCATGACGTCAGAGGTTGCGACTCAAGTGATGGGGTATGAGAATTCCAAAGATCT
ATGGGATGCCATCCAAACTCTTTTTGGAATTCAATCTCGAGCCGAAGAAGACTACTTACGTCAACTATTCCAAACTACTCGAAAAGTTAATCTAAAAATGGTTGAATATT
TACTTACTATGAAGTGTCATGCAGATAATCTTGGCCAAACTGGAAACCTCATTCCTAATCGAACACTGGTGTCTCAAGTTTTGTTGGGGCTTGACGAGGAATACAATTCA
GTCGTTGCTACGATACAAGGAAAACCAGACATGCGATGGACTGATTTGCAAAATGAACTTTTGGTGTTTGAGAAGAGGCTAGAATATCAGAATACACAACGAACCACCAC
CTCTTTTAGCCACAATACCTATGTCCACATGACGAATAGAAGCACCAGTTCCTTAAACCCACCTAAGCCCCCTTAA
mRNA sequenceShow/hide mRNA sequence
AAAACCTTGAATTTATCTCCAGCCTCAGTGTTCTAGGTTTAACTTGGTATCGACGCTTTAAGGCCAACGCCGCTTCCATCCCGACTTCGGCAATGGTTACTCAAAATTCC
ACAACTTTTTCCACCCATCCACTCAATCAATTGCTTACTCAAATTACCTCAATTAAGCTTGATAGGAGTTTTTTTTTTGTGTGGAAAAACCTTGCCATGCCGATCTTGCG
AAGCCATAAATTAGAAGGTCATTTGTTTGGAACCATGTTGTGTCCTCCGATGTATGGAACCCAGGATGTGCCTATTGGAACAGCTGTTGCAGGTGCAAGCTTCAGTTCAG
GAGCTGTTGATGGTGGTGACTCCAGCACTACACCCCTTTAGGTTGTCAACCCAATGTATGAATCTTGGGTCATAGTTGATCAATTGGTGCTCGGATGGCTCTATAACTCC
ATGACGTCAGAGGTTGCGACTCAAGTGATGGGGTATGAGAATTCCAAAGATCTATGGGATGCCATCCAAACTCTTTTTGGAATTCAATCTCGAGCCGAAGAAGACTACTT
ACGTCAACTATTCCAAACTACTCGAAAAGTTAATCTAAAAATGGTTGAATATTTACTTACTATGAAGTGTCATGCAGATAATCTTGGCCAAACTGGAAACCTCATTCCTA
ATCGAACACTGGTGTCTCAAGTTTTGTTGGGGCTTGACGAGGAATACAATTCAGTCGTTGCTACGATACAAGGAAAACCAGACATGCGATGGACTGATTTGCAAAATGAA
CTTTTGGTGTTTGAGAAGAGGCTAGAATATCAGAATACACAACGAACCACCACCTCTTTTAGCCACAATACCTATGTCCACATGACGAATAGAAGCACCAGTTCCTTAAA
CCCACCTAAGCCCCCTTAACAGAATGCAAATACCAGTTTTGGTGGCAGACAACAATTTGGAATGGTACACGAGGTGGTCATCATGGATATCGTGGACGAGGGCGAGGTCG
TGGCTACAATGGAACTCCACACAACACTCAAAATAGTTCTGCCAATCGGACAATCTGCTAATTTTGTGGTCGTACAGGGCACATAGCTCTTTCCTGCTACAACCTACGCC
ATTTTTACCCGCCCATATGCAAGGGTATGGAAATCCCGCCAATTTTCAAGTGCATGGAAATAATACAAATGGTGTGCAAACTCGTAACATCCAACCCTCCACGACTTTTA
TGGCTACACCTTATGGAAATCAATCTGTTGCTAGTCCTGAGACCGTTGTTGATCCCTCGTGGTATGTTGACAGTGGAGCTTCTAGCCATATCATTTCTGATCTTGCCAGT
CTAACCAATCCAGTTGAATATGGAGGTACAGATTATGTTGTTGTTGGTAATGGATCTAAACTTCCTATTTCGTTTGTTGGTCAAACATGCATAAATAATGGACAATGCAA
TTTAAATCTTGATTATGTGCTATATGTGCCAGAAATTTCAAAAAATGTAGTCAGTGTATCTAAACTTG
Protein sequenceShow/hide protein sequence
MYESWVIVDQLVLGWLYNSMTSEVATQVMGYENSKDLWDAIQTLFGIQSRAEEDYLRQLFQTTRKVNLKMVEYLLTMKCHADNLGQTGNLIPNRTLVSQVLLGLDEEYNS
VVATIQGKPDMRWTDLQNELLVFEKRLEYQNTQRTTTSFSHNTYVHMTNRSTSSLNPPKPP