CuGenDBv2

Tan0001032 (gene) of Snake gourd v1 genome

Gene ID: Tan0001032
Organism: Trichosanthes anguina (Snake gourd v1)
Description: ULP_PROTEASE domain-containing protein
Genome location: LG01:94052508..94060036
RNA-Seq Expression: Tan0001032
Synteny: Tan0001032
Gene Ontology terms: NA
InterPro domains: IPR004264 - Transposase, Tnp1/En/Spm-like


Homology
GenBank top hits (e value, % identity, alignment)
KAE8649224.1 hypothetical protein Csa_014966 [Cucumis sativus], e value 9.1e-46, identity 55.17%
Query:  LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH--------------------
        +LKG PCHLAIGS DNVVA+G M+ SD Q  T+HG+PLG +N+RV VD+I+ ED  LPIP++GE+E+L+Q++GNFVAWP                     
Subjt:  LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH--------------------

Query:  --------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLIYLHRDDILHYCGMVEIGYSWIVAYIAYLW
                 HVTIKLLNRY + +MQ  D IQI LNEH+FG+EK IYL  DDI+ YCGM EIGYS I+ YIA LW
Subjt:  --------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLIYLHRDDILHYCGMVEIGYSWIVAYIAYLW

XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo], e value 1.2e-45, identity 47.16%
Query:  QSPASRETSTSRTASSRRSH-----LHGRESSPSPPSLSEEWEVSTLYLL-----LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRV
        QS    ET  SR++ SR+         G++       + E  E   + +L     + KG PCHLAIGS DNVVAVG M+ SD Q  T+HG+PLG EN+RV
Subjt:  QSPASRETSTSRTASSRRSH-----LHGRESSPSPPSLSEEWEVSTLYLL-----LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRV

Query:  VVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH----------------------------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLI
         VD+ + ED  LPIP++G++E+L+Q++GNFVAWP                              HVTIKLLNRY M +MQ +D IQI+L+EH+FG+EK I
Subjt:  VVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH----------------------------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLI

Query:  YLHRDDILHYCGMVEIGYSWIVAYIAYLW
        YL RDDI+ YCGM EIGYS I+ YIA LW
Subjt:  YLHRDDILHYCGMVEIGYSWIVAYIAYLW

XP_016901190.1 PREDICTED: uncharacterized protein LOC103493028 isoform X2 [Cucumis melo], e value 1.2e-45, identity 47.16%
Query:  QSPASRETSTSRTASSRRSH-----LHGRESSPSPPSLSEEWEVSTLYLL-----LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRV
        QS    ET  SR++ SR+         G++       + E  E   + +L     + KG PCHLAIGS DNVVAVG M+ SD Q  T+HG+PLG EN+RV
Subjt:  QSPASRETSTSRTASSRRSH-----LHGRESSPSPPSLSEEWEVSTLYLL-----LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRV

Query:  VVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH----------------------------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLI
         VD+ + ED  LPIP++G++E+L+Q++GNFVAWP                              HVTIKLLNRY M +MQ +D IQI+L+EH+FG+EK I
Subjt:  VVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH----------------------------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLI

Query:  YLHRDDILHYCGMVEIGYSWIVAYIAYLW
        YL RDDI+ YCGM EIGYS I+ YIA LW
Subjt:  YLHRDDILHYCGMVEIGYSWIVAYIAYLW

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia], e value 6.5e-44, identity 52.6%
Query:  LKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH---------------------
        ++G PCHLA+ S DN+VAVGT++ ++ Q  TVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q++G FVAWP                      
Subjt:  LKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH---------------------

Query:  -------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLIYLHRDDILHYCGMVEIGYSWIVAYIAYLW
                HV+IKLLNRYVMLSMQ +DT++I L++ +FG+EK IYL R+DI+ YC M+EIGYS I+ YIAYLW
Subjt:  -------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLIYLHRDDILHYCGMVEIGYSWIVAYIAYLW

XP_031740251.1 uncharacterized protein LOC101213947 [Cucumis sativus], e value 9.1e-46, identity 55.17%
Query:  LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH--------------------
        +LKG PCHLAIGS DNVVA+G M+ SD Q  T+HG+PLG +N+RV VD+I+ ED  LPIP++GE+E+L+Q++GNFVAWP                     
Subjt:  LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH--------------------

Query:  --------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLIYLHRDDILHYCGMVEIGYSWIVAYIAYLW
                 HVTIKLLNRY + +MQ  D IQI LNEH+FG+EK IYL  DDI+ YCGM EIGYS I+ YIA LW
Subjt:  --------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLIYLHRDDILHYCGMVEIGYSWIVAYIAYLW

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X1, e value 5.8e-46, identity 47.16%
Query:  QSPASRETSTSRTASSRRSH-----LHGRESSPSPPSLSEEWEVSTLYLL-----LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRV
        QS    ET  SR++ SR+         G++       + E  E   + +L     + KG PCHLAIGS DNVVAVG M+ SD Q  T+HG+PLG EN+RV
Subjt:  QSPASRETSTSRTASSRRSH-----LHGRESSPSPPSLSEEWEVSTLYLL-----LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRV

Query:  VVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH----------------------------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLI
         VD+ + ED  LPIP++G++E+L+Q++GNFVAWP                              HVTIKLLNRY M +MQ +D IQI+L+EH+FG+EK I
Subjt:  VVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH----------------------------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLI

Query:  YLHRDDILHYCGMVEIGYSWIVAYIAYLW
        YL RDDI+ YCGM EIGYS I+ YIA LW
Subjt:  YLHRDDILHYCGMVEIGYSWIVAYIAYLW

A0A1S4DZN2 uncharacterized protein LOC103493028 isoform X2, e value 5.8e-46, identity 47.16%
Query:  QSPASRETSTSRTASSRRSH-----LHGRESSPSPPSLSEEWEVSTLYLL-----LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRV
        QS    ET  SR++ SR+         G++       + E  E   + +L     + KG PCHLAIGS DNVVAVG M+ SD Q  T+HG+PLG EN+RV
Subjt:  QSPASRETSTSRTASSRRSH-----LHGRESSPSPPSLSEEWEVSTLYLL-----LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRV

Query:  VVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH----------------------------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLI
         VD+ + ED  LPIP++G++E+L+Q++GNFVAWP                              HVTIKLLNRY M +MQ +D IQI+L+EH+FG+EK I
Subjt:  VVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH----------------------------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLI

Query:  YLHRDDILHYCGMVEIGYSWIVAYIAYLW
        YL RDDI+ YCGM EIGYS I+ YIA LW
Subjt:  YLHRDDILHYCGMVEIGYSWIVAYIAYLW

A0A5D3CYL9 ULP_PROTEASE domain-containing protein, e value 5.8e-46, identity 47.16%
Query:  QSPASRETSTSRTASSRRSH-----LHGRESSPSPPSLSEEWEVSTLYLL-----LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRV
        QS    ET  SR++ SR+         G++       + E  E   + +L     + KG PCHLAIGS DNVVAVG M+ SD Q  T+HG+PLG EN+RV
Subjt:  QSPASRETSTSRTASSRRSH-----LHGRESSPSPPSLSEEWEVSTLYLL-----LLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRV

Query:  VVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH----------------------------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLI
         VD+ + ED  LPIP++G++E+L+Q++GNFVAWP                              HVTIKLLNRY M +MQ +D IQI+L+EH+FG+EK I
Subjt:  VVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH----------------------------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLI

Query:  YLHRDDILHYCGMVEIGYSWIVAYIAYLW
        YL RDDI+ YCGM EIGYS I+ YIA LW
Subjt:  YLHRDDILHYCGMVEIGYSWIVAYIAYLW

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X4, e value 3.2e-44, identity 52.6%
Query:  LKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH---------------------
        ++G PCHLA+ S DN+VAVGT++ ++ Q  TVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q++G FVAWP                      
Subjt:  LKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH---------------------

Query:  -------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLIYLHRDDILHYCGMVEIGYSWIVAYIAYLW
                HV+IKLLNRYVMLSMQ +DT++I L++ +FG+EK IYL R+DI+ YC M+EIGYS I+ YIAYLW
Subjt:  -------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLIYLHRDDILHYCGMVEIGYSWIVAYIAYLW

A0A6J1C398 uncharacterized protein LOC111007859 isoform X3, e value 3.2e-44, identity 52.6%
Query:  LKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH---------------------
        ++G PCHLA+ S DN+VAVGT++ ++ Q  TVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q++G FVAWP                      
Subjt:  LKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPH---------------------

Query:  -------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLIYLHRDDILHYCGMVEIGYSWIVAYIAYLW
                HV+IKLLNRYVMLSMQ +DT++I L++ +FG+EK IYL R+DI+ YC M+EIGYS I+ YIAYLW
Subjt:  -------AHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLIYLHRDDILHYCGMVEIGYSWIVAYIAYLW

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGAGGATTGCTATTTCAGGGGACGATTGTGTGTGCAAAGCACGACAAACCGAAAAATCGAAATTTTGAGTGAAGCCCATAACTCACTATTTTCTATGCATCCTGGCAA
AGTACCAAGATGTACCGAGATCTCAAAGCGATCGTTTTGTTGGAGTGGTGTTTTAATCAAACCCGGCGAGAGGGAGACCGTAATTTTGTTTAGCTTATATGAATTGTTCC
TTAAGATGAGATTAATTATGAAAATTAAAGAAAAAGGGCAATCTCAAGGGCGTTTTGGGGATTATTGTTTAAGTGAGGGGTGTGTAAAATTTAACCTAACCCTAGCCCTT
AGTCTCTTGGTATTTAACCATAAGAACAAAACCCTAGTCCAATCACATCACCTCCATCTCATTTTACAAACCCTAACCCTAATTTACCAGTCGCCTGCAAGCCGGGAAAC
CTCGACCTCGCGAACAGCCTCCAGCCGCCGTTCACACCTGCACGGCCGAGAGTCCTCTCCTTCACCGCCGTCATTATCGGAAGAATGGGAGGTAAGTACATTATACTTAT
TGCTTTTAAAAGGAACTCCATGCCATCTAGCAATAGGATCAAAGGATAATGTGGTTGCTGTAGGCACAATGTACACGTCTGACGCTCAATTTTCCACAGTCCATGGAGTT
CCCTTAGGAGTTGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGTGAAGATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCCCTGAGTCAATCTATGGG
AAATTTTGTGGCATGGCCTCATGCTCACGTGACTATTAAACTTCTGAATCGTTATGTAATGTTATCGATGCAAGAAGATGATACGATTCAAATCACGTTGAACGAGCACA
TGTTCGGGGAGGAAAAGTTAATTTATTTACATCGCGATGATATCCTGCATTATTGTGGGATGGTGGAGATAGGGTACTCCTGGATAGTCGCATACATTGCGTATCTTTGG
ACTTTTAACACAAAAACTGCATATAAACAAGAAGAAATCGACGAGATTCGAGTACAATGGACGAATTTTGTTGGTAGATTTGTGTAA
mRNA sequence
ATGGAGGATTGCTATTTCAGGGGACGATTGTGTGTGCAAAGCACGACAAACCGAAAAATCGAAATTTTGAGTGAAGCCCATAACTCACTATTTTCTATGCATCCTGGCAA
AGTACCAAGATGTACCGAGATCTCAAAGCGATCGTTTTGTTGGAGTGGTGTTTTAATCAAACCCGGCGAGAGGGAGACCGTAATTTTGTTTAGCTTATATGAATTGTTCC
TTAAGATGAGATTAATTATGAAAATTAAAGAAAAAGGGCAATCTCAAGGGCGTTTTGGGGATTATTGTTTAAGTGAGGGGTGTGTAAAATTTAACCTAACCCTAGCCCTT
AGTCTCTTGGTATTTAACCATAAGAACAAAACCCTAGTCCAATCACATCACCTCCATCTCATTTTACAAACCCTAACCCTAATTTACCAGTCGCCTGCAAGCCGGGAAAC
CTCGACCTCGCGAACAGCCTCCAGCCGCCGTTCACACCTGCACGGCCGAGAGTCCTCTCCTTCACCGCCGTCATTATCGGAAGAATGGGAGGTAAGTACATTATACTTAT
TGCTTTTAAAAGGAACTCCATGCCATCTAGCAATAGGATCAAAGGATAATGTGGTTGCTGTAGGCACAATGTACACGTCTGACGCTCAATTTTCCACAGTCCATGGAGTT
CCCTTAGGAGTTGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGTGAAGATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCCCTGAGTCAATCTATGGG
AAATTTTGTGGCATGGCCTCATGCTCACGTGACTATTAAACTTCTGAATCGTTATGTAATGTTATCGATGCAAGAAGATGATACGATTCAAATCACGTTGAACGAGCACA
TGTTCGGGGAGGAAAAGTTAATTTATTTACATCGCGATGATATCCTGCATTATTGTGGGATGGTGGAGATAGGGTACTCCTGGATAGTCGCATACATTGCGTATCTTTGG
ACTTTTAACACAAAAACTGCATATAAACAAGAAGAAATCGACGAGATTCGAGTACAATGGACGAATTTTGTTGGTAGATTTGTGTAA
Protein sequence
MEDCYFRGRLCVQSTTNRKIEILSEAHNSLFSMHPGKVPRCTEISKRSFCWSGVLIKPGERETVILFSLYELFLKMRLIMKIKEKGQSQGRFGDYCLSEGCVKFNLTLAL
SLLVFNHKNKTLVQSHHLHLILQTLTLIYQSPASRETSTSRTASSRRSHLHGRESSPSPPSLSEEWEVSTLYLLLLKGTPCHLAIGSKDNVVAVGTMYTSDAQFSTVHGV
PLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPHAHVTIKLLNRYVMLSMQEDDTIQITLNEHMFGEEKLIYLHRDDILHYCGMVEIGYSWIVAYIAYLW
TFNTKTAYKQEEIDEIRVQWTNFVGRFV