; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021568 (gene) of Snake gourd v1 genome

Gene IDTan0021568
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPollen Ole e 1 allergen and extensin family protein, putative
Genome locationLG05:885087..885811
RNA-Seq ExpressionTan0021568
SyntenyTan0021568
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605976.1 hypothetical protein SDJN03_03293, partial [Cucurbita argyrosperma subsp. sororia]3.2e-7181.67Show/hide
Query:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG
        MAL SLVT  LL  + A  E S ST+ ILKGKVSCLDCDAAYDLS IVVM KCEK GKVVTATTAKDGGF  ELPSDECEARLGGGRNQLYA RKDMVA 
Subjt:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG

Query:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        IVR    GSGS DVYGTSTPLAFCSGCRCR  I S+  K CKAAARKFGSSKTF+LPLPPEWGLAPSSYYFPFFPIIGIP
Subjt:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

XP_022958347.1 uncharacterized protein LOC111459595 [Cucurbita moschata]1.8e-7181.67Show/hide
Query:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG
        MAL SLVT  LL  + A  E S ST+ ILKGKVSCLDCDAAYDLS IVVM KCEK GKVVTATTAKDGGF  ELPSDECEARLGGGRNQLYA RKDMVA 
Subjt:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG

Query:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        IVR    GSGS DVYGTSTPLAFCSGCRCR  I S+  K CKAAARKFGSSKTF+LPLPPEWGLAPSSYYFPFFPIIGIP
Subjt:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

XP_022995160.1 uncharacterized protein LOC111490782 [Cucurbita maxima]5.4e-7180.56Show/hide
Query:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG
        MAL SLVT  LLA + A  E STST+ ILKGKVSCLDCDA YDLS IVVM KCEK G+VVTATTAKDGGF  ELPSDECEARLGGGRNQLYA RKDMVA 
Subjt:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG

Query:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        IVR    GSGS DVYGTSTPLAFCSGCRCR  I ++  + CKAAARKFGSSKTF+LPLPPEWGLAPSSYYFPFFPIIGIP
Subjt:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

XP_023533664.1 uncharacterized protein LOC111795458 [Cucurbita pepo subsp. pepo]1.5e-6878.33Show/hide
Query:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG
        MAL SLVT  LL  + A  + S ST+ ILKGKVSCLDCDA YDLS IVVM KCE+ GKVVTATTAKDGGF  ELPSDECEARLGGGRNQLYA RKDMVA 
Subjt:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG

Query:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        IV     GSGS D+YGTSTPLAFCSGCRCR  I S+  + CKAAARKFGSSKTF+LPLPPEWGLAPSSYYFPFFPIIGIP
Subjt:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

XP_038902068.1 uncharacterized protein LOC120088711 [Benincasa hispida]3.7e-7281.97Show/hide
Query:  MALVSLVT-VFLLAMVV--ARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDM
        MA+ SLVT  FLL +VV  AR E+ST T+H+LKGKV CLDCDAAYDLSGIVVMAKCEKV KVVTATTAKDGGF AELPSD+CEARLGGGRNQLYA RKDM
Subjt:  MALVSLVT-VFLLAMVV--ARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDM

Query:  VAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        VAGIVR   G  GSDDVYG +TPLAFCS CRCR + +SEAEKYCKAA  KFGSSKTFNLPLPPEWG+APSSYYFPFFPIIGIP
Subjt:  VAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

TrEMBL top hitse value%identityAlignment
A0A0A0KGW8 Uncharacterized protein2.1e-6574.21Show/hide
Query:  MALVSLV-TVFLLAMVV------ARFEVSTSTSH-ILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYA
        MA+ SLV   F+L +VV      A  E++ +T H ILKGKV CLDC A+YDLSGIVVMAKCEKVGKVVTATTA DGGF AELPSDECEARL GGRNQLYA
Subjt:  MALVSLV-TVFLLAMVV------ARFEVSTSTSH-ILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYA

Query:  TRKDMVAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPM--ISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        +RKD+VAGIV+   G  GSD++YG STPLAFCS CRCR +   S+EAEKYCKA A KFGSSKTFNLPLPPEWG+APSSYYFPFFPIIGIP
Subjt:  TRKDMVAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPM--ISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

A0A1S3BLR1 uncharacterized protein LOC1034914222.1e-6574.03Show/hide
Query:  LVSLVTVFLLAMVVARFEVSTSTSH-ILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAGI
        ++ +V V   A   A  E++ +T H ILKGKV CLDC A+YDL+GIVVMAKCEKVGKVVTATTAKDGGF AELPSDECEARL GGRNQLYA  KDMVAGI
Subjt:  LVSLVTVFLLAMVVARFEVSTSTSH-ILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAGI

Query:  VRASSGGSGSDDVYGTSTPLAFCSGCRCRPM--ISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        V+   G  GSD++YG STPLAFCS CRCR +   S+EAEKYCK  A KFGSSKTFNLPLPPEWG+APSSYYFPFFPIIGIP
Subjt:  VRASSGGSGSDDVYGTSTPLAFCSGCRCRPM--ISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

A0A5D3BC61 Bile acid-inducible operon CD2.1e-6574.03Show/hide
Query:  LVSLVTVFLLAMVVARFEVSTSTSH-ILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAGI
        ++ +V V   A   A  E++ +T H ILKGKV CLDC A+YDL+GIVVMAKCEKVGKVVTATTAKDGGF AELPSDECEARL GGRNQLYA  KDMVAGI
Subjt:  LVSLVTVFLLAMVVARFEVSTSTSH-ILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAGI

Query:  VRASSGGSGSDDVYGTSTPLAFCSGCRCRPM--ISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        V+   G  GSD++YG STPLAFCS CRCR +   S+EAEKYCK  A KFGSSKTFNLPLPPEWG+APSSYYFPFFPIIGIP
Subjt:  VRASSGGSGSDDVYGTSTPLAFCSGCRCRPM--ISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

A0A6J1H1L0 uncharacterized protein LOC1114595958.9e-7281.67Show/hide
Query:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG
        MAL SLVT  LL  + A  E S ST+ ILKGKVSCLDCDAAYDLS IVVM KCEK GKVVTATTAKDGGF  ELPSDECEARLGGGRNQLYA RKDMVA 
Subjt:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG

Query:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        IVR    GSGS DVYGTSTPLAFCSGCRCR  I S+  K CKAAARKFGSSKTF+LPLPPEWGLAPSSYYFPFFPIIGIP
Subjt:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

A0A6J1K3C6 uncharacterized protein LOC1114907822.6e-7180.56Show/hide
Query:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG
        MAL SLVT  LLA + A  E STST+ ILKGKVSCLDCDA YDLS IVVM KCEK G+VVTATTAKDGGF  ELPSDECEARLGGGRNQLYA RKDMVA 
Subjt:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAG

Query:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        IVR    GSGS DVYGTSTPLAFCSGCRCR  I ++  + CKAAARKFGSSKTF+LPLPPEWGLAPSSYYFPFFPIIGIP
Subjt:  IVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G27385.1 Pollen Ole e 1 allergen and extensin family protein1.9e-2642.93Show/hide
Query:  MALVS-LVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPS---DECEARLGGGRNQLYATRKD
        MAL S  + VFL +  ++   V +++  +++GKVSC DC   YD SGI V   C       T TT K G F++ELPS     CEA L G   QLYA++ +
Subjt:  MALVS-LVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPS---DECEARLGGGRNQLYATRKD

Query:  MVAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        + + IV+   GG    D YG S+ L F               K C  +   F SSKT +LP+PPEWGLAP+SYY PF PIIGIP
Subjt:  MVAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

AT2G27385.2 Pollen Ole e 1 allergen and extensin family protein1.9e-2642.93Show/hide
Query:  MALVS-LVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPS---DECEARLGGGRNQLYATRKD
        MAL S  + VFL +  ++   V +++  +++GKVSC DC   YD SGI V   C       T TT K G F++ELPS     CEA L G   QLYA++ +
Subjt:  MALVS-LVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPS---DECEARLGGGRNQLYATRKD

Query:  MVAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        + + IV+   GG    D YG S+ L F               K C  +   F SSKT +LP+PPEWGLAP+SYY PF PIIGIP
Subjt:  MVAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

AT2G27385.3 Pollen Ole e 1 allergen and extensin family protein1.9e-2642.93Show/hide
Query:  MALVS-LVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPS---DECEARLGGGRNQLYATRKD
        MAL S  + VFL +  ++   V +++  +++GKVSC DC   YD SGI V   C       T TT K G F++ELPS     CEA L G   QLYA++ +
Subjt:  MALVS-LVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPS---DECEARLGGGRNQLYATRKD

Query:  MVAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
        + + IV+   GG    D YG S+ L F               K C  +   F SSKT +LP+PPEWGLAP+SYY PF PIIGIP
Subjt:  MVAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP

AT5G22430.1 Pollen Ole e 1 allergen and extensin family protein7.9e-2037.1Show/hide
Query:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELP------SDECEARLGGGRNQLYATR
        MA  +    F L  +     +  S S ++ GK+SCLDC   +D SGI V+ KC+   K +TA  A DG F + LP      S  C A+L GG  QLYA +
Subjt:  MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELP------SDECEARLGGGRNQLYATR

Query:  KDMVAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP
         ++V+ +V++         V  TS PLAF   C   P  S +            G SKT N P    +G  P+S +FPF PIIGIP
Subjt:  KDMVAGIVRASSGGSGSDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCTTGTAAGCTTGGTTACAGTATTTCTGTTGGCAATGGTTGTTGCGAGATTTGAGGTTTCAACCTCAACCAGTCACATTCTGAAAGGGAAGGTGTCTTGCCTTGA
CTGCGATGCCGCCTATGATCTCTCAGGGATCGTGGTGATGGCGAAGTGCGAAAAGGTTGGGAAGGTAGTTACAGCTACTACGGCGAAAGATGGTGGTTTCGTGGCGGAGC
TGCCTTCCGATGAGTGCGAGGCCAGGCTCGGCGGTGGTCGGAACCAGCTCTACGCCACCAGAAAAGACATGGTCGCCGGAATCGTGAGGGCCAGTAGTGGTGGTTCGGGT
TCCGATGACGTGTACGGCACCTCCACTCCGCTGGCGTTTTGCAGTGGGTGCCGGTGCCGGCCGATGATCAGCAGCGAAGCCGAGAAATATTGCAAGGCGGCGGCGAGGAA
ATTTGGGTCTTCAAAGACCTTCAACCTTCCTCTGCCACCTGAGTGGGGGCTGGCGCCATCTAGCTACTATTTTCCTTTCTTCCCTATCATCGGCATCCCTTAG
mRNA sequenceShow/hide mRNA sequence
CTGGGCTTCAAAATTGCAACTAATTTTTTTTCCCCTGTAAAATTATGGCTCTTGTAAGCTTGGTTACAGTATTTCTGTTGGCAATGGTTGTTGCGAGATTTGAGGTTTCA
ACCTCAACCAGTCACATTCTGAAAGGGAAGGTGTCTTGCCTTGACTGCGATGCCGCCTATGATCTCTCAGGGATCGTGGTGATGGCGAAGTGCGAAAAGGTTGGGAAGGT
AGTTACAGCTACTACGGCGAAAGATGGTGGTTTCGTGGCGGAGCTGCCTTCCGATGAGTGCGAGGCCAGGCTCGGCGGTGGTCGGAACCAGCTCTACGCCACCAGAAAAG
ACATGGTCGCCGGAATCGTGAGGGCCAGTAGTGGTGGTTCGGGTTCCGATGACGTGTACGGCACCTCCACTCCGCTGGCGTTTTGCAGTGGGTGCCGGTGCCGGCCGATG
ATCAGCAGCGAAGCCGAGAAATATTGCAAGGCGGCGGCGAGGAAATTTGGGTCTTCAAAGACCTTCAACCTTCCTCTGCCACCTGAGTGGGGGCTGGCGCCATCTAGCTA
CTATTTTCCTTTCTTCCCTATCATCGGCATCCCTTAG
Protein sequenceShow/hide protein sequence
MALVSLVTVFLLAMVVARFEVSTSTSHILKGKVSCLDCDAAYDLSGIVVMAKCEKVGKVVTATTAKDGGFVAELPSDECEARLGGGRNQLYATRKDMVAGIVRASSGGSG
SDDVYGTSTPLAFCSGCRCRPMISSEAEKYCKAAARKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGIP