CuGenDBv2

Tan0001982 (gene) of Snake gourd v1 genome

Gene ID: Tan0001982
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pollen Ole e 1 allergen and extensin family protein
Genome location: LG05:52557710..52559714
RNA-Seq Expression: Tan0001982
Synteny: Tan0001982
Gene Ontology terms: NA
InterPro domains: NA
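
The genome location field follows a `sequence:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1. Adjust if the source uses another convention.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start")
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("LG05:52557710..52559714")
print(seqid, start, end, span)
```

Under that assumption the gene spans 2005 bases on LG05.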


Homology
GenBank top hits
KAG6573632.1 hypothetical protein SDJN03_27519, partial [Cucurbita argyrosperma subsp. sororia] (e value: 8.6e-66; identity: 75.14%)
Query:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSN---NDSQAARPKCIAKLVGGPHQ
        MASL KLFII  LL +VLA TQLAQCNTLKANISCLDCQSNYDFSGNMI V C  VKNL+++IT+ +GSF+T LPSN   +DS+ A   CIAKL+GGPHQ
Subjt:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSN---NDSQAARPKCIAKLVGGPHQ

Query:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
        LYASRK M S +IK TNSNFFT+A AL FSTCK N KCLS+K  + DSKTIDLPLPPEWG PPTSYYFPVLPIIGIP
Subjt:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP

XP_008461260.1 PREDICTED: uncharacterized protein LOC103499896 [Cucumis melo] (e value: 8.9e-63; identity: 75.82%)
Query:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNN---DSQAA--RPKCIAKLVGGP
        MASL KLFI P+ L MVLA+TQLAQCNTLKA ISCLDCQSNYDFSGN+I VKC++VKNLT++ITK DGSFET LPS+    DS+AA   PKCIAKLVGG 
Subjt:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNN---DSQAA--RPKCIAKLVGGP

Query:  HQLYASRKDMVSPIIKATNSNFFTIATALKFSTCKQ-NRKCLSM-KELIVDSKTIDLP-LPPEWGLPPTSYYFPVLPIIGIP
        HQL+ASRK++VS IIK TNS FFTIATAL+FSTCK+ NRKC ++ KE I DSKT DLP LPPEWG PPTSYY PVLPIIGIP
Subjt:  HQLYASRKDMVSPIIKATNSNFFTIATALKFSTCKQ-NRKCLSM-KELIVDSKTIDLP-LPPEWGLPPTSYYFPVLPIIGIP

XP_022945542.1 uncharacterized protein LOC111449745 [Cucurbita moschata] (e value: 4.3e-65; identity: 74.01%)
Query:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSN---NDSQAARPKCIAKLVGGPHQ
        MASL KLFII  LL +VLA TQLAQCNTLKA+ISCLDCQSNYDFSGNM+ V C  VKNL+++IT+ +GSF+T LPSN   +DS+ A   CIAKL+GGPHQ
Subjt:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSN---NDSQAARPKCIAKLVGGPHQ

Query:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
        LYASRK M S +IK TNSNFFT+A AL FSTCK N KCLS+K  + DSKTIDLPLPPEWG PPTSYYFPVLPIIGIP
Subjt:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP

XP_022967108.1 uncharacterized protein LOC111466611 [Cucurbita maxima] (e value: 1.3e-66; identity: 76.27%)
Query:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPS---NNDSQAARPKCIAKLVGGPHQ
        MASL KLFII  LLLM+LA TQLAQCNTLKANISCLDCQSNYDFSGNMI V C  VKNL+V+IT+ +GSF+T LPS   ++DS+AA   CIAKL+GGPHQ
Subjt:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPS---NNDSQAARPKCIAKLVGGPHQ

Query:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
        LYASRK M S +IK TNSNFFT+A AL FSTCK N KCLS+K  + DSKTIDLPLPPEWG PPTSYYFPVLPIIGIP
Subjt:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP

XP_023542373.1 uncharacterized protein LOC111802294 [Cucurbita pepo subsp. pepo] (e value: 3.0e-66; identity: 76.84%)
Query:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPS---NNDSQAARPKCIAKLVGGPHQ
        MASL KLFII  LLLMVLA TQLAQCNTLKANISCLDCQSNYDFSGNMI V C  VKNL+V+IT+ +GSF+T LPS   ++DS+AA   CIAKL GGPHQ
Subjt:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPS---NNDSQAARPKCIAKLVGGPHQ

Query:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
        LYASRK M S +IK TNSNFFT+A AL FSTCK N KCLS+K  + DSKTIDLPLPPEWG PPTSYYFPVLPIIGIP
Subjt:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP

TrEMBL top hits
A0A0A0LYA1 Uncharacterized protein (e value: 6.9e-61; identity: 73.18%)
Query:  MASLIKLFIIPLLLLMVL-ATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNNDSQAA--RPKCIAKLVGGPHQ
        MASL KLFI P  + +VL A+TQ AQCNTLKA ISCLDCQSNYDFSGN+I VKC++ KNLT++ITK DGSFET LPSN  S+AA   PKCIAKL+GG HQ
Subjt:  MASLIKLFIIPLLLLMVL-ATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNNDSQAA--RPKCIAKLVGGPHQ

Query:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQ-NRKCLSM-KELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
        L+ASRK+MVS IIK TNS FFTIATALKFSTCK+ +R C ++ KE + DSKT D PLPPEWG PPTSYY PVLPIIGIP
Subjt:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQ-NRKCLSM-KELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP

A0A1S3CDU1 uncharacterized protein LOC103499896 (e value: 4.3e-63; identity: 75.82%)
Query:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNN---DSQAA--RPKCIAKLVGGP
        MASL KLFI P+ L MVLA+TQLAQCNTLKA ISCLDCQSNYDFSGN+I VKC++VKNLT++ITK DGSFET LPS+    DS+AA   PKCIAKLVGG 
Subjt:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNN---DSQAA--RPKCIAKLVGGP

Query:  HQLYASRKDMVSPIIKATNSNFFTIATALKFSTCKQ-NRKCLSM-KELIVDSKTIDLP-LPPEWGLPPTSYYFPVLPIIGIP
        HQL+ASRK++VS IIK TNS FFTIATAL+FSTCK+ NRKC ++ KE I DSKT DLP LPPEWG PPTSYY PVLPIIGIP
Subjt:  HQLYASRKDMVSPIIKATNSNFFTIATALKFSTCKQ-NRKCLSM-KELIVDSKTIDLP-LPPEWGLPPTSYYFPVLPIIGIP

A0A6J1CMJ0 uncharacterized protein LOC111012639 (e value: 2.8e-62; identity: 73.86%)
Query:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSN-NDSQAARPKCIAKLVGGPHQLY
        MASL  L  +P LLLMVLATTQ AQC TL+A ISCLDC+SNYDFSGN I VKC+KVKNL V+IT+ DGSFET LPS+ ++SQ+    CIAKLVGGPHQLY
Subjt:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSN-NDSQAARPKCIAKLVGGPHQLY

Query:  ASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMK-ELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
        ASRKDM S +IKATNS FFTIATALKFSTCKQ+ KC +MK + I DSKT+DLPLP EWGL P+SYY P LPIIGIP
Subjt:  ASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMK-ELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP

A0A6J1G179 uncharacterized protein LOC111449745 (e value: 2.1e-65; identity: 74.01%)
Query:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSN---NDSQAARPKCIAKLVGGPHQ
        MASL KLFII  LL +VLA TQLAQCNTLKA+ISCLDCQSNYDFSGNM+ V C  VKNL+++IT+ +GSF+T LPSN   +DS+ A   CIAKL+GGPHQ
Subjt:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSN---NDSQAARPKCIAKLVGGPHQ

Query:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
        LYASRK M S +IK TNSNFFT+A AL FSTCK N KCLS+K  + DSKTIDLPLPPEWG PPTSYYFPVLPIIGIP
Subjt:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP

A0A6J1HU61 uncharacterized protein LOC111466611 (e value: 6.4e-67; identity: 76.27%)
Query:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPS---NNDSQAARPKCIAKLVGGPHQ
        MASL KLFII  LLLM+LA TQLAQCNTLKANISCLDCQSNYDFSGNMI V C  VKNL+V+IT+ +GSF+T LPS   ++DS+AA   CIAKL+GGPHQ
Subjt:  MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPS---NNDSQAARPKCIAKLVGGPHQ

Query:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
        LYASRK M S +IK TNSNFFT+A AL FSTCK N KCLS+K  + DSKTIDLPLPPEWG PPTSYYFPVLPIIGIP
Subjt:  LYASRKDMVSPIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP

SwissProt top hits
No hits found

Arabidopsis top hits
AT2G27385.1 Pollen Ole e 1 allergen and extensin family protein (e value: 6.2e-22; identity: 35.33%)
Query:  FIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNNDSQAARPKCIAKLVGGPHQLYASRKDMVS
        F+   L    L++  +   + ++  +SC DC ++YD+SG  + V C          T   G F + LPS  +S      C A+L G   QLYAS+ ++ S
Subjt:  FIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNNDSQAARPKCIAKLVGGPHQLYASRKDMVS

Query:  PIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
         I+K     +   +      +C ++    S       SKT+DLP+PPEWGL PTSYY P LPIIGIP
Subjt:  PIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP

AT2G27385.2 Pollen Ole e 1 allergen and extensin family protein (e value: 6.2e-22; identity: 35.33%)
Query:  FIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNNDSQAARPKCIAKLVGGPHQLYASRKDMVS
        F+   L    L++  +   + ++  +SC DC ++YD+SG  + V C          T   G F + LPS  +S      C A+L G   QLYAS+ ++ S
Subjt:  FIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNNDSQAARPKCIAKLVGGPHQLYASRKDMVS

Query:  PIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
         I+K     +   +      +C ++    S       SKT+DLP+PPEWGL PTSYY P LPIIGIP
Subjt:  PIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP

AT2G27385.3 Pollen Ole e 1 allergen and extensin family protein (e value: 6.2e-22; identity: 35.33%)
Query:  FIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNNDSQAARPKCIAKLVGGPHQLYASRKDMVS
        F+   L    L++  +   + ++  +SC DC ++YD+SG  + V C          T   G F + LPS  +S      C A+L G   QLYAS+ ++ S
Subjt:  FIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNNDSQAARPKCIAKLVGGPHQLYASRKDMVS

Query:  PIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
         I+K     +   +      +C ++    S       SKT+DLP+PPEWGL PTSYY P LPIIGIP
Subjt:  PIIKATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP

AT5G22430.1 Pollen Ole e 1 allergen and extensin family protein (e value: 3.3e-23; identity: 38.41%)
Query:  LLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNNDSQAARPKCIAKLVGGPHQLYASRKDMVSPIIK
        L  L V +  +L+  + +   ISCLDC  ++DFSG  + +KCD  K    ++   DGSF + LP+ +   +    C+AKL+GGP QLYA + ++VS ++K
Subjt:  LLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNNDSQAARPKCIAKLVGGPHQLYASRKDMVSPIIK

Query:  AT-NSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
        +  +S   T +  L FS          +  +I DSKTI+ P    +G PP S +FP LPIIGIP
Subjt:  AT-NSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP


Sequences
CDS sequence
ATGGCTTCTCTAATTAAGCTCTTCATTATTCCTTTGCTCTTGCTCATGGTTCTTGCAACAACTCAACTTGCACAATGCAACACTTTGAAGGCCAACATCTCTTGCCTTGA
CTGTCAATCCAACTATGACTTCTCAGGAAATATGATCGCAGTAAAGTGCGACAAAGTGAAAAACCTAACCGTATCAATTACCAAAGTGGATGGATCATTTGAAACTAGGC
TTCCTTCCAACAACGACTCCCAAGCAGCTCGTCCCAAGTGCATAGCCAAGCTTGTAGGGGGACCTCATCAGCTCTACGCTTCAAGGAAAGACATGGTTTCCCCTATCATC
AAGGCAACAAACTCAAACTTCTTCACCATTGCCACTGCTCTCAAGTTCTCCACATGCAAACAAAATAGAAAGTGCTTATCCATGAAAGAGTTAATTGTAGATTCAAAGAC
CATTGATTTGCCTCTGCCACCTGAGTGGGGCCTGCCACCCACAAGCTACTATTTTCCTGTGCTTCCCATCATAGGCATCCCTTGA
mRNA sequence
CTTGACTTTGAGCACTAATCTATGGCTTCTCTAATTAAGCTCTTCATTATTCCTTTGCTCTTGCTCATGGTTCTTGCAACAACTCAACTTGCACAATGCAACACTTTGAA
GGCCAACATCTCTTGCCTTGACTGTCAATCCAACTATGACTTCTCAGGAAATATGATCGCAGTAAAGTGCGACAAAGTGAAAAACCTAACCGTATCAATTACCAAAGTGG
ATGGATCATTTGAAACTAGGCTTCCTTCCAACAACGACTCCCAAGCAGCTCGTCCCAAGTGCATAGCCAAGCTTGTAGGGGGACCTCATCAGCTCTACGCTTCAAGGAAA
GACATGGTTTCCCCTATCATCAAGGCAACAAACTCAAACTTCTTCACCATTGCCACTGCTCTCAAGTTCTCCACATGCAAACAAAATAGAAAGTGCTTATCCATGAAAGA
GTTAATTGTAGATTCAAAGACCATTGATTTGCCTCTGCCACCTGAGTGGGGCCTGCCACCCACAAGCTACTATTTTCCTGTGCTTCCCATCATAGGCATCCCTTGA
Protein sequence
MASLIKLFIIPLLLLMVLATTQLAQCNTLKANISCLDCQSNYDFSGNMIAVKCDKVKNLTVSITKVDGSFETRLPSNNDSQAARPKCIAKLVGGPHQLYASRKDMVSPII
KATNSNFFTIATALKFSTCKQNRKCLSMKELIVDSKTIDLPLPPEWGLPPTSYYFPVLPIIGIP
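
The CDS and protein sequences above can be cross-checked by translation. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used. `translate` is a hypothetical helper, and only the first and last codons of the CDS are used here to keep the example short.

```python
# Standard genetic code (assumed: NCBI translation table 1), laid out so that
# the codon with base indices (b1, b2, b3) in "TCAG" maps to AMINO[16*b1 + 4*b2 + b3].
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon.

    Raises ValueError if the length is not a multiple of 3 or if the
    sequence contains a character other than A, C, G, T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        b1, b2, b3 = (BASES.index(base) for base in cds[i:i + 3])
        residues.append(AMINO[16 * b1 + 4 * b2 + b3])
    return "".join(residues)

cds_start = "ATGGCTTCTCTAATTAAG"  # first 18 bases of the CDS above
cds_end = "ATCCCTTGA"             # last 9 bases of the CDS above

print(translate(cds_start))  # MASLIK
print(translate(cds_end))    # IP*
```

The translated ends agree with the protein sequence, which begins MASLIK and ends IP, followed by the TGA stop codon. Passing the full CDS, with line breaks removed, to the same function would allow a complete comparison.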