CuGenDBv2

Tan0006486 (gene) of Snake gourd v1 genome

Gene ID: Tan0006486
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pollen Ole e 1 allergen and extensin family protein
Genome location: LG06:6566164..6568413
RNA-Seq Expression: Tan0006486
Synteny: Tan0006486
Gene Ontology terms: GO:0071944 - cell periphery (cellular component)
InterPro domains: NA


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6575812.1 Proline-rich protein 3, partial [Cucurbita argyrosperma subsp. sororia] (e value: 7.0e-73; % identity: 84.02)
Query:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL
        MASL AV LSLL ++ASA  DDNNGGG YDLMTPKLAKE+RLLS+MIGI+GIILYK GST+ PL+G +ARITCK++DEYGYEAASYTFLSDS+D+NGYFL
Subjt:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL

Query:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS
        ATLSPSEVEDKR+LKECKAFLELSPLENCQ+PSDLNNGVSGA LHSYK LV+N MKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS

XP_022953725.1 proline-rich protein 3-like [Cucurbita moschata] (e value: 2.7e-72; % identity: 84.02)
Query:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL
        MASL AV LSLL ++ASA  DDNNGGG YDLMTPKLAKE+RLLS+MIGI+GIILYK GST+ PL+G +ARITCK++DEYGYEAASYTFLSDS+D+NGYFL
Subjt:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL

Query:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS
        ATLS SEVEDKR+LKECKAFLELSPLENCQ+PSDLNNGVSGALLHSYK LV+N MKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS

XP_022991245.1 proline-rich protein 3-like [Cucurbita maxima] (e value: 2.4e-73; % identity: 84.02)
Query:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL
        MASL AV LSLL ++ASA  DDNNGGG YDLMTP LAKE+RLLS+MIGI+GIILYKFGST++PL+G +ARITCK++DEYGYEAASYTFLSDS+D+NGYFL
Subjt:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL

Query:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS
        ATLSPSEV+DKR+LKECKAFLELSPLENCQ+PSDLNNGVSGALLHSYK LV+N MKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS

XP_023548997.1 proline-rich protein 3-like [Cucurbita pepo subsp. pepo] (e value: 2.3e-71; % identity: 82.25)
Query:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL
        MASL AV LSLL ++ASA  DD+NGGG YDLMTPKLA E+RLLS+MIGI+GIILYK GST+ PL+G +ARITCK++DEYGYEAASYTFLSDS+D+NGYFL
Subjt:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL

Query:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS
        ATLSPSEVEDKR+LKECKAFLE+SPLENCQ+PSDLNNGVSGALLHSY+ LV+N MKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS

XP_038896334.1 proline-rich protein 3-like [Benincasa hispida] (e value: 1.3e-71; % identity: 82.84)
Query:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL
        MAS++A+ LSL  +IASA  DDN G  NY LMTPK+ KEERLLS+MIGIEGIILYKFGST+APL G +ARITC+++DEYGYEAASYTFLSDS+D+NGYFL
Subjt:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL

Query:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS
        ATLSPSEVEDKR+LKECKAFLE+SPLENCQSPSDLNNGVSGALLHSYK LV+NNMKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0K5H5 Uncharacterized protein (e value: 4.3e-68; % identity: 79.07)
Query:  MASLNAVLLSLLFMIASADDNNG----GGNYDLMTPKLAK-EERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNG
        M SL+AV  SLL +       NG    GG+YD MTPKLAK +ERLLS+MIGIEGIILYKFGS+++PLQG +ARITCK++DEYGYEAASYTFLS+S+D NG
Subjt:  MASLNAVLLSLLFMIASADDNNG----GGNYDLMTPKLAK-EERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNG

Query:  YFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS
        YFLATLSPSEVEDKR+LKECKAFLE+SPLENCQSPSDLNNGVSGALLHSYKFLV+NNMKLFSVGPFLFTCQ+
Subjt:  YFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS

A0A1S3BRS8 proline-rich protein 3-like (e value: 1.1e-66; % identity: 79.07)
Query:  MASLNAVLLSLL---FMIASAD-DNNGGGNYDLMTPKLAK-EERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNG
        M SL+AV LSLL    ++ SA+ D   G +YD MT KLAK +ERLLS+MIGIEGIILYKFGS+++PLQG +ARITCK++DEYGYEAASYTFLS+S+D NG
Subjt:  MASLNAVLLSLL---FMIASAD-DNNGGGNYDLMTPKLAK-EERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNG

Query:  YFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS
        YFLATLSPSEVEDKR+LKECKAFLE+SPLENCQSPSDLNNGVSGALLHSYKFLV+NNMKLFSVGPFLFTCQ+
Subjt:  YFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS

A0A5A7VL80 Proline-rich protein 3-like (e value: 6.2e-67; % identity: 78.49)
Query:  MASLNAVLLSLLFMIASADDNNG----GGNYDLMTPKLAK-EERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNG
        M SL+AV LSLL +       NG    G +YD MT KLAK +ERLLS+MIGIEGIILYKFGS+++PLQG +ARITCK++DEYGYEAASYTFLS+S+D NG
Subjt:  MASLNAVLLSLLFMIASADDNNG----GGNYDLMTPKLAK-EERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNG

Query:  YFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS
        YFLATLSPSEVEDKR+LKECKAFLE+SPLENCQSPSDLNNGVSGALLHSYKFLV+NNMKLFSVGPFLFTCQ+
Subjt:  YFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS

A0A6J1GQG7 proline-rich protein 3-like (e value: 1.3e-72; % identity: 84.02)
Query:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL
        MASL AV LSLL ++ASA  DDNNGGG YDLMTPKLAKE+RLLS+MIGI+GIILYK GST+ PL+G +ARITCK++DEYGYEAASYTFLSDS+D+NGYFL
Subjt:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL

Query:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS
        ATLS SEVEDKR+LKECKAFLELSPLENCQ+PSDLNNGVSGALLHSYK LV+N MKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS

A0A6J1JSC5 proline-rich protein 3-like (e value: 1.2e-73; % identity: 84.02)
Query:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL
        MASL AV LSLL ++ASA  DDNNGGG YDLMTP LAKE+RLLS+MIGI+GIILYKFGST++PL+G +ARITCK++DEYGYEAASYTFLSDS+D+NGYFL
Subjt:  MASLNAVLLSLLFMIASA--DDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFL

Query:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS
        ATLSPSEV+DKR+LKECKAFLELSPLENCQ+PSDLNNGVSGALLHSYK LV+N MKLFSVGPFLFTCQS
Subjt:  ATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQS

SwissProt top hits (columns: e value, % identity, alignment)
O81417 Protein SEED AND ROOT HAIR PROTECTIVE PROTEIN (e value: 8.2e-24; % identity: 43.44)
Query:  IGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHS
        I +EGII  K G    P+QGA ARI C  +D YG E    + LS  TD+ GYF+AT+ PS++   R + +CK +L  SPL +C  P+D+N GV G  L +
Subjt:  IGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHS

Query:  YKFLVNNNMKLFSVGPFLFTCQ
        Y+ L + + KL+  GPF +T +
Subjt:  YKFLVNNNMKLFSVGPFLFTCQ

Q9FZ35 Proline-rich protein 1 (e value: 4.5e-14; % identity: 37.19)
Query:  IEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYK
        + GIIL K G    P+QGA A+I C     Y          SD TD  GYF   L+       + L  C+  L  SP+E C++P+++N G++G     Y 
Subjt:  IEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYK

Query:  FLVNNNMKLFSVGPFLFTCQS
           + N+KLF+VGPF FT  S
Subjt:  FLVNNNMKLFSVGPFLFTCQS

Q9LZJ7 Proline-rich protein 3 (e value: 1.8e-15; % identity: 39.17)
Query:  IEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGA--LLHS
        ++GIIL K G    P+ GA  +I C     YG         S+ TDS GYF   LS + ++D   L  C+  L LSP+E C++P+++N G++G    L+ 
Subjt:  IEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGA--LLHS

Query:  YKFLVNNNMKLFSVGPFLFT
        Y+F  + N++LFSVGPF +T
Subjt:  YKFLVNNNMKLFSVGPFLFT

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G54970.1 proline-rich protein 1 (e value: 3.2e-15; % identity: 37.19)
Query:  IEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYK
        + GIIL K G    P+QGA A+I C     Y          SD TD  GYF   L+       + L  C+  L  SP+E C++P+++N G++G     Y 
Subjt:  IEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYK

Query:  FLVNNNMKLFSVGPFLFTCQS
           + N+KLF+VGPF FT  S
Subjt:  FLVNNNMKLFSVGPFLFTCQS

AT2G47530.1 Pollen Ole e 1 allergen and extensin family protein (e value: 6.5e-16; % identity: 32.57)
Query:  ASLNAVLLSLLFMIASADDNNGGGNY----------DLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDST
        A+ N +LL+++ ++A+AD       Y           + TP L K     +  I IEG IL K G    P+QG   ++ C  +D YG   A  T  S  T
Subjt:  ASLNAVLLSLLFMIASADDNNGGGNY----------DLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDST

Query:  DSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALL--HSYKFLVNNNMKLFSVGPFLFT
        D  GYF   ++         +  CK  LE SP+  C++P+++N GV+GA L   + KFL ++N+ L+++ PF F+
Subjt:  DSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALL--HSYKFLVNNNMKLFSVGPFLFT

AT2G47540.1 Pollen Ole e 1 allergen and extensin family protein (e value: 1.6e-38; % identity: 57.46)
Query:  EERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKR---KLKECKAFLELSPLENCQSPSDL
        E  LLSSMIG++G+I  K GS + P+QGAVAR+TC+  DEYGYEA   T LS +TD+ GYFLATLS SEV+D +   K+KEC+AFLELSP + C  P+++
Subjt:  EERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKR---KLKECKAFLELSPLENCQSPSDL

Query:  NNGVSGALLHSYKFLVNN-NMKLFSVGPFLFTCQ
        N G+SGA+L +Y+ L N   MKLF+VGPF+F+ +
Subjt:  NNGVSGALLHSYKFLVNN-NMKLFSVGPFLFTCQ

AT3G62680.1 proline-rich protein 3 (e value: 1.3e-16; % identity: 39.17)
Query:  IEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGA--LLHS
        ++GIIL K G    P+ GA  +I C     YG         S+ TDS GYF   LS + ++D   L  C+  L LSP+E C++P+++N G++G    L+ 
Subjt:  IEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGA--LLHS

Query:  YKFLVNNNMKLFSVGPFLFT
        Y+F  + N++LFSVGPF +T
Subjt:  YKFLVNNNMKLFSVGPFLFT

AT4G02270.1 root hair specific 13 (e value: 5.8e-25; % identity: 43.44)
Query:  IGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHS
        I +EGII  K G    P+QGA ARI C  +D YG E    + LS  TD+ GYF+AT+ PS++   R + +CK +L  SPL +C  P+D+N GV G  L +
Subjt:  IGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKRKLKECKAFLELSPLENCQSPSDLNNGVSGALLHS

Query:  YKFLVNNNMKLFSVGPFLFTCQ
        Y+ L + + KL+  GPF +T +
Subjt:  YKFLVNNNMKLFSVGPFLFTCQ


Sequences
CDS sequence
ATGGCTTCTCTAAATGCAGTTTTATTGTCACTTTTGTTCATGATTGCTTCAGCTGATGATAACAATGGTGGTGGGAATTATGATCTTATGACACCCAAATTGGCAAAGGA
AGAAAGGCTTCTCTCTTCCATGATTGGTATTGAAGGAATTATTCTTTACAAATTTGGCTCAACAATGGCCCCTCTTCAAGGAGCTGTGGCAAGAATAACATGTAAATCAA
TGGATGAGTATGGTTATGAGGCAGCTTCTTACACTTTTTTAAGTGATTCAACTGATTCAAATGGCTACTTTTTGGCAACACTATCTCCATCAGAGGTAGAAGACAAGAGG
AAGTTGAAGGAATGCAAGGCTTTTTTAGAGCTCTCACCATTAGAGAACTGTCAATCTCCTTCTGACCTCAACAATGGAGTCTCTGGTGCTCTTCTCCATTCTTACAAATT
TTTGGTCAACAACAACATGAAACTCTTCTCTGTTGGGCCTTTCCTTTTCACTTGCCAAAGCTTGGATTAA
mRNA sequence
ATGGCTTCTCTAAATGCAGTTTTATTGTCACTTTTGTTCATGATTGCTTCAGCTGATGATAACAATGGTGGTGGGAATTATGATCTTATGACACCCAAATTGGCAAAGGA
AGAAAGGCTTCTCTCTTCCATGATTGGTATTGAAGGAATTATTCTTTACAAATTTGGCTCAACAATGGCCCCTCTTCAAGGAGCTGTGGCAAGAATAACATGTAAATCAA
TGGATGAGTATGGTTATGAGGCAGCTTCTTACACTTTTTTAAGTGATTCAACTGATTCAAATGGCTACTTTTTGGCAACACTATCTCCATCAGAGGTAGAAGACAAGAGG
AAGTTGAAGGAATGCAAGGCTTTTTTAGAGCTCTCACCATTAGAGAACTGTCAATCTCCTTCTGACCTCAACAATGGAGTCTCTGGTGCTCTTCTCCATTCTTACAAATT
TTTGGTCAACAACAACATGAAACTCTTCTCTGTTGGGCCTTTCCTTTTCACTTGCCAAAGCTTGGATTAA
Protein sequence
MASLNAVLLSLLFMIASADDNNGGGNYDLMTPKLAKEERLLSSMIGIEGIILYKFGSTMAPLQGAVARITCKSMDEYGYEAASYTFLSDSTDSNGYFLATLSPSEVEDKR
KLKECKAFLELSPLENCQSPSDLNNGVSGALLHSYKFLVNNNMKLFSVGPFLFTCQSLD
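
The relationship between the CDS and protein sequences listed above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame; the function name `translate` and the constants are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest). Assumption: the nuclear standard code applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS into a protein, stopping at the first stop codon.

    Whitespace and line breaks in the input are ignored. Codons containing
    characters other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGCTTCTCTAAATGCAGTT"))  # MASLNAV
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.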