; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021925 (gene) of Snake gourd v1 genome

Gene IDTan0021925
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionEpidermal patterning factor-like protein
Genome locationLG10:6584297..6585271
RNA-Seq ExpressionTan0021925
SyntenyTan0021925
Gene Ontology termsGO:0010052 - guard cell differentiation (biological process)
GO:0005576 - extracellular region (cellular component)
InterPro domainsIPR039455 - EPIDERMAL PATTERNING FACTOR-like protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597420.1 EPIDERMAL PATTERNING FACTOR-like protein 4, partial [Cucurbita argyrosperma subsp. sororia]1.5e-3065.38Show/hide
Query:  GSHHHFYISSNHYHFSLLLTLL--FLALLFHSVSSTPGSERWVGGADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNK
        GSH H    SNH +FSLLLTLL  F A+ FH  ++   ++RW  G      +GPGSSPPTCRS+CGRCWPC+PVHVPIQPG+S+PLEYYPEAWRC+CGNK
Subjt:  GSHHHFYISSNHYHFSLLLTLL--FLALLFHSVSSTPGSERWVGGADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNK

Query:  FFMP
         FMP
Subjt:  FFMP

KGN56945.1 hypothetical protein Csa_009821 [Cucumis sativus]4.1e-3170.21Show/hide
Query:  NHYHFSLLLTLLFLALLFHSVSSTPGSERWVGG--ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        +H+HFSLL+  LF   L  + SS+  S    GG   DRKSL+GPGSSPPTC +KCGRC PCEPVHVPIQPGLSLPLEYYPEAWRC+CGNK FMP
Subjt:  NHYHFSLLLTLLFLALLFHSVSSTPGSERWVGG--ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

XP_006284802.1 EPIDERMAL PATTERNING FACTOR-like protein 4 [Capsella rubella]5.7e-2562.24Show/hide
Query:  LLLTLLFLALL-FHSVSSTPGSERWVGG-----------ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        LL  L  LALL   S SS     RW+G             D K   GPGSSPPTCRSKCGRC PC+PVHVPIQPGLS+PLEYYPEAWRC+C NK FMP
Subjt:  LLLTLLFLALL-FHSVSSTPGSERWVGG-----------ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

XP_008438622.1 PREDICTED: EPIDERMAL PATTERNING FACTOR-like protein 4 [Cucumis melo]5.9e-3064.71Show/hide
Query:  NHYHFSLLLTLLFLALLFHSVSSTPGSER----------WVGGADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFF
        +H+HFSLL+  LF   L  + SS+  S            W+G  DRKSL+GPGSSPPTC +KCGRC PCEPVHVPIQPGLSLPLEYYPEAWRC+CGNK F
Subjt:  NHYHFSLLLTLLFLALLFHSVSSTPGSER----------WVGGADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFF

Query:  MP
        MP
Subjt:  MP

XP_038905188.1 EPIDERMAL PATTERNING FACTOR-like protein 4 [Benincasa hispida]1.7e-3269.23Show/hide
Query:  MGSHHHFYISSNHYHFSLLLTLLFLALLF-HSVSSTPGSERWVGGADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNK
        MGS  ++Y     YHFSLLL L  L  LF   VS++P S  W+G  DRKSL+GPGS PPTCR+KCGRC PCEPVHVPIQPGLS+PLEYYPEAWRC+CGNK
Subjt:  MGSHHHFYISSNHYHFSLLLTLLFLALLF-HSVSSTPGSERWVGGADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNK

Query:  FFMP
         FMP
Subjt:  FFMP

TrEMBL top hitse value%identityAlignment
A0A0A0L4P0 Epidermal patterning factor-like protein2.0e-3170.21Show/hide
Query:  NHYHFSLLLTLLFLALLFHSVSSTPGSERWVGG--ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        +H+HFSLL+  LF   L  + SS+  S    GG   DRKSL+GPGSSPPTC +KCGRC PCEPVHVPIQPGLSLPLEYYPEAWRC+CGNK FMP
Subjt:  NHYHFSLLLTLLFLALLFHSVSSTPGSERWVGG--ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

A0A1S3AWH2 Epidermal patterning factor-like protein2.8e-3064.71Show/hide
Query:  NHYHFSLLLTLLFLALLFHSVSSTPGSER----------WVGGADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFF
        +H+HFSLL+  LF   L  + SS+  S            W+G  DRKSL+GPGSSPPTC +KCGRC PCEPVHVPIQPGLSLPLEYYPEAWRC+CGNK F
Subjt:  NHYHFSLLLTLLFLALLFHSVSSTPGSER----------WVGGADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFF

Query:  MP
        MP
Subjt:  MP

A0A2H5PP20 Epidermal patterning factor-like protein4.7e-2565Show/hide
Query:  FLALLFHSVSSTPGSERWVGGADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        FL  +   + +    E+     D+K + GPGSSPPTCRSKCGRC PC+PVHVPIQPGLS+PLEYYPEAWRC+CGNK FMP
Subjt:  FLALLFHSVSSTPGSERWVGGADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

A0A2P5FWC0 Epidermal patterning factor-like protein4.7e-2556.76Show/hide
Query:  HHHFYISSNHYHFSLLLTLLFLALLFHSVSS-----TPGSERWVGGA------DRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAW
        HHH +   N   F  L T+  L LL  + +S     TP   R  G          K L GPGSSPPTCR KCGRC PC+PVHVPIQPGLS+PLEYYPEAW
Subjt:  HHHFYISSNHYHFSLLLTLLFLALLFHSVSS-----TPGSERWVGGA------DRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAW

Query:  RCRCGNKFFMP
        RC+CGNK FMP
Subjt:  RCRCGNKFFMP

R0GZF1 Epidermal patterning factor-like protein2.8e-2562.24Show/hide
Query:  LLLTLLFLALL-FHSVSSTPGSERWVGG-----------ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        LL  L  LALL   S SS     RW+G             D K   GPGSSPPTCRSKCGRC PC+PVHVPIQPGLS+PLEYYPEAWRC+C NK FMP
Subjt:  LLLTLLFLALL-FHSVSSTPGSERWVGG-----------ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

SwissProt top hitse value%identityAlignment
Q1PEY6 EPIDERMAL PATTERNING FACTOR-like protein 62.6e-2068.42Show/hide
Query:  RKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        R+ L G GSSPP C SKCGRC PC+PVHVP+ PG  +  EYYPEAWRC+CGNK +MP
Subjt:  RKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

Q2V3I3 EPIDERMAL PATTERNING FACTOR-like protein 41.7e-2760.61Show/hide
Query:  LLLTLLFLAL--LFHSVSSTPGSERWVG---GAD--------RKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        LL  L+  AL  LF + S      RW+G   G+D         K   GPGSSPPTCRSKCG+C PC+PVHVPIQPGLS+PLEYYPEAWRC+CGNK FMP
Subjt:  LLLTLLFLAL--LFHSVSSTPGSERWVG---GAD--------RKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

Q7M1E7 Polygalacturonase5.0e-0843.4Show/hide
Query:  IGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        I   SSPP C++KC  C PC+P  + + P  + P +YYP+ W C C NK + P
Subjt:  IGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

Q9FY19 Polygalacturonase2.9e-0841.38Show/hide
Query:  DRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        D K      SSPP C++KC  C PC+P  + + P  + P +YYP+ W C C NK + P
Subjt:  DRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

Q9LUH9 EPIDERMAL PATTERNING FACTOR-like protein 55.0e-2453.54Show/hide
Query:  LLFLALLFHSVSSTPGSERWVGG-----------------ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        +++  LLF S SS    +R  GG                  D+K L GPGS PP CR KCG+C PC+ VHVPIQPGL +PLEYYPEAWRC+CGNK FMP
Subjt:  LLFLALLFHSVSSTPGSERWVGG-----------------ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

Arabidopsis top hitse value%identityAlignment
AT2G30370.1 allergen-related1.8e-2168.42Show/hide
Query:  RKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        R+ L G GSSPP C SKCGRC PC+PVHVP+ PG  +  EYYPEAWRC+CGNK +MP
Subjt:  RKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

AT2G30370.2 allergen-related1.8e-2168.42Show/hide
Query:  RKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        R+ L G GSSPP C SKCGRC PC+PVHVP+ PG  +  EYYPEAWRC+CGNK +MP
Subjt:  RKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

AT3G13898.1 unknown protein1.4e-0546.81Show/hide
Query:  GSSPPTCRSKCGRCWPCEPVHVPI---QPGLSLP-LEYYPEAWRCRC
        GS PP+C  KC  C PCE +  P     P LS     Y PE WRC C
Subjt:  GSSPPTCRSKCGRCWPCEPVHVPI---QPGLSLP-LEYYPEAWRCRC

AT3G22820.1 allergen-related3.6e-2553.54Show/hide
Query:  LLFLALLFHSVSSTPGSERWVGG-----------------ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        +++  LLF S SS    +R  GG                  D+K L GPGS PP CR KCG+C PC+ VHVPIQPGL +PLEYYPEAWRC+CGNK FMP
Subjt:  LLFLALLFHSVSSTPGSERWVGG-----------------ADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP

AT4G14723.1 BEST Arabidopsis thaliana protein match is: allergen-related (TAIR:AT3G22820.1)1.2e-2860.61Show/hide
Query:  LLLTLLFLAL--LFHSVSSTPGSERWVG---GAD--------RKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP
        LL  L+  AL  LF + S      RW+G   G+D         K   GPGSSPPTCRSKCG+C PC+PVHVPIQPGLS+PLEYYPEAWRC+CGNK FMP
Subjt:  LLLTLLFLAL--LFHSVSSTPGSERWVG---GAD--------RKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCACACCACCATTTCTATATTTCCTCCAATCATTACCATTTCTCTCTTCTTCTCACTCTCCTCTTCCTCGCCCTCCTCTTCCACTCCGTTTCCTCGACACCAGG
AAGCGAGCGGTGGGTTGGAGGAGCGGATCGGAAGAGTTTGATTGGGCCGGGATCGTCGCCACCCACGTGTCGGTCCAAGTGTGGGAGATGCTGGCCGTGTGAACCGGTTC
ATGTCCCAATTCAACCGGGGCTGAGTCTGCCATTGGAGTATTACCCTGAAGCTTGGAGATGCAGATGTGGGAATAAGTTCTTCATGCCTTAG
mRNA sequenceShow/hide mRNA sequence
CAGACCTCCAATGACGCTCCAAAGGCCCCCACCAGGTTCAGCCAGACAATAAGACAAACCCCATTACTAACCCTCTCTCTCTCTCTCTCTTTGCAACCCTTTTGGGCTCT
GTCTTTGACTTCAAATCCTCTCTAAATAGAGAGAAAATAAATAAATAAATAATAAAAAAATAAAATAAAACGATAAGAGAGAAGCAGAGGAATTCAGAAAACCAGCATGC
AGAAGCAGAAGCAGAAGCAGATGAGCAGTGCTTCCAATTTCTGAATGGGATCACACCACCATTTCTATATTTCCTCCAATCATTACCATTTCTCTCTTCTTCTCACTCTC
CTCTTCCTCGCCCTCCTCTTCCACTCCGTTTCCTCGACACCAGGAAGCGAGCGGTGGGTTGGAGGAGCGGATCGGAAGAGTTTGATTGGGCCGGGATCGTCGCCACCCAC
GTGTCGGTCCAAGTGTGGGAGATGCTGGCCGTGTGAACCGGTTCATGTCCCAATTCAACCGGGGCTGAGTCTGCCATTGGAGTATTACCCTGAAGCTTGGAGATGCAGAT
GTGGGAATAAGTTCTTCATGCCTTAGACAAAATTCAACCACAAACAACAAAAATAGTCATTTTCTTTTTGGGGTTTGTTTCGTTTAGAGCCAACAAAAAAAAAAGTTGTT
TGTGATTTTTTTTGTGTGTGTGTATATATATTGATCTTCTGCTTGGTGGTTGTTATTGGTGGCGTTGGTATGATTGAAACTGGGCAGAGGAATTTGAAGAG
Protein sequenceShow/hide protein sequence
MGSHHHFYISSNHYHFSLLLTLLFLALLFHSVSSTPGSERWVGGADRKSLIGPGSSPPTCRSKCGRCWPCEPVHVPIQPGLSLPLEYYPEAWRCRCGNKFFMP