CuGenDBv2

Tan0000071 (gene) of Snake gourd v1 genome

Gene ID: Tan0000071
Organism: Trichosanthes anguina (Snake gourd v1)
Description: proline-rich protein HaeIII subfamily 1-like
Genome location: LG03:60572004..60572537
RNA-Seq Expression: Tan0000071
Synteny: Tan0000071
Gene Ontology terms: GO:0010227 - floral organ abscission (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR039639 - Protein IDA-like
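
The genome location field above has the form `seqid:start..end`. A minimal sketch of how such a string could be parsed and its span computed follows. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end = parse_location("LG03:60572004..60572537")
print(seqid, span_length(start, end))  # LG03 534
```

Under that assumption the span is 534 bp, which equals the length of the CDS sequence listed in this record (177 codons plus a stop codon, 178 × 3 = 534).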


Homology
GenBank top hits (e-value, % identity, alignment)
EOY16631.1 Uncharacterized protein TCM_035454 [Theobroma cacao] | e-value: 1.3e-21 | identity: 53.23%
Query:  LLGALALIFLGFHFTTNPTHARKLAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSL
        +L  +AL+FLG  + +   +AR + E  S+ NF ++P+ VPIPPSGP++R S +PPPP           NFGM+PKGVPIPPSGPS+RTS +SPPP    
Subjt:  LLGALALIFLGFHFTTNPTHARKLAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSL

Query:  DQFDFGMYPKGVPIPPSGPSQRTS
           +FG +PKGVPIPPSGPS+ TS
Subjt:  DQFDFGMYPKGVPIPPSGPSQRTS

KAG6603169.1 hypothetical protein SDJN03_03778, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 1.1e-28 | identity: 59.06%
Query:  PTHARK-LAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPP---SFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSL-DQFDFGMYPKGV
        P HA   +   +S+ NF M P+ VPIPPSGP+QR S  PPPP   S  I   + + NFGM PKGVPIPPSGPSQRTS+  PPP H L  + +FGM PKGV
Subjt:  PTHARK-LAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPP---SFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSL-DQFDFGMYPKGV

Query:  PIPPSGPSQRTSDDLHLLHHIPFIKLR
        PIPPSGPS+RTSD      H PFI LR
Subjt:  PIPPSGPSQRTSDDLHLLHHIPFIKLR

XP_008442275.1 PREDICTED: proline-rich protein HaeIII subfamily 1-like [Cucumis melo] | e-value: 1.4e-23 | identity: 49.32%
Query:  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEK-------------ESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVP
        MAS S  M+LG   L+ LGFH     THAR +                 S F F MYP+ + +PPSGP+QR S + PPP F I +N  +F+FGMYPKG+ 
Subjt:  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEK-------------ESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVP

Query:  IPPSGPSQRTSDSSPPPLHSLDQFDFGMYPKGVPIPPSGPSQRTSD
        +PPSGPSQRTSDSSPPP  +  +  FG     VP+PPSGP+  TSD
Subjt:  IPPSGPSQRTSDSSPPPLHSLDQFDFGMYPKGVPIPPSGPSQRTSD

XP_011654399.2 proline-rich protein HaeIII subfamily 1-like [Cucumis sativus] | e-value: 1.8e-23 | identity: 50.35%
Query:  MLLGALALIFLGFHFTTNPTHARKL-------------AEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPS
        M+LGA  L+ LGFH     THAR +                 S   F +Y + + IPPSGP+QR S + PPP F I   + +F+FGMYPKG+PIPPSGPS
Subjt:  MLLGALALIFLGFHFTTNPTHARKL-------------AEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPS

Query:  QRTSDSSPPPLHSL---DQFDFGMYPKGVPIPPSGPSQRTS
        QRTSDSSPPP  +    + F FGMY + VPIPPSG + RTS
Subjt:  QRTSDSSPPPLHSL---DQFDFGMYPKGVPIPPSGPSQRTS

XP_022967687.1 actin cytoskeleton-regulatory complex protein PAN1-like [Cucurbita maxima] | e-value: 6.8e-26 | identity: 57.26%
Query:  PTHARK-LAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSL-DQFDFGMYPKGVPIP
        P HA   +  K+S+ NF M P+ VPIPPSGP+ R S  PPPP   +    P+ NFGM PK VPIPPSGPSQRTSD  PPP H L  + +FGM PKGVPIP
Subjt:  PTHARK-LAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSL-DQFDFGMYPKGVPIP

Query:  PSGPSQRTSDDLHLLHHIPFIKLR
        P GPS+RTSD      + P I LR
Subjt:  PSGPSQRTSDDLHLLHHIPFIKLR

TrEMBL top hits (e-value, % identity, alignment)
A0A061FHV4 Uncharacterized protein | e-value: 6.4e-22 | identity: 53.23%
Query:  LLGALALIFLGFHFTTNPTHARKLAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSL
        +L  +AL+FLG  + +   +AR + E  S+ NF ++P+ VPIPPSGP++R S +PPPP           NFGM+PKGVPIPPSGPS+RTS +SPPP    
Subjt:  LLGALALIFLGFHFTTNPTHARKLAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSL

Query:  DQFDFGMYPKGVPIPPSGPSQRTS
           +FG +PKGVPIPPSGPS+ TS
Subjt:  DQFDFGMYPKGVPIPPSGPSQRTS

A0A1R3GMC3 Uncharacterized protein | e-value: 2.1e-20 | identity: 51.85%
Query:  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDS
        MA+  S ML+  L L+F+G+  T    +AR L+ +    NF M P+ VPIPPSGP+ R S   PPP        P  NF M PKGVPIPPSGPS RTS  
Subjt:  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDS

Query:  SPPPLHSLDQFDFGMYPKGVPIPPSGPSQRTSDDL
         PPP       +F M PKGVPIPPSGPS RTS D+
Subjt:  SPPPLHSLDQFDFGMYPKGVPIPPSGPSQRTSDDL

A0A1S3B4V4 proline-rich protein HaeIII subfamily 1-like | e-value: 6.8e-24 | identity: 49.32%
Query:  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEK-------------ESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVP
        MAS S  M+LG   L+ LGFH     THAR +                 S F F MYP+ + +PPSGP+QR S + PPP F I +N  +F+FGMYPKG+ 
Subjt:  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEK-------------ESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVP

Query:  IPPSGPSQRTSDSSPPPLHSLDQFDFGMYPKGVPIPPSGPSQRTSD
        +PPSGPSQRTSDSSPPP  +  +  FG     VP+PPSGP+  TSD
Subjt:  IPPSGPSQRTSDSSPPPLHSLDQFDFGMYPKGVPIPPSGPSQRTSD

A0A5D3C2C7 Proline-rich protein HaeIII subfamily 1-like | e-value: 6.8e-24 | identity: 49.32%
Query:  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEK-------------ESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVP
        MAS S  M+LG   L+ LGFH     THAR +                 S F F MYP+ + +PPSGP+QR S + PPP F I +N  +F+FGMYPKG+ 
Subjt:  MASMSSKMLLGALALIFLGFHFTTNPTHARKLAEK-------------ESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVP

Query:  IPPSGPSQRTSDSSPPPLHSLDQFDFGMYPKGVPIPPSGPSQRTSD
        +PPSGPSQRTSDSSPPP  +  +  FG     VP+PPSGP+  TSD
Subjt:  IPPSGPSQRTSDSSPPPLHSLDQFDFGMYPKGVPIPPSGPSQRTSD

A0A6J1HRH7 actin cytoskeleton-regulatory complex protein PAN1-like | e-value: 3.3e-26 | identity: 57.26%
Query:  PTHARK-LAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSL-DQFDFGMYPKGVPIP
        P HA   +  K+S+ NF M P+ VPIPPSGP+ R S  PPPP   +    P+ NFGM PK VPIPPSGPSQRTSD  PPP H L  + +FGM PKGVPIP
Subjt:  PTHARK-LAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGMYPKGVPIPPSGPSQRTSDSSPPPLHSL-DQFDFGMYPKGVPIP

Query:  PSGPSQRTSDDLHLLHHIPFIKLR
        P GPS+RTSD      + P I LR
Subjt:  PSGPSQRTSDDLHLLHHIPFIKLR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGCTAATAAATTCAGAGATTTAAAATTGATTAGGAGCCAAATGCTATATAAAGGAAGCATTGGAACAAGGGAAAACACAGCAAAAATGGCCTCAATGAGTTCTAAAAT
GCTTTTAGGAGCTTTGGCTCTTATTTTCTTAGGGTTTCATTTTACCACAAACCCAACACATGCTAGAAAACTTGCTGAGAAGGAATCAAGGTTTAACTTTGTGATGTATC
CCGAAAGAGTACCTATTCCCCCGTCGGGGCCAAATCAAAGACATTCATTTGCTCCTCCACCACCTTCCTTTACTATTTCGAAGAATGAACCCGAGTTCAATTTTGGGATG
TACCCAAAAGGTGTACCTATTCCTCCTTCTGGCCCGAGTCAAAGGACATCAGATTCATCTCCTCCTCCACTCCATTCATTGGATCAATTCGATTTCGGAATGTATCCCAA
AGGCGTGCCTATTCCTCCTTCTGGACCGAGTCAAAGGACATCTGACGATCTTCACCTCCTCCACCACATTCCATTTATCAAGCTTCGAGAATGA
mRNA sequence
ATGGCTAATAAATTCAGAGATTTAAAATTGATTAGGAGCCAAATGCTATATAAAGGAAGCATTGGAACAAGGGAAAACACAGCAAAAATGGCCTCAATGAGTTCTAAAAT
GCTTTTAGGAGCTTTGGCTCTTATTTTCTTAGGGTTTCATTTTACCACAAACCCAACACATGCTAGAAAACTTGCTGAGAAGGAATCAAGGTTTAACTTTGTGATGTATC
CCGAAAGAGTACCTATTCCCCCGTCGGGGCCAAATCAAAGACATTCATTTGCTCCTCCACCACCTTCCTTTACTATTTCGAAGAATGAACCCGAGTTCAATTTTGGGATG
TACCCAAAAGGTGTACCTATTCCTCCTTCTGGCCCGAGTCAAAGGACATCAGATTCATCTCCTCCTCCACTCCATTCATTGGATCAATTCGATTTCGGAATGTATCCCAA
AGGCGTGCCTATTCCTCCTTCTGGACCGAGTCAAAGGACATCTGACGATCTTCACCTCCTCCACCACATTCCATTTATCAAGCTTCGAGAATGA
Protein sequence
MANKFRDLKLIRSQMLYKGSIGTRENTAKMASMSSKMLLGALALIFLGFHFTTNPTHARKLAEKESRFNFVMYPERVPIPPSGPNQRHSFAPPPPSFTISKNEPEFNFGM
YPKGVPIPPSGPSQRTSDSSPPPLHSLDQFDFGMYPKGVPIPPSGPSQRTSDDLHLLHHIPFIKLRE
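
As a consistency check between the CDS and protein sequences above, the following is a minimal sketch of translating a coding sequence. It assumes the standard genetic code, which this record does not name. The function `translate` is a hypothetical helper, not part of any database tooling. Only the first 21 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCTAATAAATTCAGAGAT"))  # MANKFRD (start of the protein)
print(translate("AAGCTTCGAGAATGA"))        # KLRE (end of the protein, then stop)
```

Passing the full 534-nucleotide CDS to `translate` in the same way would allow a comparison against the complete 177-residue protein sequence.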