; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008940 (gene) of Snake gourd v1 genome

Gene IDTan0008940
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProline-rich protein HaeIII subfamily 1-like
Genome locationLG03:44122666..44123367
RNA-Seq ExpressionTan0008940
SyntenyTan0008940
Gene Ontology termsGO:0010227 - floral organ abscission (biological process)
InterPro domainsIPR039639 - Protein IDA-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040932.1 proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa]7.3e-3648.08Show/hide
Query:  MGSSTCNGLFGACAFIFLGLHLTLYQTDARNIVVTNEENTIAIKELVNDVVNHPKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRI
        M S   N  F ACA IFLGLH  +Y T+A+   V +E+N+     L  D++NHPK +    +   K  PD +  PP S  +            L KGIR 
Subjt:  MGSSTCNGLFGACAFIFLGLHLTLYQTDARNIVVTNEENTIAIKELVNDVVNHPKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRI

Query:  PPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQP-
        PPSG SQ TS+  PP P V  +++ KES ++FG+LPKGVRIPPSG S+RTS+ PPP  +  S+   KESR+ +G+LPKGV IPPSGPS+R SDY PP P 
Subjt:  PPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQP-

Query:  --HVPSIV
          H PSI+
Subjt:  --HVPSIV

KAG6603169.1 hypothetical protein SDJN03_03778, partial [Cucurbita argyrosperma subsp. sororia]8.8e-6657.32Show/hide
Query:  LYQTDARNIVVTNEENTIAIKELVNDVVNHPKGL------------------------------PIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRM
        +Y+TDA+ +V  +  +TI  +EL + +V HPKG+                              PI P G ++ T D   PPP +SSVIL+K++K+NF M
Subjt:  LYQTDARNIVVTNEENTIAIKELVNDVVNHPKGL------------------------------PIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRM

Query:  LPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPP-HAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTS
        LPKG+ IPPSGPSQ+TSDYPPPPPR  SV++ K+SKI+FGMLPKGV IPPSG SQRTSNYPPPPP HA SVIL  +S++NFGMLPKGV IPPSGPSQRTS
Subjt:  LPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPP-HAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTS

Query:  DYPPPQPHVPSIVLEKESKFNFGMYPRN-PIPPSNPSGR
        DYPPP PH  S +L  +SK NFGM P+  PIPPS PS R
Subjt:  DYPPPQPHVPSIVLEKESKFNFGMYPRN-PIPPSNPSGR

XP_022967687.1 actin cytoskeleton-regulatory complex protein PAN1-like [Cucurbita maxima]9.4e-6052.94Show/hide
Query:  LYQTDARNIVVTNEENTIAIKELVNDVVNHPKGL------------------------------PIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRM
        +Y+TDA+N+V  + ++ I   EL + +V +PKG+                              PIPP   ++ T D   PPP +SS+IL K +K+N  M
Subjt:  LYQTDARNIVVTNEENTIAIKELVNDVVNHPKGL------------------------------PIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRM

Query:  LPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSD
        LP+G+ IPPSGPSQ+TSDYPPPPP   SV++ K+SKI+FGMLPKGV IPPSG SQRTS YPPPPP A SVIL K+S++  GMLP+GV IPP G SQRTSD
Subjt:  LPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSD

Query:  YPPPQPHVPSIVLEKESKFNFGMYPRN-PIPPSNPSGR
        YP P PH  S++L K+SK NFGM P+  PIPPS PS R
Subjt:  YPPPQPHVPSIVLEKESKFNFGMYPRN-PIPPSNPSGR

XP_031744042.1 proline-rich receptor-like protein kinase PERK9 [Cucumis sativus]1.7e-4045.38Show/hide
Query:  FGACAFIFLGLHLTLYQTDARNIVVTNEENTIAIKELVNDVVNHP---------------------------KGLPIPPFGSNKITPDSSSPPPLSSSVI
        F  C  + LGL   LYQT+  N+   NEEN++   E    V+ HP                           K + IPP G ++ + DS+ PP    S++
Subjt:  FGACAFIFLGLHLTLYQTDARNIVVTNEENTIAIKELVNDVVNHP---------------------------KGLPIPPFGSNKITPDSSSPPPLSSSVI

Query:  LTKEAKVNFRMLPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHI
        L KE+ +NF +LPKG+    SGPSQ+ SD PP PP   S+V+ KES+I FG+LPKGV    SG S+R S+ PPPPP  PS++L KESR+NFG+L KGV  
Subjt:  LTKEAKVNFRMLPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHI

Query:  PPSGPSQRTSDYPPPQPHVPSIVLEKESKFNFGMYPRN-PIPPSNPSGR
          SGPS+R SD PPP P  PSIVL KES+ NFG+ P+  P   S PS R
Subjt:  PPSGPSQRTSDYPPPQPHVPSIVLEKESKFNFGMYPRN-PIPPSNPSGR

XP_038882352.1 uncharacterized protein LOC120073615 [Benincasa hispida]4.5e-4651.98Show/hide
Query:  STCNGLFGACAFIFLGLHLTLYQTDARNIVVTNEENTIAIKELVNDVVNHPKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRIPPS
        S  N  FGACAFIFLGLH T+YQTDA+N+ V  + ++    EL ++ + HPK       G  K  PD + P               N  +L K  R+PPS
Subjt:  STCNGLFGACAFIFLGLHLTLYQTDARNIVVTNEENTIAIKELVNDVVNHPKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRIPPS

Query:  GPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQPHVPS
        G SQ TSD PPPPP V+S+++ KES+I+F +L KG RIPPSG SQRTS  PPPPPHA SVIL K+  +NFG+LPK +HIPPSGPS+R S+YP P  H PS
Subjt:  GPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQPHVPS

Query:  IV
        ++
Subjt:  IV

TrEMBL top hitse value%identityAlignment
A0A5D3BP86 Proline-rich protein HaeIII subfamily 1-like6.0e-3645.85Show/hide
Query:  MGSSTCNGLFGACAFIFLGLHLTLYQTDARNIVVTNEENTIAIKELVNDVVNHPKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRI
        M S + N  F  CA + LGLH  +YQT+ RN+    E+N++   EL  + + HPK      +    +T     P           +    F +  K +RI
Subjt:  MGSSTCNGLFGACAFIFLGLHLTLYQTDARNIVVTNEENTIAIKELVNDVVNHPKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRI

Query:  PPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQPH
        PPSGPSQ++SD PP P    S+++ KES+I+FG+LPKG RIPPSG SQR S+ P P    PS  L K S + FGMLPKG HIPPSGPS+RTSD PPP PH
Subjt:  PPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQPH

Query:  VPSIV
         PS++
Subjt:  VPSIV

A0A5D3CGG1 Proline-rich protein HaeIII subfamily 1-like3.5e-3648.08Show/hide
Query:  MGSSTCNGLFGACAFIFLGLHLTLYQTDARNIVVTNEENTIAIKELVNDVVNHPKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRI
        M S   N  F ACA IFLGLH  +Y T+A+   V +E+N+     L  D++NHPK +    +   K  PD +  PP S  +            L KGIR 
Subjt:  MGSSTCNGLFGACAFIFLGLHLTLYQTDARNIVVTNEENTIAIKELVNDVVNHPKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRI

Query:  PPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQP-
        PPSG SQ TS+  PP P V  +++ KES ++FG+LPKGVRIPPSG S+RTS+ PPP  +  S+   KESR+ +G+LPKGV IPPSGPS+R SDY PP P 
Subjt:  PPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQP-

Query:  --HVPSIV
          H PSI+
Subjt:  --HVPSIV

A0A6A4PCZ2 Uncharacterized protein3.4e-3150.56Show/hide
Query:  PKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNY
        PKG+P+PP G +  T D   PPP      +   + +NF MLPKG R PPSGPS KTSD PPPPP+     ++  S I+FGMLPKG   PPSG S +TS+ 
Subjt:  PKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNY

Query:  PPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQPHVPSIVLEKESKFNFGMYPRNPIPPSNPS-GRIP
        PPPP   P   +   S +NFGMLPKG   PPSG S +TSD PPP P   +   E  S  NFGM P+  +PPS PS G+ P
Subjt:  PPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQPHVPSIVLEKESKFNFGMYPRNPIPPSNPS-GRIP

A0A6A5P9E6 Uncharacterized protein3.4e-3150.56Show/hide
Query:  PKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNY
        PKG+P+PP G +  T D   PPP      +   + +NF MLPKG R PPSGPS KTSD PPPPP+     ++  S I+FGMLPKG   PPSG S +TS+ 
Subjt:  PKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNY

Query:  PPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQPHVPSIVLEKESKFNFGMYPRNPIPPSNPS-GRIP
        PPPP   P   +   S +NFGMLPKG   PPSG S +TSD PPP P   +   E  S  NFGM P+  +PPS PS G+ P
Subjt:  PPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQPHVPSIVLEKESKFNFGMYPRNPIPPSNPS-GRIP

A0A6J1HRH7 actin cytoskeleton-regulatory complex protein PAN1-like4.6e-6052.94Show/hide
Query:  LYQTDARNIVVTNEENTIAIKELVNDVVNHPKGL------------------------------PIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRM
        +Y+TDA+N+V  + ++ I   EL + +V +PKG+                              PIPP   ++ T D   PPP +SS+IL K +K+N  M
Subjt:  LYQTDARNIVVTNEENTIAIKELVNDVVNHPKGL------------------------------PIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRM

Query:  LPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSD
        LP+G+ IPPSGPSQ+TSDYPPPPP   SV++ K+SKI+FGMLPKGV IPPSG SQRTS YPPPPP A SVIL K+S++  GMLP+GV IPP G SQRTSD
Subjt:  LPKGIRIPPSGPSQKTSDYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSD

Query:  YPPPQPHVPSIVLEKESKFNFGMYPRN-PIPPSNPSGR
        YP P PH  S++L K+SK NFGM P+  PIPPS PS R
Subjt:  YPPPQPHVPSIVLEKESKFNFGMYPRN-PIPPSNPSGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTCGAGTACTTGCAATGGGTTATTCGGAGCTTGTGCTTTCATATTTCTGGGCTTACACCTTACACTTTATCAAACGGACGCTAGGAACATTGTTGTAACCAATGA
AGAAAACACCATTGCAATAAAGGAACTGGTTAACGACGTCGTAAACCATCCAAAGGGCCTCCCCATTCCTCCGTTTGGATCGAATAAAATAACACCTGATTCGTCATCTC
CTCCACCACTTTCTTCTTCAGTCATTCTTACGAAGGAAGCTAAGGTAAACTTTAGAATGTTACCAAAAGGCATACGCATTCCTCCGTCTGGGCCAAGCCAAAAGACATCA
GACTATCCACCTCCTCCACCGCGTGTTTTATCAGTTGTTGTAAAGAAAGAATCTAAGATCAGCTTTGGAATGTTACCAAAAGGAGTACGTATTCCCCCTTCTGGGTCGAG
TCAAAGGACATCGAACTATCCACCTCCTCCACCTCATGCTCCATCCGTCATTTTAATGAAGGAATCTAGGGTCAATTTTGGAATGTTACCCAAAGGCGTTCATATTCCTC
CATCTGGGCCGAGTCAAAGGACTTCCGACTATCCACCCCCTCAACCGCATGTTCCTTCTATTGTTTTGGAGAAAGAATCAAAGTTTAATTTCGGGATGTATCCCAGAAAC
CCTATTCCTCCATCAAACCCGAGTGGAAGGATACCAATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTCGAGTACTTGCAATGGGTTATTCGGAGCTTGTGCTTTCATATTTCTGGGCTTACACCTTACACTTTATCAAACGGACGCTAGGAACATTGTTGTAACCAATGA
AGAAAACACCATTGCAATAAAGGAACTGGTTAACGACGTCGTAAACCATCCAAAGGGCCTCCCCATTCCTCCGTTTGGATCGAATAAAATAACACCTGATTCGTCATCTC
CTCCACCACTTTCTTCTTCAGTCATTCTTACGAAGGAAGCTAAGGTAAACTTTAGAATGTTACCAAAAGGCATACGCATTCCTCCGTCTGGGCCAAGCCAAAAGACATCA
GACTATCCACCTCCTCCACCGCGTGTTTTATCAGTTGTTGTAAAGAAAGAATCTAAGATCAGCTTTGGAATGTTACCAAAAGGAGTACGTATTCCCCCTTCTGGGTCGAG
TCAAAGGACATCGAACTATCCACCTCCTCCACCTCATGCTCCATCCGTCATTTTAATGAAGGAATCTAGGGTCAATTTTGGAATGTTACCCAAAGGCGTTCATATTCCTC
CATCTGGGCCGAGTCAAAGGACTTCCGACTATCCACCCCCTCAACCGCATGTTCCTTCTATTGTTTTGGAGAAAGAATCAAAGTTTAATTTCGGGATGTATCCCAGAAAC
CCTATTCCTCCATCAAACCCGAGTGGAAGGATACCAATATAA
Protein sequenceShow/hide protein sequence
MGSSTCNGLFGACAFIFLGLHLTLYQTDARNIVVTNEENTIAIKELVNDVVNHPKGLPIPPFGSNKITPDSSSPPPLSSSVILTKEAKVNFRMLPKGIRIPPSGPSQKTS
DYPPPPPRVLSVVVKKESKISFGMLPKGVRIPPSGSSQRTSNYPPPPPHAPSVILMKESRVNFGMLPKGVHIPPSGPSQRTSDYPPPQPHVPSIVLEKESKFNFGMYPRN
PIPPSNPSGRIPI