; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006059 (gene) of Snake gourd v1 genome

Gene IDTan0006059
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMucin-2 isoform X2
Genome locationLG04:83473874..83478222
RNA-Seq ExpressionTan0006059
SyntenyTan0006059
Gene Ontology termsGO:0017053 - transcriptional repressor complex (cellular component)
InterPro domainsIPR028226 - Protein LIN37


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606861.1 hypothetical protein SDJN03_00203, partial [Cucurbita argyrosperma subsp. sororia]6.7e-16389.34Show/hide
Query:  MTVDSPLIPTNTTAATT---TNTVAPISFTLTTAAATTTAA-IARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSILY
        M  DS  I TNTT A     T TVAP S TLTTAAATTTAA IARP+ANQAPSRP+SSIPQTHHLHYPPQALYAAQ IP+RTPNHQLPK+QQDASQ+ILY
Subjt:  MTVDSPLIPTNTTAATT---TNTVAPISFTLTTAAATTTAA-IARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSILY

Query:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTIYES
        PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTF HRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI ES
Subjt:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTIYES

Query:  NGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLK
        NGCKEMR+RDDAL V+RDRKVRITDGASLYALCRSWLRNGS EE QPQYGNFLR LPRPLPIPVAGAV +QKKEV+ +EVDEEDKDEGSIE LS+QDLLK
Subjt:  NGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLK

Query:  RHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS

XP_022934169.1 uncharacterized protein LOC111441417 [Cucurbita moschata]1.3e-16388.57Show/hide
Query:  MTVDSPLIPTNTT------AATTTNTVAPISFTLTTAAATTT-AAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQS
        M VDSPLIPTNTT      AATTT TVAPISFTLTTAAATT+ AAIARP+ANQAPSRP+SSIPQTHHLHYPPQALY AQ IPVRTPN QLPKLQQDASQ+
Subjt:  MTVDSPLIPTNTT------AATTTNTVAPISFTLTTAAATTT-AAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQS

Query:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI
        ILYPVASSGRGFVPRPIRPLP DQ VTVANPGGYPHRPVV+F HRPIGSPHLDSMSHPMH+ARPPNL QQLI  SGS+ISGSIKGAPNSSDPKVFPPSTI
Subjt:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI

Query:  YESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQD
         E+NGCKEMR+RDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPI V GAVP+QKKEV+EE VDE+DKDE SIE LSTQ+
Subjt:  YESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQD

Query:  LLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
        LLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  LLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS

XP_022949214.1 uncharacterized protein LOC111452632 [Cucurbita moschata]6.7e-16389.34Show/hide
Query:  MTVDSPLIPTNTT---AATTTNTVAPISFTLTTAAATTTAA-IARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSILY
        M  DS  I TNTT   A   T TVAP S TLTTAAATTTAA IARP+ANQAPSRP+SSIPQTHHLHYPPQALYAAQ IP+RTPNHQLPK+QQDASQ+ILY
Subjt:  MTVDSPLIPTNTT---AATTTNTVAPISFTLTTAAATTTAA-IARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSILY

Query:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTIYES
        PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTF HRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI ES
Subjt:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTIYES

Query:  NGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLK
        NGCKEMR+RDDAL V+RDRKVRITDGASLYALCRSWLRNGS EE QPQYGNFLR LPRPLPIPVAGAV +QKKEV+ +EVDEEDKDEGSIE LS+QDLLK
Subjt:  NGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLK

Query:  RHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS

XP_022998467.1 uncharacterized protein LOC111493092 [Cucurbita maxima]4.6e-16489.63Show/hide
Query:  MTVDSPLIPTNTT---AATTTNTVAPISFTLTTAAATTT-AAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSILY
        M  DS  I TNTT    A TT TVAP S TLTTAAATTT AAIARP+ANQAPSRP+SSIPQTHHLHYPPQALYAAQ IP+RTPNHQLPKLQQDASQ+ILY
Subjt:  MTVDSPLIPTNTT---AATTTNTVAPISFTLTTAAATTT-AAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSILY

Query:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTIYES
        PVASSGRGFVPRPIRPLP DQAVTVANPGGYPHRP VTF HRPIGSPHLDSMSHPMHLARPPNLPQQLIS+SGSAISGSIKGAPNSSDPKVFPPSTI ES
Subjt:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTIYES

Query:  NGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLK
        NGCKEMR+RDDAL V+RDRKVRITDGASLYALCRSWLRNGSQEE QPQYGNFLR LPRPLPIPVAGAVP+QKKEV+ +EVDEEDKDEGSIE LS+QDLLK
Subjt:  NGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLK

Query:  RHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS

XP_038903387.1 uncharacterized protein LOC120089997 [Benincasa hispida]6.7e-16388.32Show/hide
Query:  MTVDSPLIPTNTTAAT-TTNTVAPISFTLTTAAATTT-----AAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSI
        M VDSPLIPTNTTAA  +T TV PISFTLTTAAATTT     AAIARP+ANQAPSRP+SSIPQTHHLHYPPQALYAAQ IPVRTPN+QLPKL QDASQ+I
Subjt:  MTVDSPLIPTNTTAAT-TTNTVAPISFTLTTAAATTT-----AAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSI

Query:  LYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNL--PQQLISLSGSAISGSIKGAPNSSDPKVFPPST
        LYPVASSGRGFVPRPIRPLPADQ VT+ANPGGY +RPVVTF HRPIGS HLDSMSHPMH+ARPPNL   QQLI  SGS+ISGSIKG PNSSDPKVF PST
Subjt:  LYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNL--PQQLISLSGSAISGSIKGAPNSSDPKVFPPST

Query:  IYESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQ
        I ESNGCKEMR+RDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPI  AGAVP+QKKEV+EEEVDEEDKDEGSIEHLSTQ
Subjt:  IYESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQ

Query:  DLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
        +LLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDN+TGS
Subjt:  DLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS

TrEMBL top hitse value%identityAlignment
A0A1S3CFG6 uncharacterized protein LOC103500291 isoform X14.7e-16285.15Show/hide
Query:  MTVDSPLIPTNTTAA--------TTTNTVAPISFTLT------TAAATTTAAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKL
        M VDSPLIPTNTT+A        TTT TV PISFTLT      T AA  TAAIARP+ANQAPSRP+SSIPQTHHLHYP QALY  Q IPVRTPN QLPKL
Subjt:  MTVDSPLIPTNTTAA--------TTTNTVAPISFTLT------TAAATTTAAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKL

Query:  QQDASQSILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPK
         QDASQ+ILYPVASSGRGFVPRPIRPLP DQAVT+ANPGGYPHRPVVTF HRPIGSPHLDSMSHPMH+ RPPNL QQLI  SGS+ISGSIKGAPNSSDPK
Subjt:  QQDASQSILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPK

Query:  VFPPSTIYESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSI
         FPPSTI ESNGCKEMR+RDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPI VAGA P+QKKEV++EEVDEEDKDEGSI
Subjt:  VFPPSTIYESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSI

Query:  EHLSTQDLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
        EHLSTQ+LLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDN+TGS
Subjt:  EHLSTQDLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS

A0A5A7TZA2 Mucin-2 isoform X24.7e-16285.15Show/hide
Query:  MTVDSPLIPTNTTAA--------TTTNTVAPISFTLT------TAAATTTAAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKL
        M VDSPLIPTNTT+A        TTT TV PISFTLT      T AA  TAAIARP+ANQAPSRP+SSIPQTHHLHYP QALY  Q IPVRTPN QLPKL
Subjt:  MTVDSPLIPTNTTAA--------TTTNTVAPISFTLT------TAAATTTAAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKL

Query:  QQDASQSILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPK
         QDASQ+ILYPVASSGRGFVPRPIRPLP DQAVT+ANPGGYPHRPVVTF HRPIGSPHLDSMSHPMH+ RPPNL QQLI  SGS+ISGSIKGAPNSSDPK
Subjt:  QQDASQSILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPK

Query:  VFPPSTIYESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSI
         FPPSTI ESNGCKEMR+RDD LCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPI VAGA P+QKKEV++EEVDEEDKDEGSI
Subjt:  VFPPSTIYESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSI

Query:  EHLSTQDLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
        EHLSTQ+LLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPP+EQLRTDN+TGS
Subjt:  EHLSTQDLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1F1T6 uncharacterized protein LOC1114414176.5e-16488.57Show/hide
Query:  MTVDSPLIPTNTT------AATTTNTVAPISFTLTTAAATTT-AAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQS
        M VDSPLIPTNTT      AATTT TVAPISFTLTTAAATT+ AAIARP+ANQAPSRP+SSIPQTHHLHYPPQALY AQ IPVRTPN QLPKLQQDASQ+
Subjt:  MTVDSPLIPTNTT------AATTTNTVAPISFTLTTAAATTT-AAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQS

Query:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI
        ILYPVASSGRGFVPRPIRPLP DQ VTVANPGGYPHRPVV+F HRPIGSPHLDSMSHPMH+ARPPNL QQLI  SGS+ISGSIKGAPNSSDPKVFPPSTI
Subjt:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI

Query:  YESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQD
         E+NGCKEMR+RDDALCVVRDRKV ITDGASLYALCRSWLRNGSQEESQPQYG+FLRSLPRPLPI V GAVP+QKKEV+EE VDE+DKDE SIE LSTQ+
Subjt:  YESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQD

Query:  LLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
        LLKRHV+RAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  LLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1GC68 uncharacterized protein LOC1114526323.2e-16389.34Show/hide
Query:  MTVDSPLIPTNTT---AATTTNTVAPISFTLTTAAATTTAA-IARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSILY
        M  DS  I TNTT   A   T TVAP S TLTTAAATTTAA IARP+ANQAPSRP+SSIPQTHHLHYPPQALYAAQ IP+RTPNHQLPK+QQDASQ+ILY
Subjt:  MTVDSPLIPTNTT---AATTTNTVAPISFTLTTAAATTTAA-IARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSILY

Query:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTIYES
        PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTF HRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTI ES
Subjt:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTIYES

Query:  NGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLK
        NGCKEMR+RDDAL V+RDRKVRITDGASLYALCRSWLRNGS EE QPQYGNFLR LPRPLPIPVAGAV +QKKEV+ +EVDEEDKDEGSIE LS+QDLLK
Subjt:  NGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLK

Query:  RHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS

A0A6J1K833 uncharacterized protein LOC1114930922.2e-16489.63Show/hide
Query:  MTVDSPLIPTNTT---AATTTNTVAPISFTLTTAAATTT-AAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSILY
        M  DS  I TNTT    A TT TVAP S TLTTAAATTT AAIARP+ANQAPSRP+SSIPQTHHLHYPPQALYAAQ IP+RTPNHQLPKLQQDASQ+ILY
Subjt:  MTVDSPLIPTNTT---AATTTNTVAPISFTLTTAAATTT-AAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSILY

Query:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTIYES
        PVASSGRGFVPRPIRPLP DQAVTVANPGGYPHRP VTF HRPIGSPHLDSMSHPMHLARPPNLPQQLIS+SGSAISGSIKGAPNSSDPKVFPPSTI ES
Subjt:  PVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTIYES

Query:  NGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLK
        NGCKEMR+RDDAL V+RDRKVRITDGASLYALCRSWLRNGSQEE QPQYGNFLR LPRPLPIPVAGAVP+QKKEV+ +EVDEEDKDEGSIE LS+QDLLK
Subjt:  NGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLK

Query:  RHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS
        RHVKRAKKVRSRLREERL RIERYKTRLALLLPPPVEQLRTDNITGS
Subjt:  RHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTDNITGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G04930.1 hydroxyproline-rich glycoprotein family protein6.2e-4239.12Show/hide
Query:  IPTNTTAATTTNTVAPISFTLTTAAAT--TTAAIARPVANQAPSRPVSSIPQTHH----LHYP----PQALYA----AQPIPVRTPNHQLPKLQQDASQS
        IP   ++A+ T     +S +L+TA+ T  T     RP  +Q P  P    P T+     L +P     Q+ Y+    A  IPVR       +  QD S +
Subjt:  IPTNTTAATTTNTVAPISFTLTTAAAT--TTAAIARPVANQAPSRPVSSIPQTHH----LHYP----PQALYA----AQPIPVRTPNHQLPKLQQDASQS

Query:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNL-PQQLISLSGSAISGSIKGAPNSSDPKVFPPST
        +LYP A  GRGF  RP+R   AD +VT  N  GYP RP  T+   P     ++S+       R P + P   + L      G I+ +P    P+V PP T
Subjt:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNL-PQQLISLSGSAISGSIKGAPNSSDPKVFPPST

Query:  -IYESNGCKEMRIRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVA--GAVPTQKKEVIEEEVDEEDKDEGSIEH
         I +++  ++ R +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ +    +VP       EE  DE+ +DE +++ 
Subjt:  -IYESNGCKEMRIRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVA--GAVPTQKKEVIEEEVDEEDKDEGSIEH

Query:  LSTQDLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLL
        LS +DLLKRH++RAKKVR++LREER +RI RYK R+ L+L
Subjt:  LSTQDLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLL

AT1G04930.2 hydroxyproline-rich glycoprotein family protein2.5e-3836.66Show/hide
Query:  IPTNTTAATTTNTVAPISFTLTTAAAT--TTAAIARPVANQAPSRPVSSIPQTHH----LHYP----PQALYA----AQPIPVRTPNHQLPKLQQDASQS
        IP   ++A+ T     +S +L+TA+ T  T     RP  +Q P  P    P T+     L +P     Q+ Y+    A  IPVR       +  QD S +
Subjt:  IPTNTTAATTTNTVAPISFTLTTAAAT--TTAAIARPVANQAPSRPVSSIPQTHH----LHYP----PQALYA----AQPIPVRTPNHQLPKLQQDASQS

Query:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNL-PQQLISLSGSAI-----------------SGS
        +LYP A  GRGF  RP+R   AD +VT  N  GYP RP  T+   P     ++S+       R P + P   + L GS +                 SG 
Subjt:  ILYPVASSGRGFVPRPIRPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNL-PQQLISLSGSAI-----------------SGS

Query:  IKGAPNSSDPKVF---------------PPSTIYESNGCKEMRIRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIP
        I G     DPK                 PP++I +++  ++ R +D AL VVR RKVRIT+G +SLY+L RSWL+NG+    QPQ    ++ LP+PLP+ 
Subjt:  IKGAPNSSDPKVF---------------PPSTIYESNGCKEMRIRDDALCVVRDRKVRITDG-ASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIP

Query:  VA--GAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLL
        +    +VP       EE  DE+ +DE +++ LS +DLLKRH++RAKKVR++LREER +RI RYK R+ L+L
Subjt:  VA--GAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLL

AT2G32840.1 proline-rich family protein5.1e-5243.87Show/hide
Query:  MTVDSPLIPTNTTAATTTNTVAPISFTLTTAAATTTAAIARPVANQAPSRPVSSIPQTH--HLHYPPQALYAAQPIPVRTPN------HQLPKLQQDASQ
        M+   P    N   + +     PI  T + +  T    +  P +   P  P SS       H H+P Q +Y   P+P+R  N      HQ P    D S 
Subjt:  MTVDSPLIPTNTTAATTTNTVAPISFTLTTAAATTTAAIARPVANQAPSRPVSSIPQTH--HLHYPPQALYAAQPIPVRTPN------HQLPKLQQDASQ

Query:  SILYPVASSGRGFVPRPIRPLPADQAVTVA--NPGGY-PHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFP
        S++YP  SSGRGF  RP+R      A  V   +PGGY P  PV  + H    S +LD M+  M  A P N  QQ   L     SG +KG P+   P+  P
Subjt:  SILYPVASSGRGFVPRPIRPLPADQAVTVA--NPGGY-PHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFP

Query:  -PSTIYESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDK-DEGSIE
         P++I +++G K+ R RDDAL +VR RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP+       +  K+++EE + EEDK DE S++
Subjt:  -PSTIYESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDK-DEGSIE

Query:  HLSTQDLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD
        HLS  DLLKRH+ RAKKVR+RLREERL+RI RYK RLALLLPP  EQ R +
Subjt:  HLSTQDLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPPPVEQLRTD

AT2G32840.2 proline-rich family protein3.4e-3238.44Show/hide
Query:  MTVDSPLIPTNTTAATTTNTVAPISFTLTTAAATTTAAIARPVANQAPSRPVSSIPQTH--HLHYPPQALYAAQPIPVRTPN------HQLPKLQQDASQ
        M+   P    N   + +     PI  T + +  T    +  P +   P  P SS       H H+P Q +Y   P+P+R  N      HQ P    D S 
Subjt:  MTVDSPLIPTNTTAATTTNTVAPISFTLTTAAATTTAAIARPVANQAPSRPVSSIPQTH--HLHYPPQALYAAQPIPVRTPN------HQLPKLQQDASQ

Query:  SILYPVASSGRGFVPRPIRPLPADQAVTVA--NPGGY-PHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFP
        S++YP  SSGRGF  RP+R      A  V   +PGGY P  PV  + H    S +LD M+  M  A P N  QQ   L     SG +KG P+   P+  P
Subjt:  SILYPVASSGRGFVPRPIRPLPADQAVTVA--NPGGY-PHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFP

Query:  -PSTIYESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKD
         P++I +++G K+ R RDDAL +VR RKVRIT+GASLY+LCRSWLRNG+ E  +PQ  + +  LP+PLP+       +  K+++EE + EEDK+
Subjt:  -PSTIYESNGCKEMRIRDDALCVVRDRKVRITDGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGTCGACTCCCCCCTCATCCCCACCAACACGACGGCCGCCACCACTACTAACACCGTCGCTCCGATTTCATTCACTCTAACCACAGCTGCAGCCACCACCACTGC
CGCCATTGCTCGTCCGGTTGCGAATCAAGCGCCGTCGAGACCCGTTTCTTCAATTCCTCAGACCCACCATCTCCACTACCCTCCTCAAGCCCTCTACGCCGCGCAGCCCA
TCCCCGTTCGAACTCCCAACCACCAATTGCCGAAGCTTCAGCAAGATGCATCTCAGTCAATTCTTTACCCTGTTGCTTCCTCCGGCCGCGGCTTCGTTCCTCGCCCCATT
CGGCCCCTTCCCGCCGATCAGGCCGTCACGGTGGCCAACCCTGGCGGCTACCCACATCGCCCCGTCGTCACTTTCTCGCATCGGCCGATTGGGTCGCCTCATTTGGACTC
CATGAGCCATCCAATGCACTTGGCGCGACCTCCCAACTTGCCGCAGCAGTTGATTTCCCTTTCTGGGTCCGCCATTTCGGGCTCGATTAAGGGTGCCCCCAATTCCTCTG
ATCCAAAGGTTTTTCCTCCATCAACAATCTACGAGTCAAATGGATGTAAAGAAATGAGGATCAGAGACGATGCTCTTTGTGTGGTTAGAGATAGAAAAGTCCGAATAACT
GATGGGGCTTCTCTTTATGCACTTTGTCGATCATGGTTGAGGAATGGTTCTCAAGAAGAAAGCCAGCCACAATATGGAAATTTTTTGAGATCTCTTCCGAGACCTTTGCC
CATCCCTGTGGCTGGTGCTGTACCAACCCAGAAGAAGGAAGTCATCGAAGAAGAAGTTGATGAGGAAGATAAGGATGAGGGATCCATTGAGCACTTGTCAACTCAAGATC
TATTAAAAAGACATGTTAAACGAGCAAAGAAAGTTCGATCGCGATTGAGGGAAGAACGGTTGCAACGAATTGAAAGATACAAAACCAGGCTTGCTCTTCTCCTTCCTCCG
CCAGTCGAGCAGTTGAGAACTGATAATATTACTGGAAGCTGA
mRNA sequenceShow/hide mRNA sequence
CAATTTGATATTCAAACAATCAAAGCACATTCACCCAAATTCCCCGCCACCATGACCGTCGACTCCCCCCTCATCCCCACCAACACGACGGCCGCCACCACTACTAACAC
CGTCGCTCCGATTTCATTCACTCTAACCACAGCTGCAGCCACCACCACTGCCGCCATTGCTCGTCCGGTTGCGAATCAAGCGCCGTCGAGACCCGTTTCTTCAATTCCTC
AGACCCACCATCTCCACTACCCTCCTCAAGCCCTCTACGCCGCGCAGCCCATCCCCGTTCGAACTCCCAACCACCAATTGCCGAAGCTTCAGCAAGATGCATCTCAGTCA
ATTCTTTACCCTGTTGCTTCCTCCGGCCGCGGCTTCGTTCCTCGCCCCATTCGGCCCCTTCCCGCCGATCAGGCCGTCACGGTGGCCAACCCTGGCGGCTACCCACATCG
CCCCGTCGTCACTTTCTCGCATCGGCCGATTGGGTCGCCTCATTTGGACTCCATGAGCCATCCAATGCACTTGGCGCGACCTCCCAACTTGCCGCAGCAGTTGATTTCCC
TTTCTGGGTCCGCCATTTCGGGCTCGATTAAGGGTGCCCCCAATTCCTCTGATCCAAAGGTTTTTCCTCCATCAACAATCTACGAGTCAAATGGATGTAAAGAAATGAGG
ATCAGAGACGATGCTCTTTGTGTGGTTAGAGATAGAAAAGTCCGAATAACTGATGGGGCTTCTCTTTATGCACTTTGTCGATCATGGTTGAGGAATGGTTCTCAAGAAGA
AAGCCAGCCACAATATGGAAATTTTTTGAGATCTCTTCCGAGACCTTTGCCCATCCCTGTGGCTGGTGCTGTACCAACCCAGAAGAAGGAAGTCATCGAAGAAGAAGTTG
ATGAGGAAGATAAGGATGAGGGATCCATTGAGCACTTGTCAACTCAAGATCTATTAAAAAGACATGTTAAACGAGCAAAGAAAGTTCGATCGCGATTGAGGGAAGAACGG
TTGCAACGAATTGAAAGATACAAAACCAGGCTTGCTCTTCTCCTTCCTCCGCCAGTCGAGCAGTTGAGAACTGATAATATTACTGGAAGCTGAGTATGCATCCCGAGATC
CTCGTCGAATTCCCTGTGGACACTTTGTCCAAATCATTCTTCCCAATTACAGATGAACTCGAGAAACATTCAACGGAAGCGCGGAGGATTGTATGTAGGTAAGACTAGAT
AGATATTTGATAGATTTTGGTAATTCTTGTCTTCCTATCAGTTTGTATCATGTTGGTTTTAATCAAGTTAAAAAATGAAGTCATTTACTGAAGTTAGAAATGGAAGACAT
TGATAAGTAAATCACATATTTTCCCCTTTTTTGTACCTATCTCCATTACTTTTATCTTAG
Protein sequenceShow/hide protein sequence
MTVDSPLIPTNTTAATTTNTVAPISFTLTTAAATTTAAIARPVANQAPSRPVSSIPQTHHLHYPPQALYAAQPIPVRTPNHQLPKLQQDASQSILYPVASSGRGFVPRPI
RPLPADQAVTVANPGGYPHRPVVTFSHRPIGSPHLDSMSHPMHLARPPNLPQQLISLSGSAISGSIKGAPNSSDPKVFPPSTIYESNGCKEMRIRDDALCVVRDRKVRIT
DGASLYALCRSWLRNGSQEESQPQYGNFLRSLPRPLPIPVAGAVPTQKKEVIEEEVDEEDKDEGSIEHLSTQDLLKRHVKRAKKVRSRLREERLQRIERYKTRLALLLPP
PVEQLRTDNITGS