; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023658 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023658
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF604)
Genome locationtig00000892:5310426..5317593
RNA-Seq ExpressionSgr023658
SyntenySgr023658
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0008375 - acetylglucosaminyltransferase activity (molecular function)
InterPro domainsIPR006740 - Protein of unknown function DUF604


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034956.1 hypothetical protein SDJN02_01749, partial [Cucurbita argyrosperma subsp. argyrosperma]7.6e-21078.27Show/hide
Query:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR
        +S      Y  ASVFLLRTF   H S     SPLSL QIVFGIA+++ SWP RKDY++IWWKP LMRGCVF+D+LP  E  QN D SSLPAV +S DTSR
Subjt:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR

Query:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF
        FRYT+RGG RSAIR+ARVVLET+A GHSNVRWYVFGDDDT FFPENLVKTLSKYD +LWYYIGSNSET DQNR F +EMGFGGAGFAISQSLAK+L+KVF
Subjt:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF

Query:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR
        DSCIERYPHLYGSDSRI SCL ELGV+LTHE GFHQVD++GNIFGLLASHPLTPLVSLH+LDHI+PIFPNMT KE+LQHLFEAVEVD QRI+QQSVCYDR
Subjt:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR

Query:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE
        WFSWTISVSWGYAVQI E HVFL D I VQ+TF  WK+   VEPGSF  NTREIH DPCRRP VFYFDRASS+W+G I+++YKKA VNCS+GP SA+RLE
Subjt:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE

Query:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH
        EVRVLS KLDL  KQLQAPRRQCCDVLPS  GE +DIAIR+CKEEELIHMH
Subjt:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH

XP_022140605.1 uncharacterized protein LOC111011216 [Momordica charantia]2.8e-21279.55Show/hide
Query:  YLFASVFLLRTF-QPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD--SSLPAVCISEDTSRFRYTFR
        YLF+SVFL R+F  P   ++   SSPLSL QIVFGIASN+ SWP RKDYVR+WWKPNLMRGCVFVD+LPP +   + D  SSLPA+CIS DTS FRYT+R
Subjt:  YLFASVFLLRTF-QPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD--SSLPAVCISEDTSRFRYTFR

Query:  GGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIER
         G RSAIRVARVVLET+AAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDH LWYY+GSNSETY QNR FDFEMGFGGAGFAI+QSLA+ L+KVFDSC+ER
Subjt:  GGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIER

Query:  YPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTI
        YPHLYGSDSR+  C+AELGV+LTHE GFHQVD++GNIFGLLA+HP+TPLVSLHHLDHIDPIFPNMTTKEAL+HLFE+VEVDPQR++QQSVCYDRWFSWTI
Subjt:  YPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTI

Query:  SVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLEEVRVLS
        SVSWGYAV+IFE HVFLPD  R ++TF  W K   VE GSF+FNT E+  DPCRRPTVFYFDRASSDWDG IK++YKK  VNCSFGP+S KRLEEVRVLS
Subjt:  SVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLEEVRVLS

Query:  HKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH
         KLD  VKQL APRRQCCDVLPSTAGEA+DIAIR+CKE ELIHMH
Subjt:  HKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH

XP_022947744.1 uncharacterized protein LOC111451516 [Cucurbita moschata]2.9e-20977.83Show/hide
Query:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR
        +S      Y  ASVFLLRTF   H S     SPLSL QIVFGIA+++ SWP+RKDY++IWWKP LMRGCVF+D+LP  E  QN D SSLP V +S DTSR
Subjt:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR

Query:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF
        FRYT+RGG RSAIR+ARVVLET+A GHSNVRWYVFGDDDT FFPENLVKTLSKYD +LWYYIGSNSET DQNR F +EMGFGGAGFAISQSLAK+L+KVF
Subjt:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF

Query:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR
        DSCIERYPHLYGSDSRI SCL ELGV+LTHE GFHQVD++GNIFGLLASHPLTPLVSLH+LDHI+PIFPNMT KE+LQHLFEAVEVD QRI+QQSVCYDR
Subjt:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR

Query:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE
        WFSWTISVSWGYAVQI E HVFL D I VQ+TF  WK+   VEPGSF  NTREIH DPCRRP VFYFDR SS+W+G I+++YKKA VNCS+GP SA+RLE
Subjt:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE

Query:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH
        EVRVLS KLDL  KQLQAPRRQCCDVLPS  GE +DIAIR+CKEEELIHMH
Subjt:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH

XP_022970998.1 uncharacterized protein LOC111469800 [Cucurbita maxima]2.8e-21278.71Show/hide
Query:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR
        +S      Y  ASVFLLRTF P + S  H  SPLSL QIVFGIA+++ SWP+RKDY++IWWKP LMRGCVF+D+LP  E  QN D SSLPAV +S DTSR
Subjt:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR

Query:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF
        FRYT+RGG RSAIR+ARVVLET+A GHSNVRWYVFGDDDTLFFPENLVKTLSKYD +LWYYIGSNSET DQNR F +EMGFGGAGFAISQSLAK+L+KVF
Subjt:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF

Query:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR
        DSCIERYPHLYGSDSRI SCL ELGV+LTHE GFHQVD++GNIFGLLASHPLTPLVSLH+LDHI+PIFPN+T KE+LQHLFEAVEVD QRI+QQSVCYDR
Subjt:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR

Query:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE
        WFSWTISVSWGYAVQI E HVFLPDVI VQ+TF  WK+   VEPGSF  NTREIH DPCRRP VFYFDRASS+W+G I+++YKKA VNCS+GP SA+RLE
Subjt:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE

Query:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH
        EVRVLS KLDL  KQLQAPRRQCCDVLPS  GE +DIAIR+CKEEELIH+H
Subjt:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH

XP_023534107.1 uncharacterized protein LOC111795766 [Cucurbita pepo subsp. pepo]1.1e-21178.27Show/hide
Query:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR
        +S      Y  ASVFLLRTF P H S  H  SPLSL QIVFGIA+++ SWP+RKDY++IWWKP LMRGCVF+D+LP  E  +N D SSLPAV +S DTSR
Subjt:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR

Query:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF
        FRYT+RGG RSAIR+ARVVLET+A GHSNVRWYVFGDDDT FFPENLVKTLSKYD +LWYYIGSNSET DQNR F +EMGFGGAGFAISQSLA++L+KVF
Subjt:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF

Query:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR
        DSCIERYPHLYGSDSRI SCL ELGV+LTHE GFHQVD++GNIFGLLASHPLTPLVSLH+LDHI+PIFPNMT KE+LQHLFEAVEVD QRI+QQSVCYDR
Subjt:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR

Query:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE
        WFSWTISVSWGYAVQI E +VFLPD I VQ+TF  WK+   VEPGSF  NTREIH DPCRRP VFYFDRASS+W+G I+++YKKA VNCS+GP SA+RLE
Subjt:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE

Query:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH
        EVRVLS KLDL  KQLQAPRRQCCDVLPS  GE +DIAIR+CKEEELIHMH
Subjt:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH

TrEMBL top hitse value%identityAlignment
A0A0A0LR38 Uncharacterized protein3.8e-20775.56Show/hide
Query:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRF
        +S  +   Y  +SVFL  TFQPS +     SS LSL QIVFGIASN+ SWP+RKDY++IWWKPNLMRGCVFVD +P    A +  SSLPAVC+S DTSRF
Subjt:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRF

Query:  RYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFD
        RYT+RGG RSAIRVARVVLETVAAGHSNVRWYVFGDDDT FFPENLVKTLSKYD  LWYYIGSNSETY QNR F FEMGFGGAGFAISQ LA+ L+ VFD
Subjt:  RYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFD

Query:  SCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRW
        SC++RYPHLYGSDSR+HSCL ELGV+LTHE GFHQVD++G+IFGLLASHPLTP+V+LHHLD I+PIFPN T KE+LQHL++AVE+DP R++QQSVCYDRW
Subjt:  SCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRW

Query:  FSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLEE
        FSWTISVSWGYAVQI++HHVFL D I VQQTF+ W KG  VEPGSFTFNTREIH DPCRRPTVFY D+ SSDW G IKT+YKK  +NCSFG AS +R +E
Subjt:  FSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLEE

Query:  VRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH
        VRV S KL++  KQLQAPRRQCCDVLPSTAGE +++AIR+CKEEE+IHMH
Subjt:  VRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH

A0A5D3BH05 DUF604 domain-containing protein1.0e-20476.98Show/hide
Query:  YLFASVFLLRTFQPSHSSERHGSSP-LSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRFRYTFRGG
        Y  +S+FL  TF PS +   H  SP LSL QIVFGIASN+ SWP+RKDY +IWWKPNLMRGCVFVD +P    A +  SSLPAVC+S DTSRFRYT+RGG
Subjt:  YLFASVFLLRTFQPSHSSERHGSSP-LSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRFRYTFRGG

Query:  LRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIERYP
         RSAIRVARVVLETVAAGHS+VRWYVFGDDDT FFPENLV+TLSKYD  LWYYIGSNSETY QNR F FEMGFGGAGFAISQ LAK L+ VFDSC+ERYP
Subjt:  LRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIERYP

Query:  HLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTISV
        HLYGSDSR+HSCL ELGV+LTHE GFHQVD+RG+IFGLLASHPLTPLV+LHHLDHI+PIFPN TT+E+LQHL++AVE+DP R++QQSVCYDRWFSWTISV
Subjt:  HLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTISV

Query:  SWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLEEVRVLSHK
        SWGYAVQI++HHVFL D I VQQTFS W+K   VEPGSFTFNTREIH DPCRRPTVFY D+ SSDW G IKT+YKK  +NCSFG AS +R +EVRV S K
Subjt:  SWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLEEVRVLSHK

Query:  LDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH
        L++  KQLQAPRRQCCDVLPSTA E ++IAIR+CKEEE+IHMH
Subjt:  LDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH

A0A6J1CFJ4 uncharacterized protein LOC1110112161.3e-21279.55Show/hide
Query:  YLFASVFLLRTF-QPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD--SSLPAVCISEDTSRFRYTFR
        YLF+SVFL R+F  P   ++   SSPLSL QIVFGIASN+ SWP RKDYVR+WWKPNLMRGCVFVD+LPP +   + D  SSLPA+CIS DTS FRYT+R
Subjt:  YLFASVFLLRTF-QPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD--SSLPAVCISEDTSRFRYTFR

Query:  GGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIER
         G RSAIRVARVVLET+AAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDH LWYY+GSNSETY QNR FDFEMGFGGAGFAI+QSLA+ L+KVFDSC+ER
Subjt:  GGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIER

Query:  YPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTI
        YPHLYGSDSR+  C+AELGV+LTHE GFHQVD++GNIFGLLA+HP+TPLVSLHHLDHIDPIFPNMTTKEAL+HLFE+VEVDPQR++QQSVCYDRWFSWTI
Subjt:  YPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTI

Query:  SVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLEEVRVLS
        SVSWGYAV+IFE HVFLPD  R ++TF  W K   VE GSF+FNT E+  DPCRRPTVFYFDRASSDWDG IK++YKK  VNCSFGP+S KRLEEVRVLS
Subjt:  SVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLEEVRVLS

Query:  HKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH
         KLD  VKQL APRRQCCDVLPSTAGEA+DIAIR+CKE ELIHMH
Subjt:  HKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH

A0A6J1G7T3 uncharacterized protein LOC1114515161.4e-20977.83Show/hide
Query:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR
        +S      Y  ASVFLLRTF   H S     SPLSL QIVFGIA+++ SWP+RKDY++IWWKP LMRGCVF+D+LP  E  QN D SSLP V +S DTSR
Subjt:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR

Query:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF
        FRYT+RGG RSAIR+ARVVLET+A GHSNVRWYVFGDDDT FFPENLVKTLSKYD +LWYYIGSNSET DQNR F +EMGFGGAGFAISQSLAK+L+KVF
Subjt:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF

Query:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR
        DSCIERYPHLYGSDSRI SCL ELGV+LTHE GFHQVD++GNIFGLLASHPLTPLVSLH+LDHI+PIFPNMT KE+LQHLFEAVEVD QRI+QQSVCYDR
Subjt:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR

Query:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE
        WFSWTISVSWGYAVQI E HVFL D I VQ+TF  WK+   VEPGSF  NTREIH DPCRRP VFYFDR SS+W+G I+++YKKA VNCS+GP SA+RLE
Subjt:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE

Query:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH
        EVRVLS KLDL  KQLQAPRRQCCDVLPS  GE +DIAIR+CKEEELIHMH
Subjt:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH

A0A6J1I5I9 uncharacterized protein LOC1114698001.3e-21278.71Show/hide
Query:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR
        +S      Y  ASVFLLRTF P + S  H  SPLSL QIVFGIA+++ SWP+RKDY++IWWKP LMRGCVF+D+LP  E  QN D SSLPAV +S DTSR
Subjt:  LSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGD-SSLPAVCISEDTSR

Query:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF
        FRYT+RGG RSAIR+ARVVLET+A GHSNVRWYVFGDDDTLFFPENLVKTLSKYD +LWYYIGSNSET DQNR F +EMGFGGAGFAISQSLAK+L+KVF
Subjt:  FRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVF

Query:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR
        DSCIERYPHLYGSDSRI SCL ELGV+LTHE GFHQVD++GNIFGLLASHPLTPLVSLH+LDHI+PIFPN+T KE+LQHLFEAVEVD QRI+QQSVCYDR
Subjt:  DSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDR

Query:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE
        WFSWTISVSWGYAVQI E HVFLPDVI VQ+TF  WK+   VEPGSF  NTREIH DPCRRP VFYFDRASS+W+G I+++YKKA VNCS+GP SA+RLE
Subjt:  WFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDG-IKTSYKKASVNCSFGPASAKRLE

Query:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH
        EVRVLS KLDL  KQLQAPRRQCCDVLPS  GE +DIAIR+CKEEELIH+H
Subjt:  EVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G01570.1 Protein of unknown function (DUF604)1.3e-10645.39Show/hide
Query:  SHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPN-LMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRFRYTFRGGLRSAIRVARVVLET
        S S   H      L+ +VFGIA++   W  RKDYV++WWKPN  M G V++D+        N   +LP + IS DTSRF+Y +  GLRSAIR+ R+V ET
Subjt:  SHSSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPN-LMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRFRYTFRGGLRSAIRVARVVLET

Query:  V-----AAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIERYPHLYGSDSRI
        V          NVRW V GDDDT+FFPENLVK L KYDH  +YYIGS+SE++ QN  F + M +GG GFAIS  LAK L+K+ D CI+RY  LYGSD RI
Subjt:  V-----AAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIERYPHLYGSDSRI

Query:  HSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTISVSWGYAVQIF
        H+C++ELGV LT E GFHQ+D+ G + GLL++HPL PLVS+HHLD +DP+FPNM    A++      ++D   + QQS+CYD    WT+SVSWGY VQI 
Subjt:  HSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTISVSWGYAVQIF

Query:  EHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDGIKTS------YKKASVNCSFGPASAKRLEEVRVLSHKLDLH
           +   +++   +TF  W   K  +  S+ FNTR I    C+RP V+Y   A  D    +T+      Y      C +  +     E V V        
Subjt:  EHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSDWDGIKTS------YKKASVNCSFGPASAKRLEEVRVLSHKLDLH

Query:  VKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEE
          + +APRR CC VLP+T    M I +  CK++E
Subjt:  VKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEE

AT1G05280.1 Protein of unknown function (DUF604)5.7e-13956.5Show/hide
Query:  LACYLFASVFLLRTFQPSH-----SSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRF
        L  YL  S   L++    H     S +    SP  +E IVFGI S+ ISW  R++YV++WW    MRGCVFV++  P+         LP VC+S+DTSRF
Subjt:  LACYLFASVFLLRTFQPSH-----SSERHGSSPLSLEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRF

Query:  RYTFRGGLRSAIRVARVVLETVA---AGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQK
        RYT+RGG R+AIR+AR VLETV         VRWYVFGDDDT+F PENL +TLSKYDH  WYYIGS SE Y QN  F  +M FGG G+A+S SLA VL +
Subjt:  RYTFRGGLRSAIRVARVVLETVA---AGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQK

Query:  VFDSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCY
         FDSCIERYPHLYG DSR+++C+ ELGV L+ EPGFHQ DVRGN  G+L SH   PLVSLHH+ HIDPIFPN TT  A++HLF AV++DP RI Q SVCY
Subjt:  VFDSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCY

Query:  DRWFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDR-ASSDWDG-IKTSYKKASVNCSFGP-ASA
        DRW+SWTISVSWGY VQI   H+FL DV+R Q+TF  W+K   +    +TFNTREIH DPC+RP  FY    +SS  DG IK+ YK+A  NC++ P  S 
Subjt:  DRWFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYFDR-ASSDWDG-IKTSYKKASVNCSFGP-ASA

Query:  KRLEEVRVLSHKLDLHVKQLQAP
        +++ E+RV S +LD +++Q Q+P
Subjt:  KRLEEVRVLSHKLDLHVKQLQAP

AT4G15240.1 Protein of unknown function (DUF604)6.8e-13255.34Show/hide
Query:  IVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRFRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDD
        ++F IA++  SW RR  YVR+W+ P   R  VF+D+          D +LP V +S+D SRF Y F GGLRSAIRVARVV ETV  G  +VRW+VFGDDD
Subjt:  IVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRFRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDD

Query:  TLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDV
        T+FF +NLV  LSKYDH+ W+Y+GSNSE YDQN  + F+M FGG GFAIS SLAKVL KV DSC+ RY H+YGSDSRI SC+AELGV LTHEPGFHQ+DV
Subjt:  TLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDV

Query:  RGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKG
        RGNIFGLL +HPL+PLVSLHHLD +DP FP     E++ HL  A   D  RI+QQSVCYD   + T+SV WGYAVQ++E +  LPD++ +Q+TFS+W++G
Subjt:  RGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKG

Query:  KNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSD-WDGIKTSYKKASVNCSFGPASAKRLEEVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAI
          V+  ++ F+TRE   DPC RP VF+ D   SD  +G  ++Y    V       + +RLE +RVLS KL+ +V+Q+  PRRQCCD+  S   ++M I I
Subjt:  KNVEPGSFTFNTREIHPDPCRRPTVFYFDRASSD-WDGIKTSYKKASVNCSFGPASAKRLEEVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAI

Query:  RECKEEELIHMH
        R+C  +ELI M+
Subjt:  RECKEEELIHMH

AT4G23490.1 Protein of unknown function (DUF604)2.0e-10445.8Show/hide
Query:  LEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSS-LPAVCISEDTSRFRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVF
        L  +VFGIA++   W +RK+Y++IW+KP  MRG V++DK      + + D   LP V IS  T+ F YT + G RSA+R++R+V ET+  G  NVRW+V 
Subjt:  LEQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSS-LPAVCISEDTSRFRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVF

Query:  GDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFH
        GDDDT+F  +NL++ L KYDH+  YYIGS SE++ QN  F + M +GG GFAIS  LAK L K+ D CI+RYP LYGSD R+ +C+AELGV LT E GFH
Subjt:  GDDDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFH

Query:  QVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTISVSWGYAVQIFEHHVFLPDVIRV-QQTFS
        Q DV GN+FGLLA+HP+TP VS+HHLD ++PIFPNMT   AL+ + E +++D   ++QQS+CYD+  SWTISVSWGYAVQIF   +F P  + +  +TF 
Subjt:  QVDVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTISVSWGYAVQIFEHHVFLPDVIRV-QQTFS

Query:  SWKKGKNVEPGSFTFNTREIHPDPCRRPTVFY-----FDRASSDWDGIKTSYKKASVNCSFGPASAKRLEEVRVLSHKLDLHVKQLQAPRRQCCDVLPST
        +W   K  +  ++ FNTR +  +PC++P VFY     FD+  +      T ++ +  +C +   +   +  + V+  K D H+ + ++PRR CC VL + 
Subjt:  SWKKGKNVEPGSFTFNTREIHPDPCRRPTVFY-----FDRASSDWDGIKTSYKKASVNCSFGPASAKRLEEVRVLSHKLDLHVKQLQAPRRQCCDVLPST

Query:  AGEAMDIAIRECKEEEL
            + I +  C+  E+
Subjt:  AGEAMDIAIRECKEEEL

AT5G41460.1 Protein of unknown function (DUF604)4.6e-10445.56Show/hide
Query:  EQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRFRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGD
        + +VFGIA++   W +RK+Y++IW+KPN MR  V+++K P  E  +  + SLP V IS DTS+F Y  + G RSAIR++R+V ET+  G  +VRW+V GD
Subjt:  EQIVFGIASNEISWPRRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRFRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGD

Query:  DDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQV
        DDT+F  ENL++ L KYDH   YYIGS SE++ QN  F + M +GG GFAIS  LA  L K+ D CI+RYP LYGSD R+ +C+AELGV LT E GFHQ 
Subjt:  DDTLFFPENLVKTLSKYDHQLWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQV

Query:  DVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWK
        DV GN+FGLLA+HP+ PLV+LHHLD ++PIFPNMT  +AL+HL    ++D   +MQQS+CYD+   WT+SVSWG+AVQIF       ++    +TF +W 
Subjt:  DVRGNIFGLLASHPLTPLVSLHHLDHIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWK

Query:  KGKNVEPGSFTFNTREIHPDPCRRPTVFYF-----DRASSDWDGIKTSYKKASVNCSFGPASAKRLEEVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGE
          +  +  ++ FNTR +   PC++P VFY       R ++        ++ A   C +  A+   ++ V ++  K D H+   ++PRR CC V  S    
Subjt:  KGKNVEPGSFTFNTREIHPDPCRRPTVFYF-----DRASSDWDGIKTSYKKASVNCSFGPASAKRLEEVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGE

Query:  AMDIAIRECKEEELIHM
         ++I++  CKE E++ +
Subjt:  AMDIAIRECKEEELIHM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAATTCAACCACCATTGTGGCTGCAATCGCCAATGCCATTAACCTTCTTGACGCGGCCATCTTCAAGCACAGAGATGGCAGATCTCCTCTTGCAGTCCCC
AAAAGTCAAAGCCGCAGAAACAAACTTGAATTGGCTTCTGTGTGTGTTTCATTACTCTCGCTCCCCTTTCTCCAAGCGCACCGTTTTGGGAACCTCCGCAATAGC
ATTCTCATTGCAGTCCCAGCTAGAGAGGACCTGCTATCTCTCCTGGAGATGCCGGCGCCGCACGCCATGCGTTTGGTTCCGGCCGAAGCCCTTGGAACCGATGGC
AATTCTTGCGGCCGGGATGAGTGTGATAAGCTAGATAGCTCTTTAGTTCCTCCAAGTTTGTTGAGTAAGCTGGAGATCGGCGGGTGTAAAATTGCCACATATATT
CTAAGTAATAGTGAAAAGGATAGTCAGCAGTGTGAGAATGTGGGCGGGCTTAATGGCTTAAGCGAGCACTACCTGGCATGTTATCTCTTCGCCTCTGTTTTCTTG
TTGCGTACTTTCCAGCCTTCGCATTCATCTGAAAGACATGGGTCGTCTCCTCTCTCGCTCGAGCAAATCGTGTTCGGAATTGCCTCCAACGAGATTTCGTGGCCG
AGGAGGAAGGATTACGTTAGGATATGGTGGAAACCCAATCTAATGCGAGGCTGCGTTTTCGTCGATAAGCTTCCTCCCGCCGAAACGGCACAGAACGGCGACTCT
TCTCTTCCCGCCGTTTGCATCTCCGAGGACACTTCGCGATTCCGTTACACTTTCAGAGGTGGTCTCCGATCAGCCATAAGGGTGGCGCGCGTGGTTTTGGAGACG
GTGGCCGCAGGACATTCAAACGTCCGGTGGTACGTTTTTGGAGACGACGACACACTTTTCTTCCCGGAGAATCTGGTGAAGACTCTGTCCAAGTACGACCACCAA
CTCTGGTACTACATCGGAAGCAACTCGGAGACTTACGACCAGAACAGAGCATTCGATTTCGAAATGGGCTTCGGCGGAGCTGGATTCGCCATTAGCCAATCTCTC
GCCAAAGTCCTGCAAAAAGTCTTCGATTCCTGCATCGAACGGTACCCCCATCTCTACGGAAGCGATTCGAGGATTCACTCTTGCTTGGCAGAGCTCGGCGTGAGA
TTAACACACGAACCTGGATTCCATCAGGTTGATGTGAGAGGCAACATTTTTGGTCTGTTGGCTTCACACCCACTAACCCCATTGGTATCCTTGCACCATTTGGAT
CACATCGACCCAATCTTCCCCAACATGACGACTAAAGAAGCACTCCAACATCTTTTCGAAGCCGTCGAGGTGGATCCTCAGAGGATCATGCAACAGAGCGTCTGC
TACGACCGGTGGTTCTCTTGGACCATATCTGTGTCGTGGGGCTACGCCGTTCAAATATTCGAACACCACGTCTTCTTGCCGGACGTCATCCGCGTACAACAGACA
TTCAGCTCGTGGAAGAAAGGCAAGAACGTGGAACCTGGATCTTTTACCTTCAACACGAGGGAAATCCACCCGGACCCATGTCGCCGGCCGACGGTTTTCTACTTC
GATCGTGCGTCTTCCGATTGGGACGGAATCAAGACCAGTTACAAGAAAGCTTCTGTGAATTGCTCCTTCGGTCCGGCGTCGGCCAAGAGACTGGAAGAGGTCAGG
GTGTTGTCCCATAAGCTCGACCTCCATGTCAAGCAGTTGCAGGCTCCGAGAAGACAATGCTGTGATGTATTACCTTCAACGGCCGGTGAAGCAATGGATATCGCC
ATTAGAGAATGCAAGGAAGAAGAATTAATTCACATGCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAATTCAACCACCATTGTGGCTGCAATCGCCAATGCCATTAACCTTCTTGACGCGGCCATCTTCAAGCACAGAGATGGCAGATCTCCTCTTGCAGTCCCC
AAAAGTCAAAGCCGCAGAAACAAACTTGAATTGGCTTCTGTGTGTGTTTCATTACTCTCGCTCCCCTTTCTCCAAGCGCACCGTTTTGGGAACCTCCGCAATAGC
ATTCTCATTGCAGTCCCAGCTAGAGAGGACCTGCTATCTCTCCTGGAGATGCCGGCGCCGCACGCCATGCGTTTGGTTCCGGCCGAAGCCCTTGGAACCGATGGC
AATTCTTGCGGCCGGGATGAGTGTGATAAGCTAGATAGCTCTTTAGTTCCTCCAAGTTTGTTGAGTAAGCTGGAGATCGGCGGGTGTAAAATTGCCACATATATT
CTAAGTAATAGTGAAAAGGATAGTCAGCAGTGTGAGAATGTGGGCGGGCTTAATGGCTTAAGCGAGCACTACCTGGCATGTTATCTCTTCGCCTCTGTTTTCTTG
TTGCGTACTTTCCAGCCTTCGCATTCATCTGAAAGACATGGGTCGTCTCCTCTCTCGCTCGAGCAAATCGTGTTCGGAATTGCCTCCAACGAGATTTCGTGGCCG
AGGAGGAAGGATTACGTTAGGATATGGTGGAAACCCAATCTAATGCGAGGCTGCGTTTTCGTCGATAAGCTTCCTCCCGCCGAAACGGCACAGAACGGCGACTCT
TCTCTTCCCGCCGTTTGCATCTCCGAGGACACTTCGCGATTCCGTTACACTTTCAGAGGTGGTCTCCGATCAGCCATAAGGGTGGCGCGCGTGGTTTTGGAGACG
GTGGCCGCAGGACATTCAAACGTCCGGTGGTACGTTTTTGGAGACGACGACACACTTTTCTTCCCGGAGAATCTGGTGAAGACTCTGTCCAAGTACGACCACCAA
CTCTGGTACTACATCGGAAGCAACTCGGAGACTTACGACCAGAACAGAGCATTCGATTTCGAAATGGGCTTCGGCGGAGCTGGATTCGCCATTAGCCAATCTCTC
GCCAAAGTCCTGCAAAAAGTCTTCGATTCCTGCATCGAACGGTACCCCCATCTCTACGGAAGCGATTCGAGGATTCACTCTTGCTTGGCAGAGCTCGGCGTGAGA
TTAACACACGAACCTGGATTCCATCAGGTTGATGTGAGAGGCAACATTTTTGGTCTGTTGGCTTCACACCCACTAACCCCATTGGTATCCTTGCACCATTTGGAT
CACATCGACCCAATCTTCCCCAACATGACGACTAAAGAAGCACTCCAACATCTTTTCGAAGCCGTCGAGGTGGATCCTCAGAGGATCATGCAACAGAGCGTCTGC
TACGACCGGTGGTTCTCTTGGACCATATCTGTGTCGTGGGGCTACGCCGTTCAAATATTCGAACACCACGTCTTCTTGCCGGACGTCATCCGCGTACAACAGACA
TTCAGCTCGTGGAAGAAAGGCAAGAACGTGGAACCTGGATCTTTTACCTTCAACACGAGGGAAATCCACCCGGACCCATGTCGCCGGCCGACGGTTTTCTACTTC
GATCGTGCGTCTTCCGATTGGGACGGAATCAAGACCAGTTACAAGAAAGCTTCTGTGAATTGCTCCTTCGGTCCGGCGTCGGCCAAGAGACTGGAAGAGGTCAGG
GTGTTGTCCCATAAGCTCGACCTCCATGTCAAGCAGTTGCAGGCTCCGAGAAGACAATGCTGTGATGTATTACCTTCAACGGCCGGTGAAGCAATGGATATCGCC
ATTAGAGAATGCAAGGAAGAAGAATTAATTCACATGCATTAA
Protein sequenceShow/hide protein sequence
MANSTTIVAAIANAINLLDAAIFKHRDGRSPLAVPKSQSRRNKLELASVCVSLLSLPFLQAHRFGNLRNSILIAVPAREDLLSLLEMPAPHAMRLVPAEALGTDG
NSCGRDECDKLDSSLVPPSLLSKLEIGGCKIATYILSNSEKDSQQCENVGGLNGLSEHYLACYLFASVFLLRTFQPSHSSERHGSSPLSLEQIVFGIASNEISWP
RRKDYVRIWWKPNLMRGCVFVDKLPPAETAQNGDSSLPAVCISEDTSRFRYTFRGGLRSAIRVARVVLETVAAGHSNVRWYVFGDDDTLFFPENLVKTLSKYDHQ
LWYYIGSNSETYDQNRAFDFEMGFGGAGFAISQSLAKVLQKVFDSCIERYPHLYGSDSRIHSCLAELGVRLTHEPGFHQVDVRGNIFGLLASHPLTPLVSLHHLD
HIDPIFPNMTTKEALQHLFEAVEVDPQRIMQQSVCYDRWFSWTISVSWGYAVQIFEHHVFLPDVIRVQQTFSSWKKGKNVEPGSFTFNTREIHPDPCRRPTVFYF
DRASSDWDGIKTSYKKASVNCSFGPASAKRLEEVRVLSHKLDLHVKQLQAPRRQCCDVLPSTAGEAMDIAIRECKEEELIHMH