; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024715 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024715
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionVAN3-binding protein-like
Genome locationtig00002486:2149625..2158503
RNA-Seq ExpressionSgr024715
SyntenySgr024715
Gene Ontology termsNA
InterPro domainsIPR008546 - Domain of unknown function DUF828
IPR013666 - Pleckstrin-like, plant
IPR040269 - VAN3-binding protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022956171.1 VAN3-binding protein-like isoform X2 [Cucurbita moschata]1.0e-12059.83Show/hide
Query:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW----------------------------------LSHSS
        MS PSS APL+SSLEFLSRSWRVSPST HLVNS+I  G++GGG  GDI+LE+S  DA+GF                                   LSHSS
Subjt:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW----------------------------------LSHSS

Query:  GPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIAS
        GPLNAAHSGGSLTDSPPFSPSEIADLDSK+YR N+S  NTH RATV+GSG AA GGGKT                             AGVAAAIAAIAS
Subjt:  GPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIAS

Query:  ASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA-----AVI
        ASA SS+G N+REDI KTD+A+ASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRA K+VWNAA        
Subjt:  ASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA-----AVI

Query:  PARKGSELPAAA--VVMRGQLQHE---------DTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFSD
         + +GS   +    +V+  +L  +         D    +CYRGLLA GC         DLHWK+VSVY++R NQVVLKMKSRHVAGTITKKKK+      
Subjt:  PARKGSELPAAA--VVMRGQLQHE---------DTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFSD

Query:  FFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ
          L  G+ K                   A RHLLEGGEDRRYFG+KT+LRGVVEFECRNQREYDMWTQGV+KLL++AA+RIC+
Subjt:  FFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ

XP_023527170.1 VAN3-binding protein-like isoform X2 [Cucurbita pepo subsp. pepo]1.0e-12060.66Show/hide
Query:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW----------------------------------LSHSS
        MS PSS APL+SSLEFLSRSWR+SPST HLVNS+I  G +GGG  GDI+LEDS  DAEGF                                   LSHSS
Subjt:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW----------------------------------LSHSS

Query:  GPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIAS
        GPLNAAHSGGSLTDSPPFSPSEIADLDSK+YR N+S  N H RATV+GSG AA GGGKT                             AGVAAAIAAIAS
Subjt:  GPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIAS

Query:  ASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA--------
        ASA SS+G N+ EDI KTD+A+ASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRA K+VWNAA        
Subjt:  ASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA--------

Query:  -AVIPARKGSELPAAAVVMR---GQLQHEDTFL----TVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFSD
         +   + +GS +    V  +    QLQ  D+ L     VCYRGLLANGC         DLHWK+VSVY++R  QVVLKMKSRHVAGTITKKKK+      
Subjt:  -AVIPARKGSELPAAAVVMR---GQLQHEDTFL----TVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFSD

Query:  FFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ
          L  G+ K                   A RHLLEGGEDRRYFG+KT+LRGVVEFECRNQREYDMWTQGV+KLL+MAA+RIC+
Subjt:  FFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ

XP_038881991.1 VAN3-binding protein isoform X1 [Benincasa hispida]1.3e-12359.32Show/hide
Query:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW-----------------------------------LSHS
        MS PSS  PL+SSLEFLSRSWRVSPST H VNSK    ++GGG  GDI+LEDS GDAEGF                                    LSHS
Subjt:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW-----------------------------------LSHS

Query:  SGPLNAAHS-GGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG--AASGGGKT----------------------------SAGVAAAIAA
        SGPLNAAHS GGSLTDSPPFSPSEIADLD+K+YR N+SL +THLRATV+GSG  AA GGGKT                             AGVAAAIAA
Subjt:  SGPLNAAHS-GGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG--AASGGGKT----------------------------SAGVAAAIAA

Query:  IASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAVIPA
        IASASASSS  AN+ E+IPKTD+AMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNV++AGDIMTLTAAAATALRGAATLKSRA+++VWNAA VIP 
Subjt:  IASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAVIPA

Query:  RKGSEL----------------------------PAAAVVMRG-QLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHV
         KG                               P AA      Q+Q +D FLTVC+R LLANGC         DLHWK+VSVYINR NQVV+KMKSRHV
Subjt:  RKGSEL----------------------------PAAAVVMRG-QLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHV

Query:  AGTITKKKKSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ
        AGTITKKKK+        L   + K                     RHLLEGGEDRRYFGLKT+LRGVVEFECRNQREYDMWTQGV+KLL+MAA+R+C+
Subjt:  AGTITKKKKSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ

XP_038882000.1 VAN3-binding protein isoform X2 [Benincasa hispida]9.9e-12459.44Show/hide
Query:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW----------------------------------LSHSS
        MS PSS  PL+SSLEFLSRSWRVSPST H VNSK    ++GGG  GDI+LEDS GDAEGF                                   LSHSS
Subjt:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW----------------------------------LSHSS

Query:  GPLNAAHS-GGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG--AASGGGKT----------------------------SAGVAAAIAAI
        GPLNAAHS GGSLTDSPPFSPSEIADLD+K+YR N+SL +THLRATV+GSG  AA GGGKT                             AGVAAAIAAI
Subjt:  GPLNAAHS-GGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG--AASGGGKT----------------------------SAGVAAAIAAI

Query:  ASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAVIPAR
        ASASASSS  AN+ E+IPKTD+AMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNV++AGDIMTLTAAAATALRGAATLKSRA+++VWNAA VIP  
Subjt:  ASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAVIPAR

Query:  KGSEL----------------------------PAAAVVMRG-QLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVA
        KG                               P AA      Q+Q +D FLTVC+R LLANGC         DLHWK+VSVYINR NQVV+KMKSRHVA
Subjt:  KGSEL----------------------------PAAAVVMRG-QLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVA

Query:  GTITKKKKSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ
        GTITKKKK+        L   + K                     RHLLEGGEDRRYFGLKT+LRGVVEFECRNQREYDMWTQGV+KLL+MAA+R+C+
Subjt:  GTITKKKKSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ

XP_038882009.1 VAN3-binding protein isoform X3 [Benincasa hispida]1.2e-12460.41Show/hide
Query:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW--------------------------LSHSSGPLNAAHS
        MS PSS  PL+SSLEFLSRSWRVSPST H VNSK    ++GGG  GDI+LEDS GDAEGF                           LSHSSGPLNAAHS
Subjt:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW--------------------------LSHSSGPLNAAHS

Query:  -GGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG--AASGGGKT----------------------------SAGVAAAIAAIASASASSS
         GGSLTDSPPFSPSEIADLD+K+YR N+SL +THLRATV+GSG  AA GGGKT                             AGVAAAIAAIASASASSS
Subjt:  -GGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG--AASGGGKT----------------------------SAGVAAAIAAIASASASSS

Query:  SGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAVIPARKGSEL---
          AN+ E+IPKTD+AMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNV++AGDIMTLTAAAATALRGAATLKSRA+++VWNAA VIP  KG      
Subjt:  SGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAVIPARKGSEL---

Query:  -------------------------PAAAVVMRG-QLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKK
                                 P AA      Q+Q +D FLTVC+R LLANGC         DLHWK+VSVYINR NQVV+KMKSRHVAGTITKKKK
Subjt:  -------------------------PAAAVVMRG-QLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKK

Query:  SGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ
        +        L   + K                     RHLLEGGEDRRYFGLKT+LRGVVEFECRNQREYDMWTQGV+KLL+MAA+R+C+
Subjt:  SGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ

TrEMBL top hitse value%identityAlignment
A0A5A7U0G4 VAN3-binding protein-like5.1e-11857.68Show/hide
Query:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW-----------------------------------LSHS
        MS P++  PL+SSLEFLSRSWRVSPST HL  SKI T    G   GDI+LEDSGGDAEGF                                    LSHS
Subjt:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW-----------------------------------LSHS

Query:  SGPLNAAHS-GGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATV---SGSGAASGGGKT----------------------------SAGVAAAIA
        SGPLNA HS GGSL+DSPPFSPSEIA+LD+K+YR N+S  +THLRATV   SG+ AA GGGKT                             AGVAAAIA
Subjt:  SGPLNAAHS-GGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATV---SGSGAASGGGKT----------------------------SAGVAAAIA

Query:  AIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAVIP
        AIASASA SS+G N+ E++PKTD+AMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNV+SAGDIMTLTAAAATALRGAATLKSRA+K++WNAA VIP
Subjt:  AIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAVIP

Query:  ARKG----------------------------SEL--PAAAVVMRGQLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSR
          KG                             EL    AA     Q+Q  D F  VCYRGLLANGC         DLHWK+VSVYINRMNQVV+KMKSR
Subjt:  ARKG----------------------------SEL--PAAAVVMRGQLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSR

Query:  HVAGTITKKKKSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRIC
        HVAGTITKKKK+                      F     +       RHLLEGGEDRRYFGLKT+LRGVVEFECRNQREY+MWTQGVSKLL+MAA+R+C
Subjt:  HVAGTITKKKKSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRIC

Query:  Q
        +
Subjt:  Q

A0A6J1GVU2 VAN3-binding protein-like isoform X25.0e-12159.83Show/hide
Query:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW----------------------------------LSHSS
        MS PSS APL+SSLEFLSRSWRVSPST HLVNS+I  G++GGG  GDI+LE+S  DA+GF                                   LSHSS
Subjt:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW----------------------------------LSHSS

Query:  GPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIAS
        GPLNAAHSGGSLTDSPPFSPSEIADLDSK+YR N+S  NTH RATV+GSG AA GGGKT                             AGVAAAIAAIAS
Subjt:  GPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIAS

Query:  ASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA-----AVI
        ASA SS+G N+REDI KTD+A+ASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRA K+VWNAA        
Subjt:  ASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA-----AVI

Query:  PARKGSELPAAA--VVMRGQLQHE---------DTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFSD
         + +GS   +    +V+  +L  +         D    +CYRGLLA GC         DLHWK+VSVY++R NQVVLKMKSRHVAGTITKKKK+      
Subjt:  PARKGSELPAAA--VVMRGQLQHE---------DTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFSD

Query:  FFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ
          L  G+ K                   A RHLLEGGEDRRYFG+KT+LRGVVEFECRNQREYDMWTQGV+KLL++AA+RIC+
Subjt:  FFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ

A0A6J1GX28 VAN3-binding protein-like isoform X16.5e-12159.71Show/hide
Query:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW-----------------------------------LSHS
        MS PSS APL+SSLEFLSRSWRVSPST HLVNS+I  G++GGG  GDI+LE+S  DA+GF                                    LSHS
Subjt:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW-----------------------------------LSHS

Query:  SGPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIA
        SGPLNAAHSGGSLTDSPPFSPSEIADLDSK+YR N+S  NTH RATV+GSG AA GGGKT                             AGVAAAIAAIA
Subjt:  SGPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIA

Query:  SASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA-----AV
        SASA SS+G N+REDI KTD+A+ASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRA K+VWNAA       
Subjt:  SASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA-----AV

Query:  IPARKGSELPAAA--VVMRGQLQHE---------DTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFS
          + +GS   +    +V+  +L  +         D    +CYRGLLA GC         DLHWK+VSVY++R NQVVLKMKSRHVAGTITKKKK+     
Subjt:  IPARKGSELPAAA--VVMRGQLQHE---------DTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFS

Query:  DFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ
           L  G+ K                   A RHLLEGGEDRRYFG+KT+LRGVVEFECRNQREYDMWTQGV+KLL++AA+RIC+
Subjt:  DFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ

A0A6J1IQ72 VAN3-binding protein-like isoform X21.4e-12060.04Show/hide
Query:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW----------------------------------LSHSS
        MS PSS APL+SSLEFLSRSWRVSPST HLVNS+I  G +GGG  GDI+LEDS  DAEGF                                   LSHSS
Subjt:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW----------------------------------LSHSS

Query:  GPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIAS
        GPLNAAHSG SLTDSPPFSPSEIADLDSK+YR N+S  NTH RATV+GSG AA GGGKT                             AGVAAAIAAIAS
Subjt:  GPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIAS

Query:  ASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA-----AVI
        ASA SS+G N+ EDI KTD+A+ASAATLVAAQCVEAAE MGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRA K+VWNAA        
Subjt:  ASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA-----AVI

Query:  PARKGSELPAAA--VVMRGQLQHE---------DTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFSD
         + +GS   +    +++  +L  +         D    VCYRGLLANGC         DLHWK+VSVY++R NQVVLKMKSRHVAGTITKKKK+      
Subjt:  PARKGSELPAAA--VVMRGQLQHE---------DTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFSD

Query:  FFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ
          L  G+ K                   A RHLLEGGEDRRYFG+KT+LRGVVEFECRNQREYDMWTQGV+KLL+MAA+RIC+
Subjt:  FFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ

A0A6J1ISN5 VAN3-binding protein-like isoform X11.9e-12059.92Show/hide
Query:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW-----------------------------------LSHS
        MS PSS APL+SSLEFLSRSWRVSPST HLVNS+I  G +GGG  GDI+LEDS  DAEGF                                    LSHS
Subjt:  MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFW-----------------------------------LSHS

Query:  SGPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIA
        SGPLNAAHSG SLTDSPPFSPSEIADLDSK+YR N+S  NTH RATV+GSG AA GGGKT                             AGVAAAIAAIA
Subjt:  SGPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSG-AASGGGKT----------------------------SAGVAAAIAAIA

Query:  SASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA-----AV
        SASA SS+G N+ EDI KTD+A+ASAATLVAAQCVEAAE MGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRA K+VWNAA       
Subjt:  SASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAA-----AV

Query:  IPARKGSELPAAA--VVMRGQLQHE---------DTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFS
          + +GS   +    +++  +L  +         D    VCYRGLLANGC         DLHWK+VSVY++R NQVVLKMKSRHVAGTITKKKK+     
Subjt:  IPARKGSELPAAA--VVMRGQLQHE---------DTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFS

Query:  DFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ
           L  G+ K                   A RHLLEGGEDRRYFG+KT+LRGVVEFECRNQREYDMWTQGV+KLL+MAA+RIC+
Subjt:  DFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQ

SwissProt top hitse value%identityAlignment
Q8W4K5 VAN3-binding protein1.6e-6347.44Show/hide
Query:  LSHSSGPLNAAHSGGSL--TDSPPFSPSEIADLDSKVYR------PNFSLNNTHLRATVSGSGA--ASGGGKT---------------------------
        LSHSSGPLN    GGS   TDSPP SPS+  D   K +R      P FS        T +GS    A  G KT                           
Subjt:  LSHSSGPLNAAHSGGSL--TDSPPFSPSEIADLDSKVYR------PNFSLNNTHLRATVSGSGA--ASGGGKT---------------------------

Query:  -SAGVAAAIAAIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALK
          A VA+A+AA+A+A+A+SS G NE+  + + D+AMASAA LVAAQCVEAAE MGA+ DHL SV+SSAVNVKS  DI+TLTAAAATALRGAATLK+RALK
Subjt:  -SAGVAAAIAAIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALK

Query:  EVWNAAAVIPARKGSELPAAAVVMRGQLQHEDT------------FLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITK
        EVWN AAV+PA KG+   ++A+  +   +H D+            FL VC + LLA G          +LHWKIVSVYIN+  Q VLKMKS+HV GT TK
Subjt:  EVWNAAAVIPARKGSELPAAAVVMRGQLQHEDT------------FLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITK

Query:  KKKSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR
        KKK             + + +     +  GRD           L  G+   YFGLKT  + V+EFECRNQREY++WTQGVS+LL +AA++
Subjt:  KKKSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR

Arabidopsis top hitse value%identityAlignment
AT3G22810.1 Plant protein of unknown function (DUF828) with plant pleckstrin homology-like region1.4e-7550.79Show/hide
Query:  LSHSSGPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTH---------LRATVSGS---------------------GAASGGGKTSAGVAAA
        LSHSSGPLN     GSLTDSPP SP ++ D+  +  R N + N+ +         + AT + S                      A      + AGVAAA
Subjt:  LSHSSGPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTH---------LRATVSGS---------------------GAASGGGKTSAGVAAA

Query:  IAAIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAV
        +AAIA+A+A+SSS A + E++ KTD+A+ASAATLVAAQCVEAAE MGAE DHLASV+SSAVNV+SAGDIMTLTA AATALRG ATLK+RA+KEVW+ A+V
Subjt:  IAAIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAV

Query:  IPARKGSELPAAAVV------------MRGQLQHEDTFLTVCYRGLLANG---------CDLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYF
        IP  KG  L   + V              G+   ED FL  C R  LA G          DLHWKIVSVYINR+NQV+LKMKSRHV GT TKK K+    
Subjt:  IPARKGSELPAAAVV------------MRGQLQHEDTFLTVCYRGLLANG---------CDLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYF

Query:  SDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR
                +       + + G           RHLLEGGED RYFGLKTV RG+VEF+C++QREY+MWTQGVS+L+ +AA+R
Subjt:  SDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR

AT4G14740.1 Plant protein of unknown function (DUF828) with plant pleckstrin homology-like region7.0e-7550.13Show/hide
Query:  LSHSSGPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLN--NTHLRATVSGSGAASGGGKTS-------------------------------AGV
        LSHSSGPLN     GSLTDSPP SP E  D+         SLN  N+  R+T +  G  +     S                               AGV
Subjt:  LSHSSGPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLN--NTHLRATVSGSGAASGGGKTS-------------------------------AGV

Query:  AAAIAAIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNA
        AAA+AAIA+A+A+SSS   + E + KTD+A+ASAATLVAAQCVEAAE MGAE ++LASV+SSAVNV+SAGDIMTLTA AATALRG  TLK+RA+KEVWN 
Subjt:  AAAIAAIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNA

Query:  AAVIPARKG------------SELPAAAVVMRGQLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSG
        A+VIP  KG                +++    G+L  ++ FL  C R  LA GC         DLHWKIVSVYIN+MNQV+LKMKSRHV GT TKKKK  
Subjt:  AAVIPARKG------------SELPAAAVVMRGQLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSG

Query:  SYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR
        +   D                      +       RHLLEGG+D RYFGLKTV+RG VEFE ++QREY+MWTQGVS+LLV+AA+R
Subjt:  SYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR

AT4G14740.2 Plant protein of unknown function (DUF828) with plant pleckstrin homology-like region7.0e-7550.13Show/hide
Query:  LSHSSGPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLN--NTHLRATVSGSGAASGGGKTS-------------------------------AGV
        LSHSSGPLN     GSLTDSPP SP E  D+         SLN  N+  R+T +  G  +     S                               AGV
Subjt:  LSHSSGPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLN--NTHLRATVSGSGAASGGGKTS-------------------------------AGV

Query:  AAAIAAIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNA
        AAA+AAIA+A+A+SSS   + E + KTD+A+ASAATLVAAQCVEAAE MGAE ++LASV+SSAVNV+SAGDIMTLTA AATALRG  TLK+RA+KEVWN 
Subjt:  AAAIAAIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNA

Query:  AAVIPARKG------------SELPAAAVVMRGQLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSG
        A+VIP  KG                +++    G+L  ++ FL  C R  LA GC         DLHWKIVSVYIN+MNQV+LKMKSRHV GT TKKKK  
Subjt:  AAVIPARKG------------SELPAAAVVMRGQLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSG

Query:  SYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR
        +   D                      +       RHLLEGG+D RYFGLKTV+RG VEFE ++QREY+MWTQGVS+LLV+AA+R
Subjt:  SYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR

AT4G14740.3 Plant protein of unknown function (DUF828) with plant pleckstrin homology-like region8.8e-7056.94Show/hide
Query:  AGVAAAIAAIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEV
        AGVAAA+AAIA+A+A+SSS   + E + KTD+A+ASAATLVAAQCVEAAE MGAE ++LASV+SSAVNV+SAGDIMTLTA AATALRG  TLK+RA+KEV
Subjt:  AGVAAAIAAIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEV

Query:  WNAAAVIPARKG------------SELPAAAVVMRGQLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKK
        WN A+VIP  KG                +++    G+L  ++ FL  C R  LA GC         DLHWKIVSVYIN+MNQV+LKMKSRHV GT TKKK
Subjt:  WNAAAVIPARKG------------SELPAAAVVMRGQLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKK

Query:  KSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR
        K  +   D                      +       RHLLEGG+D RYFGLKTV+RG VEFE ++QREY+MWTQGVS+LLV+AA+R
Subjt:  KSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR

AT5G43870.1 Plant protein of unknown function (DUF828) with plant pleckstrin homology-like region9.1e-7552.81Show/hide
Query:  SGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSGAASGGGKT----------------------------SAGVAAAIAAIASASASSSSG
        S  S TDSPP SPS+I D   + YR + S N  H+R + +  G A GG KT                             AGVAAA+AAIA+A+AS SS 
Subjt:  SGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRATVSGSGAASGGGKT----------------------------SAGVAAAIAAIASASASSSSG

Query:  ANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAVIPARKGSELPAAAV
          + E + K D A+ASAATLVAA+CVEAAE MGA+ +HLASV+SSAVNV+SAGDIMTLTAAAATALRGAA LK+RALKEVWN AAVIP  KG+       
Subjt:  ANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSRALKEVWNAAAVIPARKGSELPAAAV

Query:  VMRGQLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQR
           G+L   D FL +C + LLA GC         DLHWK+VS+YINR  QV+LK KS+HVAGTITKKKK+        +  GL K   G   + G     
Subjt:  VMRGQLQHEDTFLTVCYRGLLANGC---------DLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFSDFFLSWGLTKAKAGAEKFGGGRDQR

Query:  HSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR
              R +LEGGE+ RYFGLKTV + V+EFEC++QREYD+WTQGVS LL +A+DR
Subjt:  HSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCTGGCCGTCGTCCGACGCCCCTCTCAATTCCTCCTTGGAGTTCCTCTCGCGATCTTGGAGAGTCTCCCCCTCCACCGGCCACTTGGTAAATTCCAAGATTCAGAC
GGGGGCTTATGGTGGCGGCGGCTGTGGTGATATCATTTTAGAGGATTCCGGTGGCGACGCGGAGGGTTTCTGGCTGTCGCACAGCAGCGGCCCACTCAACGCCGCCCATA
GCGGCGGTTCATTAACCGACAGTCCACCTTTTTCTCCCTCTGAGATCGCCGACCTCGACTCCAAGGTTTACCGCCCCAACTTTTCTTTGAATAATACCCATTTGAGAGCC
ACCGTGAGTGGGTCCGGCGCCGCCTCTGGAGGCGGCAAGACGTCGGCAGGGGTGGCTGCTGCCATCGCCGCCATCGCTTCCGCCTCTGCCTCTTCTTCCTCCGGCGCCAA
TGAACGTGAAGACATTCCCAAGACCGATGTGGCTATGGCGTCCGCCGCCACGTTAGTCGCCGCTCAATGCGTGGAGGCCGCTGAGGCCATGGGAGCTGAACACGATCACC
TCGCTTCTGTCATTAGCTCCGCCGTGAATGTCAAGTCCGCCGGCGATATAATGACGCTCACGGCAGCCGCCGCAACAGCTTTAAGAGGAGCTGCAACGCTCAAGTCCCGG
GCTCTGAAGGAAGTCTGGAATGCAGCGGCGGTTATTCCTGCGAGAAAGGGATCGGAGCTTCCAGCAGCGGCAGTAGTCATGCGCGGCCAGCTTCAACATGAAGACACTTT
CCTCACCGTCTGCTACAGAGGATTGCTTGCCAATGGCTGTGATCTTCATTGGAAAATCGTCTCCGTTTATATCAACAGAATGAATCAGGTTGTGTTGAAGATGAAGAGCA
GGCACGTGGCTGGGACCATAACCAAAAAGAAAAAGAGTGGGTCTTATTTTTCTGATTTCTTCCTGTCATGGGGATTGACGAAAGCCAAAGCGGGCGCAGAAAAATTTGGT
GGTGGACGTGATCAAAGACATTCCGGCATGGCCCGGCGCCACCTACTGGAGGGCGGAGAGGACCGCCGCTACTTCGGGCTGAAGACGGTGCTGCGTGGGGTGGTGGAGTT
CGAGTGCAGAAACCAGAGAGAGTATGATATGTGGACGCAGGGCGTGTCCAAGCTCCTGGTTATGGCGGCTGATAGGATTTGTCAAAATGGGGACGATGATTTGATGGAGG
GAGGGGAGTGGGATGTCCAGAAAATCAGAAACAAGCACTCTGGGTTTCATTTCACCAGCAAAACGAAACCCCCACCCCCAACAAAACAAGAACGGCATGAAGAGGTTGTT
CCCGCACTTGCATCGCCAAGCTTCTGGGTAGTACTCCAATGGCATACTCAGACCGGGTTGAATTGGGACGTGAACCGGTTTACAGGGCCAGCATCTGCCGCACTTGGATC
TGCACGTCGGTGGCGACGACCCCGGACCGCCCAGAGTGCTCTTCTGGTCCGCCCACCCTCCGCTTCCGGCCTCTACAAACGCCGCCACCGATCGAATCAACCAACCAAGT
TTACGACGGATTCAAAAAGACGAGAGATTAGAGAGTTTGGAGATGGGTTACCTTTGGAGACTGAGGCGGAGGCGGAGGCGGAGCAGAAGAGAAGAGCGAAGGCGGCGAGA
GTGAGAAGAAGAAGAAGAGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCTGGCCGTCGTCCGACGCCCCTCTCAATTCCTCCTTGGAGTTCCTCTCGCGATCTTGGAGAGTCTCCCCCTCCACCGGCCACTTGGTAAATTCCAAGATTCAGAC
GGGGGCTTATGGTGGCGGCGGCTGTGGTGATATCATTTTAGAGGATTCCGGTGGCGACGCGGAGGGTTTCTGGCTGTCGCACAGCAGCGGCCCACTCAACGCCGCCCATA
GCGGCGGTTCATTAACCGACAGTCCACCTTTTTCTCCCTCTGAGATCGCCGACCTCGACTCCAAGGTTTACCGCCCCAACTTTTCTTTGAATAATACCCATTTGAGAGCC
ACCGTGAGTGGGTCCGGCGCCGCCTCTGGAGGCGGCAAGACGTCGGCAGGGGTGGCTGCTGCCATCGCCGCCATCGCTTCCGCCTCTGCCTCTTCTTCCTCCGGCGCCAA
TGAACGTGAAGACATTCCCAAGACCGATGTGGCTATGGCGTCCGCCGCCACGTTAGTCGCCGCTCAATGCGTGGAGGCCGCTGAGGCCATGGGAGCTGAACACGATCACC
TCGCTTCTGTCATTAGCTCCGCCGTGAATGTCAAGTCCGCCGGCGATATAATGACGCTCACGGCAGCCGCCGCAACAGCTTTAAGAGGAGCTGCAACGCTCAAGTCCCGG
GCTCTGAAGGAAGTCTGGAATGCAGCGGCGGTTATTCCTGCGAGAAAGGGATCGGAGCTTCCAGCAGCGGCAGTAGTCATGCGCGGCCAGCTTCAACATGAAGACACTTT
CCTCACCGTCTGCTACAGAGGATTGCTTGCCAATGGCTGTGATCTTCATTGGAAAATCGTCTCCGTTTATATCAACAGAATGAATCAGGTTGTGTTGAAGATGAAGAGCA
GGCACGTGGCTGGGACCATAACCAAAAAGAAAAAGAGTGGGTCTTATTTTTCTGATTTCTTCCTGTCATGGGGATTGACGAAAGCCAAAGCGGGCGCAGAAAAATTTGGT
GGTGGACGTGATCAAAGACATTCCGGCATGGCCCGGCGCCACCTACTGGAGGGCGGAGAGGACCGCCGCTACTTCGGGCTGAAGACGGTGCTGCGTGGGGTGGTGGAGTT
CGAGTGCAGAAACCAGAGAGAGTATGATATGTGGACGCAGGGCGTGTCCAAGCTCCTGGTTATGGCGGCTGATAGGATTTGTCAAAATGGGGACGATGATTTGATGGAGG
GAGGGGAGTGGGATGTCCAGAAAATCAGAAACAAGCACTCTGGGTTTCATTTCACCAGCAAAACGAAACCCCCACCCCCAACAAAACAAGAACGGCATGAAGAGGTTGTT
CCCGCACTTGCATCGCCAAGCTTCTGGGTAGTACTCCAATGGCATACTCAGACCGGGTTGAATTGGGACGTGAACCGGTTTACAGGGCCAGCATCTGCCGCACTTGGATC
TGCACGTCGGTGGCGACGACCCCGGACCGCCCAGAGTGCTCTTCTGGTCCGCCCACCCTCCGCTTCCGGCCTCTACAAACGCCGCCACCGATCGAATCAACCAACCAAGT
TTACGACGGATTCAAAAAGACGAGAGATTAGAGAGTTTGGAGATGGGTTACCTTTGGAGACTGAGGCGGAGGCGGAGGCGGAGCAGAAGAGAAGAGCGAAGGCGGCGAGA
GTGAGAAGAAGAAGAAGAGAGTGA
Protein sequenceShow/hide protein sequence
MSWPSSDAPLNSSLEFLSRSWRVSPSTGHLVNSKIQTGAYGGGGCGDIILEDSGGDAEGFWLSHSSGPLNAAHSGGSLTDSPPFSPSEIADLDSKVYRPNFSLNNTHLRA
TVSGSGAASGGGKTSAGVAAAIAAIASASASSSSGANEREDIPKTDVAMASAATLVAAQCVEAAEAMGAEHDHLASVISSAVNVKSAGDIMTLTAAAATALRGAATLKSR
ALKEVWNAAAVIPARKGSELPAAAVVMRGQLQHEDTFLTVCYRGLLANGCDLHWKIVSVYINRMNQVVLKMKSRHVAGTITKKKKSGSYFSDFFLSWGLTKAKAGAEKFG
GGRDQRHSGMARRHLLEGGEDRRYFGLKTVLRGVVEFECRNQREYDMWTQGVSKLLVMAADRICQNGDDDLMEGGEWDVQKIRNKHSGFHFTSKTKPPPPTKQERHEEVV
PALASPSFWVVLQWHTQTGLNWDVNRFTGPASAALGSARRWRRPRTAQSALLVRPPSASGLYKRRHRSNQPTKFTTDSKRREIREFGDGLPLETEAEAEAEQKRRAKAAR
VRRRRRE