CuGenDBv2

Cla97C08G161380 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C08G161380
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Protein of unknown function (DUF707)
Genome location: Cla97Chr08:27905652..27910714
RNA-Seq Expression: Cla97C08G161380
Synteny: Cla97C08G161380
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR007877 - Protein of unknown function DUF707
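
The genome location above is a `chromosome:start..end` string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; this page does not state the convention), the span of the gene model could be computed as follows. The function name `parse_location` is hypothetical and is not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Cla97Chr08:27905652..27910714")
# Assumes 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 5063 bp on Cla97Chr08, which includes introns as well as the transcribed exons.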


Homology
GenBank top hits (e value, % identity, alignment)
KAA0061058.1 lysine ketoglutarate reductase trans-splicing protein [Cucumis melo var. makuwa] | e value: 1.8e-197 | % identity: 82.96
Query:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP
        SRSPVSR+PNDAL +KLIV AFVGA+ GF IG+SFPVLSLTK                                                      IWVP
Subjt:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP

Query:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
        SNPRGAE+LPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
Subjt:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY

Query:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPFDYIFIWDEDLGLDHFDA+EY+KLVRKYGLEISQPGLQPSKGLTWRMTRRRD SEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR

Query:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS
        EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNG+APWVGVSERCRREWTMLQSRWTIAENAYLNDVED VIPS
Subjt:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS

Query:  HAVMH
        H+VMH
Subjt:  HAVMH

XP_004142963.1 uncharacterized protein LOC101206771 isoform X1 [Cucumis sativus] | e value: 9.0e-197 | % identity: 81.98
Query:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP
        SRSPVSR+PNDAL  KLIV AFVGA+ GF IGLSFPVLSLT+                                                      IWVP
Subjt:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP

Query:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
        SNPRGAE+LPPKI+SSESDLY RRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
Subjt:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY

Query:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPFDYIFIWDEDLGLDHFDA+EY+KLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDP LPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR

Query:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS
        EAWRC+WHMLQNDLVHGWGLDFLLRRCVDPAHEKIGV+DAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWT+LQSRWTIAENAYLNDVED VIPS
Subjt:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS

Query:  HAVMH
        H+VMH
Subjt:  HAVMH

XP_008444374.1 PREDICTED: uncharacterized protein LOC103487723 [Cucumis melo] | e value: 2.1e-198 | % identity: 83.21
Query:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP
        SRSPVSR+PNDAL +KLIV AFVGA+ GF IG+SFPVLSLTK                                                      IWVP
Subjt:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP

Query:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
        SNPRGAE+LPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
Subjt:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY

Query:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPFDYIFIWDEDLGLDHFDA+EY+KLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR

Query:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS
        EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNG+APWVGVSERCRREWTMLQSRWTIAENAYLNDVED VIPS
Subjt:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS

Query:  HAVMH
        H+VMH
Subjt:  HAVMH

XP_031737077.1 uncharacterized protein LOC101206771 isoform X2 [Cucumis sativus] | e value: 2.1e-193 | % identity: 81.7
Query:  RLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVPSNPRGA
        R+PNDAL  KLIV AFVGA+ GF IGLSFPVLSLT+                                                      IWVPSNPRGA
Subjt:  RLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVPSNPRGA

Query:  ESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRFLH
        E+LPPKI+SSESDLY RRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRFLH
Subjt:  ESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRFLH

Query:  PDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWRCI
        PDIVAPFDYIFIWDEDLGLDHFDA+EY+KLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDP LPPCAAFVEIMAPVFSREAWRC+
Subjt:  PDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWRCI

Query:  WHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPSHAVMH
        WHMLQNDLVHGWGLDFLLRRCVDPAHEKIGV+DAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWT+LQSRWTIAENAYLNDVED VIPSH+VMH
Subjt:  WHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPSHAVMH

XP_038884102.1 uncharacterized protein LOC120075031 [Benincasa hispida] | e value: 2.7e-201 | % identity: 84.44
Query:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP
        SRSPVSRLPNDA+NMKLIVAAFVGA+FGFSIGLSFPVLSLTK                                                      IW P
Subjt:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP

Query:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
        SNPRGAESLPPKI+SSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
Subjt:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY

Query:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
        AKRF HPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR

Query:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS
        EAWRC+WHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVED V PS
Subjt:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS

Query:  HAVMH
        H+VMH
Subjt:  HAVMH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LK84 Uncharacterized protein | e value: 4.3e-197 | % identity: 81.98
Query:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP
        SRSPVSR+PNDAL  KLIV AFVGA+ GF IGLSFPVLSLT+                                                      IWVP
Subjt:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP

Query:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
        SNPRGAE+LPPKI+SSESDLY RRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
Subjt:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY

Query:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPFDYIFIWDEDLGLDHFDA+EY+KLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDP LPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR

Query:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS
        EAWRC+WHMLQNDLVHGWGLDFLLRRCVDPAHEKIGV+DAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWT+LQSRWTIAENAYLNDVED VIPS
Subjt:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS

Query:  HAVMH
        H+VMH
Subjt:  HAVMH

A0A1S3B9P8 uncharacterized protein LOC103487723 | e value: 1.0e-198 | % identity: 83.21
Query:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP
        SRSPVSR+PNDAL +KLIV AFVGA+ GF IG+SFPVLSLTK                                                      IWVP
Subjt:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP

Query:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
        SNPRGAE+LPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
Subjt:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY

Query:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPFDYIFIWDEDLGLDHFDA+EY+KLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR

Query:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS
        EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNG+APWVGVSERCRREWTMLQSRWTIAENAYLNDVED VIPS
Subjt:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS

Query:  HAVMH
        H+VMH
Subjt:  HAVMH

A0A5A7V0H9 Lysine ketoglutarate reductase trans-splicing protein | e value: 8.8e-198 | % identity: 82.96
Query:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP
        SRSPVSR+PNDAL +KLIV AFVGA+ GF IG+SFPVLSLTK                                                      IWVP
Subjt:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLTK------------------------------------------------------IWVP

Query:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
        SNPRGAE+LPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
Subjt:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY

Query:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPFDYIFIWDEDLGLDHFDA+EY+KLVRKYGLEISQPGLQPSKGLTWRMTRRRD SEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR

Query:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS
        EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNG+APWVGVSERCRREWTMLQSRWTIAENAYLNDVED VIPS
Subjt:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS

Query:  HAVMH
        H+VMH
Subjt:  HAVMH

A0A6J1GIV6 uncharacterized protein LOC111454667 | e value: 4.8e-188 | % identity: 80
Query:  RSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLT-------------------------------------------------------KIWVP
        RSP SR+PNDA+ MKLIVAAFVG + G SIGLSFPVLSLT                                                       KIW P
Subjt:  RSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLT-------------------------------------------------------KIWVP

Query:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
        SNPRGAESLPPKIVSSESDLYARRLWGNPSEDL TKPQYLVAFTVGYNQRYNIDRAVRKFSD FVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
Subjt:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY

Query:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLV+KYGLEISQPGLQPSKGLTWRMTRRRDGSEVHK+TDERPGWCIDP LPPC++FVEIMAPVFSR
Subjt:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR

Query:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS
        EAWRC+W+MLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGES+NGKA WVGVSERCRREWTMLQSRW  AENAYL+DVED +IPS
Subjt:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS

A0A6J1KUH3 uncharacterized protein LOC111496512 | e value: 1.5e-189 | % identity: 79.75
Query:  RSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLT-------------------------------------------------------KIWVP
        RSPVSR+PN+A+ MKLIVAAFVG + G SIGLSFPVLSLT                                                       KIW P
Subjt:  RSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLT-------------------------------------------------------KIWVP

Query:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
        SNPRGAESLPPKIVSSESDLYARRLWGNPSEDL TKPQYLVAFTVGYNQRYNIDRAVRKFSD FVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY
Subjt:  SNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWY

Query:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHK+TDERPGWCIDP LPPC++FVEIMAPVFSR
Subjt:  AKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR

Query:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS
        EAWRC+W+MLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGES+NGKA WVGVSERCRREWTMLQSRW  AENAYL+DVED +IPS
Subjt:  EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPS

Query:  HAVMH
          V+H
Subjt:  HAVMH

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G08040.1 Protein of unknown function (DUF707) | e value: 1.9e-144 | % identity: 61.56
Query:  RSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSL-------------------------------------TKIWVPSNPRGAESLPPKIVSSES
        RS   R  N+  N KLI+   VG +FGF +G++ P+ S                                       KI+VP+NP GAE LPP I+ +E+
Subjt:  RSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSL-------------------------------------TKIWVPSNPRGAESLPPKIVSSES

Query:  DLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRFLHPDIVAPFDYIFI
        D Y RRLWG PSEDL  KP+YLV FTVG+ QR NI+ AV+KFS++F ILLFHYDGRT+EWD+FEWSK AIH+SA+KQ KWWYAKRFLHPD+V+ ++YIFI
Subjt:  DLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRFLHPDIVAPFDYIFI

Query:  WDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWRCIWHMLQNDLVHGW
        WDEDLG++HF+A+ Y++LV+K+GLEISQPGL+P+ GLTW MT+RR   +VHK+T E+PGWC DP LPPCAAFVEIMAPVFSREAWRC+WHM+QNDLVHGW
Subjt:  WDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWRCIWHMLQNDLVHGW

Query:  GLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYL
        GLDF LRRCV+PAHEKIGV+D+QWI+HQ +PSLGSQGES+ GK+PW GV ERCR EWTM Q+R   A+ AY+
Subjt:  GLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYL

AT1G08040.2 Protein of unknown function (DUF707) | e value: 1.9e-144 | % identity: 61.56
Query:  RSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSL-------------------------------------TKIWVPSNPRGAESLPPKIVSSES
        RS   R  N+  N KLI+   VG +FGF +G++ P+ S                                       KI+VP+NP GAE LPP I+ +E+
Subjt:  RSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSL-------------------------------------TKIWVPSNPRGAESLPPKIVSSES

Query:  DLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRFLHPDIVAPFDYIFI
        D Y RRLWG PSEDL  KP+YLV FTVG+ QR NI+ AV+KFS++F ILLFHYDGRT+EWD+FEWSK AIH+SA+KQ KWWYAKRFLHPD+V+ ++YIFI
Subjt:  DLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRFLHPDIVAPFDYIFI

Query:  WDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWRCIWHMLQNDLVHGW
        WDEDLG++HF+A+ Y++LV+K+GLEISQPGL+P+ GLTW MT+RR   +VHK+T E+PGWC DP LPPCAAFVEIMAPVFSREAWRC+WHM+QNDLVHGW
Subjt:  WDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWRCIWHMLQNDLVHGW

Query:  GLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYL
        GLDF LRRCV+PAHEKIGV+D+QWI+HQ +PSLGSQGES+ GK+PW GV ERCR EWTM Q+R   A+ AY+
Subjt:  GLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYL

AT1G67850.1 Protein of unknown function (DUF707) | e value: 1.3e-148 | % identity: 61.28
Query:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLT--------------------------------------------------KIWVPSNPR
        SRS +SR   D   MK+I  AF G  FGF IG+SFP LS+T                                                  KIWVPSNPR
Subjt:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLT--------------------------------------------------KIWVPSNPR

Query:  GAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRF
        GAE LPP +V++ESD Y RRLWG P EDL ++P+YL  FTVG NQ+ NID  V+KFS+NF I+LFHYDGR +EWDEFEWSK AIH+S RKQ KWWYAKRF
Subjt:  GAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRF

Query:  LHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWR
        LHPDIVA +DYIF+WDEDLG++HF+AEEY+K+V+K+GLEISQPGL+P++GLTW+MT+RR   EVHK T+ERPGWC DP LPPCAAFVEIMAPVFSR AWR
Subjt:  LHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWR

Query:  CIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVE
        C+WH++QNDLVHGWGLDF LRRCV+PAHEKIGV+D+QW+VHQ  PSLG+QGE+ +GKAPW GV +RC++EWTM QSR   AE  Y   ++
Subjt:  CIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVE

AT1G67850.2 Protein of unknown function (DUF707) | e value: 1.3e-148 | % identity: 61.28
Query:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLT--------------------------------------------------KIWVPSNPR
        SRS +SR   D   MK+I  AF G  FGF IG+SFP LS+T                                                  KIWVPSNPR
Subjt:  SRSPVSRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSLT--------------------------------------------------KIWVPSNPR

Query:  GAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRF
        GAE LPP +V++ESD Y RRLWG P EDL ++P+YL  FTVG NQ+ NID  V+KFS+NF I+LFHYDGR +EWDEFEWSK AIH+S RKQ KWWYAKRF
Subjt:  GAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRF

Query:  LHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWR
        LHPDIVA +DYIF+WDEDLG++HF+AEEY+K+V+K+GLEISQPGL+P++GLTW+MT+RR   EVHK T+ERPGWC DP LPPCAAFVEIMAPVFSR AWR
Subjt:  LHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWR

Query:  CIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVE
        C+WH++QNDLVHGWGLDF LRRCV+PAHEKIGV+D+QW+VHQ  PSLG+QGE+ +GKAPW GV +RC++EWTM QSR   AE  Y   ++
Subjt:  CIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVE

AT3G26440.4 Protein of unknown function (DUF707) | e value: 8.5e-145 | % identity: 65.53
Query:  SRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSL----------------TKIWVPSNPRGAESLPPKIVSSESDLYARRLWGNPSEDL-TTKPQYLVA
        S+ PND   M L+++ F G + GF +G+SFP LSL                TKIWVPSNPRGAE LPP  V++ESD Y RRLWG P +DL   KP+YLVA
Subjt:  SRLPNDALNMKLIVAAFVGALFGFSIGLSFPVLSL----------------TKIWVPSNPRGAESLPPKIVSSESDLYARRLWGNPSEDL-TTKPQYLVA

Query:  FTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGL
        FTV Y QR NID  V+KFSDNF I+LFHYDG+TSE+DEFEWSKRAIHVS  KQ KWWYAKRFLHPDI+AP++YIFIWDEDLG+++FDAEEYIK+V K+GL
Subjt:  FTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHVSARKQAKWWYAKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGL

Query:  EISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQW
        EISQP ++  K +TW++T+R  G EVHK+ +E+PG C DP LPPCA F+EIMAPVFSREAWRC+WHM+QNDLVHGWGLDF LR+CV+PAHEKIGV+D+QW
Subjt:  EISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSREAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQW

Query:  IVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDV
        I+HQ +PSLGSQG +Q GK  + GV ERC+REWTM Q R T +E  YL ++
Subjt:  IVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDV
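
In the alignments above, the unlabeled line between `Query:` and `Subjt:` follows the usual BLAST protein convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or a gap. A minimal sketch of tallying one alignment block this way is given below. `midline_stats` is a hypothetical helper, and counts taken from a single displayed block will not generally reproduce the % identity reported for the whole hit.

```python
def midline_stats(query, midline):
    """Count identities and positives in one BLAST alignment block."""
    # Trailing spaces of the midline are often lost; pad to the query length.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Final block of the first GenBank hit (KAA0061058.1).
identities, positives, columns = midline_stats("HAVMH", "H+VMH")
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

For that five-column block the tally is 4 identical and 5 positive positions.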


Sequences
CDS sequence
ATGATCGCAGTTAAGTTAACTTTGAAGACCTCGTTGACTGTTTACCGGGGGGCTTCGAAGAAAGGTCTTCCCTGGAAAACTCCCGAAGGAAATTTGTTTTCTTTGAGATT
TCAAGATTTGAACGTGATCCACTTCCAGATCTTGCTCCCATTATCTGGGTTTCTTAGTCTCGCTCAACCAACTTTATGTGAATTTGGGTTGGCGTCCTACACTCTCAATC
ACGGCAGAGGTTTGGATGTGAATATGGGCAGGTCACGCAGTCCTGTCAGTAGACTTCCCAATGACGCCTTGAACATGAAGCTTATCGTGGCTGCATTTGTGGGAGCACTA
TTTGGCTTCTCCATTGGACTGTCCTTTCCAGTGCTGTCATTAACTAAAATTTGGGTCCCATCAAATCCTAGAGGTGCAGAAAGTCTTCCTCCAAAAATTGTTTCTTCTGA
ATCAGATTTGTATGCACGTAGACTGTGGGGCAATCCTAGTGAGGACCTGACCACCAAGCCACAGTATCTTGTAGCTTTTACAGTTGGTTACAACCAAAGATACAATATCG
ATAGAGCAGTCAGAAAGTTTTCAGATAATTTCGTTATCCTTCTCTTTCATTATGATGGACGAACATCAGAATGGGATGAATTTGAATGGTCAAAGCGGGCTATTCATGTT
AGTGCTCGTAAGCAAGCCAAGTGGTGGTATGCCAAACGATTTTTGCATCCTGACATTGTTGCACCATTTGACTACATTTTTATTTGGGATGAAGACCTTGGATTGGATCA
TTTTGATGCCGAAGAATATATAAAATTGGTAAGGAAGTATGGGTTAGAGATTTCACAGCCTGGTTTGCAACCAAGTAAAGGTTTAACGTGGCGAATGACAAGGAGAAGAG
ATGGTAGTGAAGTTCACAAAGATACGGATGAAAGACCAGGATGGTGCATTGATCCTCTTTTGCCTCCCTGTGCTGCTTTTGTTGAGATTATGGCTCCAGTTTTTTCCCGA
GAGGCATGGCGTTGTATTTGGCATATGTTACAGAATGATCTGGTCCATGGTTGGGGTCTTGACTTCCTCCTCAGAAGATGTGTAGATCCTGCACATGAAAAGATAGGAGT
TATTGATGCACAGTGGATTGTTCACCAAGGTTTGCCTTCACTAGGAAGCCAGGGAGAATCACAAAATGGAAAGGCACCATGGGTAGGGGTGAGTGAACGATGTCGAAGGG
AATGGACAATGCTTCAGAGTCGGTGGACAATTGCCGAGAATGCTTACTTAAATGATGTTGAGGACAGGGTCATCCCTTCGCATGCAGTAATGCATTAG
mRNA sequence
GAAAAATTAATGATCGCAGTTAAGTTAACTTTGAAGACCTCGTTGACTGTTTACCGGGGGGCTTCGAAGAAAGGTCTTCCCTGGAAAACTCCCGAAGGAAATTTGTTTTC
TTTGAGATTTCAAGATTTGAACGTGATCCACTTCCAGATCTTGCTCCCATTATCTGGGTTTCTTAGTCTCGCTCAACCAACTTTATGTGAATTTGGGTTGGCGTCCTACA
CTCTCAATCACGGCAGAGGTTTGGATGTGAATATGGGCAGGTCACGCAGTCCTGTCAGTAGACTTCCCAATGACGCCTTGAACATGAAGCTTATCGTGGCTGCATTTGTG
GGAGCACTATTTGGCTTCTCCATTGGACTGTCCTTTCCAGTGCTGTCATTAACTAAAATTTGGGTCCCATCAAATCCTAGAGGTGCAGAAAGTCTTCCTCCAAAAATTGT
TTCTTCTGAATCAGATTTGTATGCACGTAGACTGTGGGGCAATCCTAGTGAGGACCTGACCACCAAGCCACAGTATCTTGTAGCTTTTACAGTTGGTTACAACCAAAGAT
ACAATATCGATAGAGCAGTCAGAAAGTTTTCAGATAATTTCGTTATCCTTCTCTTTCATTATGATGGACGAACATCAGAATGGGATGAATTTGAATGGTCAAAGCGGGCT
ATTCATGTTAGTGCTCGTAAGCAAGCCAAGTGGTGGTATGCCAAACGATTTTTGCATCCTGACATTGTTGCACCATTTGACTACATTTTTATTTGGGATGAAGACCTTGG
ATTGGATCATTTTGATGCCGAAGAATATATAAAATTGGTAAGGAAGTATGGGTTAGAGATTTCACAGCCTGGTTTGCAACCAAGTAAAGGTTTAACGTGGCGAATGACAA
GGAGAAGAGATGGTAGTGAAGTTCACAAAGATACGGATGAAAGACCAGGATGGTGCATTGATCCTCTTTTGCCTCCCTGTGCTGCTTTTGTTGAGATTATGGCTCCAGTT
TTTTCCCGAGAGGCATGGCGTTGTATTTGGCATATGTTACAGAATGATCTGGTCCATGGTTGGGGTCTTGACTTCCTCCTCAGAAGATGTGTAGATCCTGCACATGAAAA
GATAGGAGTTATTGATGCACAGTGGATTGTTCACCAAGGTTTGCCTTCACTAGGAAGCCAGGGAGAATCACAAAATGGAAAGGCACCATGGGTAGGGGTGAGTGAACGAT
GTCGAAGGGAATGGACAATGCTTCAGAGTCGGTGGACAATTGCCGAGAATGCTTACTTAAATGATGTTGAGGACAGGGTCATCCCTTCGCATGCAGTAATGCATTAGAGA
ACAGAGCTAATGACCCTTTGATTTGTACGATAGTACATATATACTAAGCCAAGATTTAACTTCGTAAGGTGATATGAGAGAGCTAAGCTCTTAAGTGATTACACTCATTC
GATTTTTTATTTGTACAATTCAATTCAAATTCCTTCATTTTGTAACATTATCACTAAGAATACACCTCTCAGAGGCTTGTGCTATTTGAACTTTGCTGTAACGTGATAGA
CTAGCCAACAGCAGAAAACTTGAATCTAGATTGACAAGGTCTATATGCAGCTCAACTCTGATTTAAATGACTCAATTATTTTATGTGAGACTATTGTTTGTTTTGGTACC
TTAAGCTTCCCTGTGAGACTAAAACATGACAATAGTGGTGAGCTACAGTCCTGTAGGAAACAACAGCGAAGATGATTACCATAAAGAATGAGCTGTGTCATCAGCAAGGC
AAATAACTTGGCAACGGCTGCAGCCAGAATAGATCTTACATATGCACACGGAGACAGGAGGGAATGGAGCTTGCACCGGGAGGAGCTGGCATAGGTGGTCATAGGGCAGA
GGTAAATTGTTCTTCATTGAAGCAGTTGCTGGTTGTCCTGTGGTAACGTGCTCTGATAGAACAGAATCTCACTTTCCCTGCTTTGAGAATATATATTTGGAAATCTGCTT
TATATTCTCCATTAAAGGCGCGTTTCAAAATTCAAACTCTGCATTTGACACAGAACATGCTGCAAATTGAACATTAGAATACGTTGTTTTGATAGAATCATAAAGCTTAT
GATAATGATGTCTTTCACCCATGCCTTCTGAGTTTTAACCAAGTAATGTCCTAGTTTTATCAGGGATTACAGTGGTAGTGAGCAAACAGAAGGAAGATAAGTTAATTCTC
TTCTCTCTCTGTGAAAACCATTGAGGTCTCACCTTTCTTTATCACTTTTGGTAAATGGTTCAAACCACTATAGAGTATGTTATCATCTTTGAAAAGGCAATCCCCTCTAC
TTGATGGTCCTCTTATCTAAATAATCTTAATTCTACACCCTGCTGCTTGTGTAGTCAAGTTATGAATAGAATGTTTTTTTTTCTTTTCCTTTTTTTTTAAATGAATGAAT
GAATGAATTATACTTTTTCTTAAAAGAAGCAAAACACGATTTTGATATTAA
Protein sequence
MIAVKLTLKTSLTVYRGASKKGLPWKTPEGNLFSLRFQDLNVIHFQILLPLSGFLSLAQPTLCEFGLASYTLNHGRGLDVNMGRSRSPVSRLPNDALNMKLIVAAFVGAL
FGFSIGLSFPVLSLTKIWVPSNPRGAESLPPKIVSSESDLYARRLWGNPSEDLTTKPQYLVAFTVGYNQRYNIDRAVRKFSDNFVILLFHYDGRTSEWDEFEWSKRAIHV
SARKQAKWWYAKRFLHPDIVAPFDYIFIWDEDLGLDHFDAEEYIKLVRKYGLEISQPGLQPSKGLTWRMTRRRDGSEVHKDTDERPGWCIDPLLPPCAAFVEIMAPVFSR
EAWRCIWHMLQNDLVHGWGLDFLLRRCVDPAHEKIGVIDAQWIVHQGLPSLGSQGESQNGKAPWVGVSERCRREWTMLQSRWTIAENAYLNDVEDRVIPSHAVMH
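
The protein sequence above appears to be the CDS translated with the standard genetic code. A minimal sketch of that translation follows, applied only to the first seven and last five codons of the CDS shown on this page. `translate` is a hypothetical helper, and the compact codon table is the standard code (NCBI translation table 1) written in TCAG order.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        first, second, third = (BASES.index(base) for base in cds[i:i + 3].upper())
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)

print(translate("ATGATCGCAGTTAAGTTAACT"))  # first seven codons of the CDS
print(translate("GCAGTAATGCATTAG"))        # last five codons, including the stop
```

The two fragments translate to `MIAVKLT` and `AVMH*`, which match the start and end of the protein sequence listed above. The sketch does not handle ambiguous bases such as `N`.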