; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003818 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003818
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4149 domain-containing protein
Genome locationscaffold4:49309388..49312012
RNA-Seq ExpressionSpg003818
SyntenySpg003818
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR025423 - Domain of unknown function DUF4149


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578300.1 Transmembrane protein 205, partial [Cucurbita argyrosperma subsp. sororia]1.0e-16071.71Show/hide
Query:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG
        MTNLFA CL++TSLTAAGLWSPSPAS+QDHEQDVIVKEGHRVVVVEYGDQ QHNTKVSISSEP++DAS SN LH+SLN G P+EDS+RHRTRDLICDA G
Subjt:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG

Query:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSH---------------------------------LVIEKTRDLKETLKEA-GED
        KCKHKIASAV KAKVMVSET             AFDEAK++VSDKSH                                  V+EK RDLKE  K+   E 
Subjt:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSH---------------------------------LVIEKTRDLKETLKEA-GED

Query:  VMEDAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV
           D +  A+EKGREAR+TAE IK GGNK+KE L+   D G+KLVN SFRYL S+ES KA M+VLSLLGF MALGM VWTTFISSYVLASALPRQQLAVV
Subjt:  VMEDAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV

Query:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT
        QSKIYP+YFRAMASSIGMALFGHLFS TKWM      A+VVQGYVLVAALL IFANSLYMEPQATKVMFERLK+EKEEG+GIEDI AEPRH ND+PPAIT
Subjt:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT

Query:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHI
        TST        ATQVV RE VKSRIVGLN++LKKLNSYSSLLNLLTLMALTWHLVYLSQR  +
Subjt:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHI

KAG7015871.1 Transmembrane protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-16071.4Show/hide
Query:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG
        M NLFA CL++TSLTAAGLWSPSPAS+QDHEQDVIVKEGHRVVVVEYGDQ QHNTKVSISSEP++DAS SNPLH+SLN G P+EDS+RHRTRDLICDA G
Subjt:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG

Query:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSH---------------------------------LVIEKTRDLKETLKEA-GED
        KCKHKIASAV KAKVMVSET             AFDEAK++VSDKSH                                  V+EK RDLKE  K+   E 
Subjt:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSH---------------------------------LVIEKTRDLKETLKEA-GED

Query:  VMEDAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV
           D +  A+EKGREAR+TAE IK GGNK+KE L+   D G+KLVN SFRYL S+ES KA M+VLSLLGF MALGM VWTTFISSYVLASALPRQQLAVV
Subjt:  VMEDAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV

Query:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT
        QSKIYP+YFRAMASSIGMALFGHLFS TKWM      A+VVQGYVLVAALL IFANSLYMEPQATKVMFERLK+EKEEG+GIEDI AEPR  +D+PPA+T
Subjt:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT

Query:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC
        TST        ATQVV RE VKSRIVGLN++LKKLNS SSLLNLLTLMALTWHLVYLSQRL IPC
Subjt:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC

XP_022938890.1 uncharacterized protein LOC111444965 [Cucurbita moschata]5.5e-16271.83Show/hide
Query:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG
        M NLFA CL++TSLTAAGLWSPSPAS+QDHEQDVIVKEGHRVVVVEYGDQ QHNTKVSISSEP++DAS SNPLH+SLN G P+EDS+RHRTRDLICDA G
Subjt:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG

Query:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSH---------------------------------LVIEKTRDLKETLKEA-GED
        KCKHKIASAV KAKVMVSET             AFDEAK++VSDKSH                                  V+EK RDLKE  K+   E 
Subjt:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSH---------------------------------LVIEKTRDLKETLKEA-GED

Query:  VMEDAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV
           D +  A+EKGREAR+TAE IK GGNK+KE L+   D G+KLVN SFRYL S+ES KA M+VLSLLGF MALGM VWTTFISSYVLASALPRQQLAVV
Subjt:  VMEDAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV

Query:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT
        QSKIYP+YFRAMASSIGMALFGHLFS TKWM      A+VVQGYVLVAALL IFANSLYMEPQATKVMFERLK+EKEEG+GIEDI AEPR  ND+PPA+T
Subjt:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT

Query:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC
        TST        ATQVV RE VKSRIVGLN++LKKLNSYSSLLNLLTLMALTWHLVYLSQRL IPC
Subjt:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC

XP_022992882.1 uncharacterized protein LOC111489083 [Cucurbita maxima]2.2e-16372.69Show/hide
Query:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG
        M NLFA CL++TSLTAAGLWSPSPAS+ DHEQDVIVKEGHRVVVVEYGDQ QHNTKVSISSEP++DAS SNPLH+SLN G P+EDS+RHRTRDLICDA G
Subjt:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG

Query:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSHLV--------------IEKTRD--------LKET-------LKEAGEDVMEDA
        KCKHKIASAV KAKVMVSET             AFDEAK++VSDKSH V              +EK R+         KET       LKE  +DV+++ 
Subjt:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSHLV--------------IEKTRD--------LKET-------LKEAGEDVMEDA

Query:  ERL-----AVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV
        +       A+EKGREAR+TAE IK GGNK+KE LI     G+KLVN SFRYLGS+ES KA M+VLSLLGF+MALGM VWTTFISSYVLASALPRQQLAVV
Subjt:  ERL-----AVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV

Query:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT
        QSKIYP+YFRAMASSIGMALFGHLFS TKWM      A+VVQGYVLVAALLTIFANSLYMEPQATKVMFERLK+EKEEG+GIEDI AEPRH ND+PPAIT
Subjt:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT

Query:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC
        TST        ATQVV+RE VKSRIVGLN++LKKLNSYSSLLNLLTLMALTWHLVYLSQRL IPC
Subjt:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC

XP_023550065.1 uncharacterized protein LOC111808367 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-16170.06Show/hide
Query:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG
        M NLFA CL++TSLTAAGLWSPSPAS+QDHEQDVIVKEGHRVVVVEYGDQ QHNTKVSISSEP++DAS SNPLH+SLN G P+EDS+RHRTRDLICDA G
Subjt:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG

Query:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSH---------------------------------LVIEKTRDLKE---------
        KCKHKIASAV KAKVMVSET             AFDEAK++VSDKSH                                  V+EK RDLKE         
Subjt:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSH---------------------------------LVIEKTRDLKE---------

Query:  ---TLKEAGEDVME-----DAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFIS
            LKE  +DV++     D +  A+EKGREAR+TAE IK GGNK+KE L+   D G+KLVN SFRYL S+ES KA M+VLSLLGF MALGM VWTTFIS
Subjt:  ---TLKEAGEDVME-----DAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFIS

Query:  SYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIED
        SYVLASALPRQQLAVVQSKIYP+YFRAMASSIGMALFGHLFS TKWM      A+VVQGYVLVAALL IFANSLYMEPQATKVMFERLK+EKEEG+GIED
Subjt:  SYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIED

Query:  IIAEPRHVNDSPPAITTSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC
        I AEPR  ND+PPA+TTST        ATQVV RE VKSRIVGLN++LKKLNSYSSLLNLLTLMALTWHLVYLSQRL IPC
Subjt:  IIAEPRHVNDSPPAITTSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC

TrEMBL top hitse value%identityAlignment
A0A1S3CA56 uncharacterized protein LOC1034986951.7e-12160.77Show/hide
Query:  MYFTL--DLFDTLNLNTNMTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRP
        MYF L  +LF TL+   NMTNLFA  LI+T+LTAAGLWSP P       Q+VIVKEGHRVVVVEY DQ QHNTKVSISSEP  DA               
Subjt:  MYFTL--DLFDTLNLNTNMTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRP

Query:  SEDSKRHRTRDLICDAFGKCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSHLVIEKTRD--------LKETLKEAGEDVMEDA-ER
         ++S+RHRT+DLICD +GKCKHK+ASAVEKAKVMV+ET             AFDEAKD + + +     + +D         KET  EA + + E A E 
Subjt:  SEDSKRHRTRDLICDAFGKCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSHLVIEKTRD--------LKETLKEAGEDVMEDA-ER

Query:  LAVEKGRE----------ARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQL
        L + K RE          A+ T E I+ G NKLKE L+  +DRG K++N  FR+LG        M+ L LLGFAMALGM VW TFISSYVLAS LPRQQL
Subjt:  LAVEKGRE----------ARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQL

Query:  AVVQSKIYPVYFRAMASSIGMALFGHLFSHTKW-----MKAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPR-HVNDSP
         VVQSKIYPVYF+AMAS IGMAL GHLFS T+W       ++VVQGYVLVAALL IFANSLYMEP+ATKVMFERLKIEKEEGRGIEDI  E   +V D+ 
Subjt:  AVVQSKIYPVYFRAMASSIGMALFGHLFSHTKW-----MKAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPR-HVNDSP

Query:  PAITTSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC
        PAIT+ST         TQ+V+REVVKSRIVGLN++LKKLNSYSSLLNLLTLMALTWHLVYLSQRL  PC
Subjt:  PAITTSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC

A0A5D3BMZ0 DUF4149 domain-containing protein4.3e-12061.83Show/hide
Query:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG
        MTNLFA  LIVT+LTAAGLWSP P       Q+VIVKEGHRVVVVEY DQ QHNTKVSISSEP  DA                + S+RHRT+DLICD +G
Subjt:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG

Query:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSHLVIEKTRD-----LKETLKEAGEDVMEDA-ERLAVEKGRE----------ARR
        KCKHK+ASAVEKAKVMV+ET             AFDEAKD + + +    ++ +D      KET  EA + + E A E L + K RE          A+ 
Subjt:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSHLVIEKTRD-----LKETLKEAGEDVMEDA-ERLAVEKGRE----------ARR

Query:  TAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGM
        T E I+ G NKLKE L+  +DRG K++N  FR+LG        M+ L LLGFAMALGM VW TFISSYVLAS LPRQQL VVQSKIYPVYF+AMAS IGM
Subjt:  TAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGM

Query:  ALFGHLFSHTKW-----MKAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPR-HVNDSPPAITTSTAAMSTPLHATQVVN
        AL GHLFS T+W       ++VVQGYVLVAALL IFANSLYMEP+ATKVMFERLKIEKEEGRGIEDI  E   +V D+ PAIT+ST         TQ+V+
Subjt:  ALFGHLFSHTKW-----MKAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPR-HVNDSPPAITTSTAAMSTPLHATQVVN

Query:  REVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC
        REVVKSRIVGLN++LKKLNSYSSLLNLLTLMALTWHLVYLSQRL  PC
Subjt:  REVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC

A0A6J1DI19 uncharacterized protein LOC1110202466.1e-14370.2Show/hide
Query:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG
        MTNLFA C+IVTSLTAAGLWSPSPAS+QDHEQDVIVKEGHRVVVVEYGDQ + +TKVSIS EP    S+ NPLHN   +     +++RHR+RDLICDAFG
Subjt:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG

Query:  KCKHKIASAVEKAKVMVSETRV--------AFDEAKDSVSDKSHLV---IEKTRD--------LKETLKEAGEDVMEDAERLAVE-KGR-------EARR
        KCKHKIA+AV KAKVMVSET          AFDEAKD+VSDKSH V   +EK R+         K+T+     D+ E A+  A E KGR         R+
Subjt:  KCKHKIASAVEKAKVMVSETRV--------AFDEAKDSVSDKSHLV---IEKTRD--------LKETLKEAGEDVMEDAERLAVE-KGR-------EARR

Query:  TAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGM
        TAE IKAGGNK    L++ L RGVKLV GS RYLGS ESL++ M VLSLLGFAMALGM VWTTFISSYVLA ALPRQQL VVQSKIYPVYFRAMASSIGM
Subjt:  TAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGM

Query:  ALFGHLFSHTK-WMKAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAITTSTAAMSTPLHATQVVNREVVK
        AL GHLFSH+  + KAQVVQGYVL+AALL  FAN +YMEPQATKVMFERLK+EKEEGRGIEDI A+        PAITTS AAMS  L AT+VV+REVVK
Subjt:  ALFGHLFSHTK-WMKAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAITTSTAAMSTPLHATQVVNREVVK

Query:  SRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC
        SRIVGLN++LKKLNSYSSLLNLLTLMALTWHLVYLSQRL I C
Subjt:  SRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC

A0A6J1FL32 uncharacterized protein LOC1114449652.6e-16271.83Show/hide
Query:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG
        M NLFA CL++TSLTAAGLWSPSPAS+QDHEQDVIVKEGHRVVVVEYGDQ QHNTKVSISSEP++DAS SNPLH+SLN G P+EDS+RHRTRDLICDA G
Subjt:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG

Query:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSH---------------------------------LVIEKTRDLKETLKEA-GED
        KCKHKIASAV KAKVMVSET             AFDEAK++VSDKSH                                  V+EK RDLKE  K+   E 
Subjt:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSH---------------------------------LVIEKTRDLKETLKEA-GED

Query:  VMEDAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV
           D +  A+EKGREAR+TAE IK GGNK+KE L+   D G+KLVN SFRYL S+ES KA M+VLSLLGF MALGM VWTTFISSYVLASALPRQQLAVV
Subjt:  VMEDAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV

Query:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT
        QSKIYP+YFRAMASSIGMALFGHLFS TKWM      A+VVQGYVLVAALL IFANSLYMEPQATKVMFERLK+EKEEG+GIEDI AEPR  ND+PPA+T
Subjt:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT

Query:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC
        TST        ATQVV RE VKSRIVGLN++LKKLNSYSSLLNLLTLMALTWHLVYLSQRL IPC
Subjt:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC

A0A6J1JR64 uncharacterized protein LOC1114890831.1e-16372.69Show/hide
Query:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG
        M NLFA CL++TSLTAAGLWSPSPAS+ DHEQDVIVKEGHRVVVVEYGDQ QHNTKVSISSEP++DAS SNPLH+SLN G P+EDS+RHRTRDLICDA G
Subjt:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDLICDAFG

Query:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSHLV--------------IEKTRD--------LKET-------LKEAGEDVMEDA
        KCKHKIASAV KAKVMVSET             AFDEAK++VSDKSH V              +EK R+         KET       LKE  +DV+++ 
Subjt:  KCKHKIASAVEKAKVMVSETRV-----------AFDEAKDSVSDKSHLV--------------IEKTRD--------LKET-------LKEAGEDVMEDA

Query:  ERL-----AVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV
        +       A+EKGREAR+TAE IK GGNK+KE LI     G+KLVN SFRYLGS+ES KA M+VLSLLGF+MALGM VWTTFISSYVLASALPRQQLAVV
Subjt:  ERL-----AVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVV

Query:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT
        QSKIYP+YFRAMASSIGMALFGHLFS TKWM      A+VVQGYVLVAALLTIFANSLYMEPQATKVMFERLK+EKEEG+GIEDI AEPRH ND+PPAIT
Subjt:  QSKIYPVYFRAMASSIGMALFGHLFSHTKWM-----KAQVVQGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAIT

Query:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC
        TST        ATQVV+RE VKSRIVGLN++LKKLNSYSSLLNLLTLMALTWHLVYLSQRL IPC
Subjt:  TSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPC

SwissProt top hitse value%identityAlignment
A1L2F6 Transmembrane protein 2052.3e-0631.09Show/hide
Query:  MEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFGHLFSHTK----WMKAQVVQGYVLVAALLTIFANSLYMEP
        ++VL LL  +   GM VW +FI+ +VL S +      +VQSK++PVYF  +     ++L  +   H +    W +   +Q  +   A++    N+ +  P
Subjt:  MEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFGHLFSHTK----WMKAQVVQGYVLVAALLTIFANSLYMEP

Query:  QATKVMFERLKIEKEEGRG
         AT+ M    +IEKE G G
Subjt:  QATKVMFERLKIEKEEGRG

Q32L10 Transmembrane protein 2053.6e-0734.71Show/hide
Query:  EVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFG--HLFSHTKWMKA---QVVQGYVLVAALLTIFANSLYMEP
        +V+ LL  + A GM +W TFIS +VL   LPR    +VQSK++P YF     S+G A      L S   W +    +  Q ++L+ +L     N+ ++E 
Subjt:  EVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFG--HLFSHTKWMKA---QVVQGYVLVAALLTIFANSLYMEP

Query:  QATKVMFERLKIEKEEGRGIE
        + T  M+    +EKE G G E
Subjt:  QATKVMFERLKIEKEEGRGIE

Q5REM8 Transmembrane protein 2059.5e-0833.07Show/hide
Query:  SLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFG--HLFSHTKWMKA---QVVQGYVLVAALLTIFAN
        +L   ++++ LL  + A GM +W TF+S ++L  +LPR    +VQSK++P YF     S+G A      L S   W +    +  Q Y+L  +L     N
Subjt:  SLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFG--HLFSHTKWMKA---QVVQGYVLVAALLTIFAN

Query:  SLYMEPQATKVMFERLKIEKEEGRGIE
        + ++EP+ T  M+    +EKE G G E
Subjt:  SLYMEPQATKVMFERLKIEKEEGRGIE

Q6UW68 Transmembrane protein 2057.2e-0833.07Show/hide
Query:  SLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFG--HLFSHTKWMKA---QVVQGYVLVAALLTIFAN
        +L   ++++ LL  + A GM +W TF+S ++L  +LPR    +VQSK++P YF     S+G A      L S   W +    +  Q Y+L  +L     N
Subjt:  SLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFG--HLFSHTKWMKA---QVVQGYVLVAALLTIFAN

Query:  SLYMEPQATKVMFERLKIEKEEGRGIE
        + ++EP+ T  M+    +EKE G G E
Subjt:  SLYMEPQATKVMFERLKIEKEEGRGIE

Q91XE8 Transmembrane protein 2052.5e-0836.89Show/hide
Query:  MEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFG--HLFSHTKWMKA---QVVQGYVLVAALLTIFANSLYME
        ++V+ LL  + A GM VW TFIS ++L  +LPR    +VQSK++PVYF     S+G A      L     W+     +V Q  +L+ +L     N+ ++E
Subjt:  MEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFG--HLFSHTKWMKA---QVVQGYVLVAALLTIFANSLYME

Query:  PQATKVMFERLKIEKEEGRGIE
         + T VM     IEKE G G E
Subjt:  PQATKVMFERLKIEKEEGRGIE

Arabidopsis top hitse value%identityAlignment
AT1G22600.1 Late embryogenesis abundant protein (LEA) family protein8.1e-4732.4Show/hide
Query:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSIS------SEPSQDASR--SNPLHNSLNTGRPSEDSKRHRT-
        MT L A  L++++L   G  + S       E D+IV++GHRVVVVEY    + NT+V IS       + S+D      N    + ++    E+ + H T 
Subjt:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSIS------SEPSQDASR--SNPLHNSLNTGRPSEDSKRHRT-

Query:  RDLICDAFGKCKHKIASAVEKAKVMVSETRVAFDEAKDSVSDKS--HLVIEKTRDLKETLKEAGEDVMEDAERLAVEKGREARRTAENIKAGGNKLKEIL
         +LICDA GKCKHK+ + + + K          D     +SD++    V  +  +++E +     +        A +K    ++  E ++          
Subjt:  RDLICDAFGKCKHKIASAVEKAKVMVSETRVAFDEAKDSVSDKS--HLVIEKTRDLKETLKEAGEDVMEDAERLAVEKGREARRTAENIKAGGNKLKEIL

Query:  ILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFGHLFSHTKWMKAQ
                  V G    + +   L     V+ ++G A A GMCVW TF+S YVLAS L  QQ  VVQSK+YPVYF+A++  I + L GH+    + +   
Subjt:  ILTLDRGVKLVNGSFRYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFGHLFSHTKWMKAQ

Query:  VV---QGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAITTSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLN
         V   Q   L++++L + AN+ ++  +ATK MFE +K EKE+GRG +                + S          T+  + +VVK R+  L+ +++KLN
Subjt:  VV---QGYVLVAALLTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAITTSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLN

Query:  SYSSLLNLLTLMALTWHLVYLSQRLHIPC
        +YSS LNLLTLM+LTWH VYL  RL + C
Subjt:  SYSSLLNLLTLMALTWHLVYLSQRLHIPC

AT1G72100.1 late embryogenesis abundant domain-containing protein / LEA domain-containing protein1.5e-5333.79Show/hide
Query:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDH-----EQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTG---------------
        MTNL A CL++++L AA +WSPSPA    +     E +VIVK+GH VVVVEY    + NT+VSI S PS D          + T                
Subjt:  MTNLFAFCLIVTSLTAAGLWSPSPASQQDH-----EQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTG---------------

Query:  ------------RPSEDSKRHRTR-DLICDAFGKCKHKIASAVEKAK------------------------VMVSETRVAFD------------------
                       E    H T  ++ICDAFGKC+ KIAS V +AK                        V  + T  A D                  
Subjt:  ------------RPSEDSKRHRTR-DLICDAFGKCKHKIASAVEKAK------------------------VMVSETRVAFD------------------

Query:  --------------EAKDSVSDKSH----LVIEKTRDLKETLKEAGEDVMEDAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYL
                      +AK+SV+DK+H     V +K  D KE ++E   DV E   + A E    A+   + ++    +LKE          + V    R  
Subjt:  --------------EAKDSVSDKSH----LVIEKTRDLKETLKEAGEDVMEDAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSFRYL

Query:  GSME----SLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFGHLFSHTKWM---KAQVVQGYVLVAAL
        GS      S      ++ L G A A G  VW TF+SSYVLAS L RQQ  VVQSK+YPVYF+A +  I + LFGH+ S  + +     ++ QG  L+++ 
Subjt:  GSME----SLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFGHLFSHTKWM---KAQVVQGYVLVAAL

Query:  LTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAITTSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMAL
          I AN  ++EP+ATK MFER+K EKEEGRG E                                 + + ++ ++  L+ +L KLN+YSS LN+LTLM+L
Subjt:  LTIFANSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAITTSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMAL

Query:  TWHLVYLSQRLHIPC
        TWH VYL QRL   C
Subjt:  TWHLVYLSQRLHIPC

AT3G62580.1 Late embryogenesis abundant protein (LEA) family protein2.1e-1026.58Show/hide
Query:  ILILTLDRGVKLVNGSFRYLGSMES---LKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMAS--SIGMALFGHLFSH
        +++ +L  GV     +F  L + E+   +   +++  LL FA A G  +W TFI   ++   LPR Q   +QSK++P YF  + S  +I ++ FG+L   
Subjt:  ILILTLDRGVKLVNGSFRYLGSMES---LKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMAS--SIGMALFGHLFSH

Query:  TKWMKAQVVQGYVLVAALLTIFA----NSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAITTSTAAMSTPLHATQVVNREVVKS--RIV
          W  +  V+ Y  +  LL+ FA    N     P    +M +R K+E+E   G E                                 NRE  KS  ++ 
Subjt:  TKWMKAQVVQGYVLVAALLTIFA----NSLYMEPQATKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAITTSTAAMSTPLHATQVVNREVVKS--RIV

Query:  GLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHI
         +N+K   ++  SSL N+ +  +L  H  YL+ +L++
Subjt:  GLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTTTACTCTTGATCTCTTCGATACCTTGAACTTGAACACGAACATGACCAACCTATTCGCTTTTTGTCTCATCGTGACTTCTCTAACTGCCGCGGGACTCTGGTC
TCCTTCTCCGGCGTCCCAGCAGGATCATGAGCAGGATGTTATTGTTAAAGAAGGTCATCGAGTGGTTGTGGTCGAGTATGGCGACCAAGATCAACACAATACTAAGGTTT
CCATCTCTTCCGAACCCAGTCAAGATGCCTCCCGTAGCAACCCACTACATAATAGTTTAAATACTGGGAGGCCCAGCGAAGACTCCAAAAGGCACCGTACCAGGGATCTT
ATTTGCGATGCCTTTGGCAAATGTAAACATAAGATAGCTAGTGCTGTGGAGAAAGCTAAAGTAATGGTTTCGGAGACGCGCGTCGCATTCGACGAAGCCAAAGATAGTGT
TTCAGACAAATCCCACCTAGTGATTGAGAAAACACGTGACTTGAAGGAGACATTAAAGGAAGCAGGAGAAGACGTCATGGAGGATGCTGAAAGGCTGGCAGTGGAGAAAG
GGAGAGAAGCGAGACGAACTGCGGAGAACATTAAAGCTGGTGGAAACAAGCTTAAGGAAATTCTTATTCTTACACTTGATAGAGGAGTGAAACTGGTAAACGGTTCCTTT
AGGTACTTGGGTTCGATGGAGTCATTGAAAGCGACGATGGAAGTGTTGAGTCTGCTAGGATTTGCTATGGCTCTGGGAATGTGCGTGTGGACTACCTTCATCTCCAGCTA
TGTGCTGGCGAGTGCGCTGCCAAGGCAGCAGTTGGCTGTGGTGCAGAGCAAGATATATCCCGTGTACTTCAGGGCCATGGCGTCCAGCATTGGGATGGCCTTATTCGGGC
ATCTATTCAGCCACACAAAATGGATGAAGGCTCAAGTGGTCCAAGGATATGTACTTGTGGCTGCACTTTTGACGATTTTTGCCAATTCTCTCTACATGGAGCCTCAAGCC
ACCAAGGTAATGTTTGAGAGGTTAAAAATAGAGAAGGAAGAAGGAAGAGGAATTGAGGACATAATCGCTGAACCTCGGCATGTCAATGATAGTCCCCCAGCAATCACAAC
CAGCACAGCCGCAATGTCTACTCCATTACATGCTACACAAGTCGTAAATCGAGAGGTGGTGAAGTCCAGAATCGTGGGATTGAATAGGAAGCTAAAGAAGCTGAATTCGT
ATTCATCCTTGTTAAACCTGCTCACTTTGATGGCTCTCACCTGGCATCTTGTGTACCTGAGCCAGCGTCTTCACATACCGTGCCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTATTTTACTCTTGATCTCTTCGATACCTTGAACTTGAACACGAACATGACCAACCTATTCGCTTTTTGTCTCATCGTGACTTCTCTAACTGCCGCGGGACTCTGGTC
TCCTTCTCCGGCGTCCCAGCAGGATCATGAGCAGGATGTTATTGTTAAAGAAGGTCATCGAGTGGTTGTGGTCGAGTATGGCGACCAAGATCAACACAATACTAAGGTTT
CCATCTCTTCCGAACCCAGTCAAGATGCCTCCCGTAGCAACCCACTACATAATAGTTTAAATACTGGGAGGCCCAGCGAAGACTCCAAAAGGCACCGTACCAGGGATCTT
ATTTGCGATGCCTTTGGCAAATGTAAACATAAGATAGCTAGTGCTGTGGAGAAAGCTAAAGTAATGGTTTCGGAGACGCGCGTCGCATTCGACGAAGCCAAAGATAGTGT
TTCAGACAAATCCCACCTAGTGATTGAGAAAACACGTGACTTGAAGGAGACATTAAAGGAAGCAGGAGAAGACGTCATGGAGGATGCTGAAAGGCTGGCAGTGGAGAAAG
GGAGAGAAGCGAGACGAACTGCGGAGAACATTAAAGCTGGTGGAAACAAGCTTAAGGAAATTCTTATTCTTACACTTGATAGAGGAGTGAAACTGGTAAACGGTTCCTTT
AGGTACTTGGGTTCGATGGAGTCATTGAAAGCGACGATGGAAGTGTTGAGTCTGCTAGGATTTGCTATGGCTCTGGGAATGTGCGTGTGGACTACCTTCATCTCCAGCTA
TGTGCTGGCGAGTGCGCTGCCAAGGCAGCAGTTGGCTGTGGTGCAGAGCAAGATATATCCCGTGTACTTCAGGGCCATGGCGTCCAGCATTGGGATGGCCTTATTCGGGC
ATCTATTCAGCCACACAAAATGGATGAAGGCTCAAGTGGTCCAAGGATATGTACTTGTGGCTGCACTTTTGACGATTTTTGCCAATTCTCTCTACATGGAGCCTCAAGCC
ACCAAGGTAATGTTTGAGAGGTTAAAAATAGAGAAGGAAGAAGGAAGAGGAATTGAGGACATAATCGCTGAACCTCGGCATGTCAATGATAGTCCCCCAGCAATCACAAC
CAGCACAGCCGCAATGTCTACTCCATTACATGCTACACAAGTCGTAAATCGAGAGGTGGTGAAGTCCAGAATCGTGGGATTGAATAGGAAGCTAAAGAAGCTGAATTCGT
ATTCATCCTTGTTAAACCTGCTCACTTTGATGGCTCTCACCTGGCATCTTGTGTACCTGAGCCAGCGTCTTCACATACCGTGCCAATAG
Protein sequenceShow/hide protein sequence
MYFTLDLFDTLNLNTNMTNLFAFCLIVTSLTAAGLWSPSPASQQDHEQDVIVKEGHRVVVVEYGDQDQHNTKVSISSEPSQDASRSNPLHNSLNTGRPSEDSKRHRTRDL
ICDAFGKCKHKIASAVEKAKVMVSETRVAFDEAKDSVSDKSHLVIEKTRDLKETLKEAGEDVMEDAERLAVEKGREARRTAENIKAGGNKLKEILILTLDRGVKLVNGSF
RYLGSMESLKATMEVLSLLGFAMALGMCVWTTFISSYVLASALPRQQLAVVQSKIYPVYFRAMASSIGMALFGHLFSHTKWMKAQVVQGYVLVAALLTIFANSLYMEPQA
TKVMFERLKIEKEEGRGIEDIIAEPRHVNDSPPAITTSTAAMSTPLHATQVVNREVVKSRIVGLNRKLKKLNSYSSLLNLLTLMALTWHLVYLSQRLHIPCQ