; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr011785 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr011785
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationtig00153057:65314..68946
RNA-Seq ExpressionSgr011785
SyntenySgr011785
Gene Ontology termsNA
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008447712.1 PREDICTED: uncharacterized protein LOC103490125 [Cucumis melo]1.9e-9552.63Show/hide
Query:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF
        MGD NREALINRK++VFKRS SHA DEL SFRSYLRWMCVDQSDIWTAGLSWSMFF+FA+IVPATSHFLLAC+SCDSNHARPFDRVVQLSLSSVATVSF 
Subjt:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF

Query:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGRVADSVPGERCAERRSSMRDGAAVVAV-
        CLS+FIRRYGLRRFLFFDKLCDESETVR+GYTIKLN +   +          E            + +  +G V  S     C E  S +     +  V 
Subjt:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGRVADSVPGERCAERRSSMRDGAAVVAV-

Query:  -----PDDGDIPGLHPVPSDLRPPDPTAAGLRDGVP-------GGLRCGVGAVGAPEDQTASEDHQPPV-----PRLHTMDPGPRHRQPIRL--------
               D  I  L    +  +     A+ L + +           R  V  +G+    T S+     +       L+    G      + L        
Subjt:  -----PDDGDIPGLHPVPSDLRPPDPTAAGLRDGVP-------GGLRCGVGAVGAPEDQTASEDHQPPV-----PRLHTMDPGPRHRQPIRL--------

Query:  ---PPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPAYAY
               H+          W  CA LD     S D+T            +GETPMA+     + Q FPT    G+ES+ DEGC +EDDLDNTKLIPAYAY
Subjt:  ---PPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPAYAY

Query:  STISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
        STISFQKRQALV YFENNRAGITIYGFTLDR+TLHTIFGIELSLVLWLLGKTIGFS
Subjt:  STISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

XP_022139425.1 uncharacterized protein LOC111010361 [Momordica charantia]2.3e-9653.35Show/hide
Query:  MGDNREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFFC
        MGDN EAL+NR    +KRSASHA DEL+SFRSYLRWMCVDQSDIWTAGLSWSMFF+FAVIVPATSHFLLACASCDSNHARPFDRVVQLSLS VATVSF C
Subjt:  MGDNREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFFC

Query:  LSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLN--------------ITEGAVDV--------RGPLLRGGERVQNMVVRVGRVADSVPGER------
        LSNFIRRYGLRRFLFFDKLCDESETVR+GYTIKLN                E A  +        + P L G   V ++V     +   +          
Subjt:  LSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLN--------------ITEGAVDV--------RGPLLRGGERVQNMVVRVGRVADSVPGER------

Query:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDP--------TAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLHTMDPGP
              C  +   ++D A+V  V  D G +   H  +   LR            T   +       L     +        A E     +  L ++    
Subjt:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDP--------TAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLHTMDPGP

Query:  RHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPC--GGDESDDDEGCYEEDDLDNTK
        R    I      H+          W  CA LD     S D+T            +GETPMAA   GG   VFP TP   G  E+D DEGC EEDDLDNTK
Subjt:  RHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPC--GGDESDDDEGCYEEDDLDNTK

Query:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
        LIPAYAYSTISFQKRQALV YFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
Subjt:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

XP_022940637.1 uncharacterized protein LOC111446173 [Cucurbita moschata]9.5e-9551.4Show/hide
Query:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF
        MGD +REALI+RKA VF RSASHAQDEL SFRSYLRWMCVDQSDIW+AGLSWS+FF+FA++VPATSHF LAC+SCDS+HARPFDRVVQLSLSSVATVSF 
Subjt:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF

Query:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGR--VADSVPGER----------------
        CLSNFIRRYGLRRFLFFD+LCDESETVR+GYT K N +   +          E            + +  +G   V+D+V                    
Subjt:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGR--VADSVPGER----------------

Query:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQT----------ASEDHQPPVPRLHTMDP
              C  +   ++D A V  V  D G +   H  +   LR     +   R  +   L    G+       T          A E     +  L ++  
Subjt:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQT----------ASEDHQPPVPRLHTMDP

Query:  GPRHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTK
          R    I      H+          W  CA LD     S D+T            +GETPMAAAA      +FP TP G +ES+ +EGC EED+LDNTK
Subjt:  GPRHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTK

Query:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
        LIPAYAYSTISFQKRQALV YFENNRAGITIYGFTLDR+TLHTIFGIELSLVLWLLGKTIGFS
Subjt:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

XP_022981472.1 uncharacterized protein LOC111480583 [Cucurbita maxima]1.9e-9551.62Show/hide
Query:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF
        MGD +REALI+RKA VF RSASHAQDEL SFRSYLRWMCVDQSDIW+AGLSWS+FF+FA++VPATSHF LAC+SCDS+HARPFDRVVQLSLSSVATVSF 
Subjt:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF

Query:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGR--VADSVPGER----------------
        CLSNFIRRYGLRRFLFFD+LCDESETVR+GYT K N +   +          E            + +  +G   V+D+V                    
Subjt:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGR--VADSVPGER----------------

Query:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQT----------ASEDHQPPVPRLHTMDP
              C  +   ++D A V  V  D G +   H  +   LR     +   R  +   L    G+       T          A E     +  L ++  
Subjt:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQT----------ASEDHQPPVPRLHTMDP

Query:  GPRHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTK
          R    I      H+          W  CA LD     S D+T            +GETPMAAAA      +FP TP G +ESD +EGC EED+LDNTK
Subjt:  GPRHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTK

Query:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
        LIPAYAYSTISFQKRQALV YFENNRAGITIYGFTLDR+TLHTIFGIELSLVLWLLGKTIGFS
Subjt:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

XP_038898198.1 uncharacterized protein LOC120085939 [Benincasa hispida]1.9e-9552.59Show/hide
Query:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF
        MGD NREALI+RK++VFKRSASHA DELQSFRSYLRWMCVDQSDIWTAGLSWSMFF+FA+IVPATSHF+LAC+SCDSNHARPFDRVVQLSLSS+ATVSF 
Subjt:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF

Query:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLN--------------ITEGAVDV--------RGPLLRGGERVQNMVVRVGRVADSVPGER-----
        CLS FIRRYGLRRFLFFDKLCDESETVR GYT K N                E A  +        + P L G   V ++V     +   +         
Subjt:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLN--------------ITEGAVDV--------RGPLLRGGERVQNMVVRVGRVADSVPGER-----

Query:  -------CAERRSSMRDGAAVVAVPDDGDIPGLHPVPSDLRPP-DPTAAGLRDGVPGGLRCGVGAVGAP-----------EDQTASEDHQPPVPRLHTMD
               C  +   ++D A V  V  D D+  +      +R      +   R  + G L    G+                   A E     +  L ++ 
Subjt:  -------CAERRSSMRDGAAVVAVPDDGDIPGLHPVPSDLRPP-DPTAAGLRDGVPGGLRCGVGAVGAP-----------EDQTASEDHQPPVPRLHTMD

Query:  PGPRHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNT
           R    I      H+          W  CA LD     S D+T            +GETPMA+  Q    QVFP T  GGDES   EGC EEDDLDNT
Subjt:  PGPRHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNT

Query:  KLIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
        KLIPAYAYSTISFQKRQALV YFENNRAGITIYGFTLDR+TLHTIFGIELSLVLWLLGKTIGFS
Subjt:  KLIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

TrEMBL top hitse value%identityAlignment
A0A0A0K352 Uncharacterized protein5.6e-9351.64Show/hide
Query:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF
        MGD NREALI+RK++VFKRS SHA DEL SFRSYLRWMCVDQSDIWTAGLSWSMFF+FA+IVPATSHF+LAC+SCDSNHARPFDRVVQLSLSSVATVSF 
Subjt:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF

Query:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGRVADSVPGERCAERRSSMRDGAAVV---
        CLS+FIRRYGLRRFLFFDKLCDESETVR+GYTIK N +   +          E            + +  +G V  S     CA    S      V+   
Subjt:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGRVADSVPGERCAERRSSMRDGAAVV---

Query:  ----AVPDDGDIPGLHPVPSDLRPPDPTAAGLRDGVP-------GGLRCGVGAVGAPEDQTASEDHQPPV-----PRLHTMDPGPRHRQPIRL-------
             +  D  I  L    +  +     A+ L + +           R  V  +G+    T S+     +       L+    G      + L       
Subjt:  ----AVPDDGDIPGLHPVPSDLRPPDPTAAGLRDGVP-------GGLRCGVGAVGAPEDQTASEDHQPPV-----PRLHTMDPGPRHRQPIRL-------

Query:  ----PPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPAYA
                H+          W  CA LD     S D+T            +GETPMA+     + Q FP     G+ES+ DEGC +EDDLDNTKLIPAYA
Subjt:  ----PPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPAYA

Query:  YSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
        YSTISFQKRQALV YFENNRAGITIYGFTLDR+TLHTIFGIELSLVLWLLGKTIGFS
Subjt:  YSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

A0A1S3BI23 uncharacterized protein LOC1034901259.2e-9652.63Show/hide
Query:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF
        MGD NREALINRK++VFKRS SHA DEL SFRSYLRWMCVDQSDIWTAGLSWSMFF+FA+IVPATSHFLLAC+SCDSNHARPFDRVVQLSLSSVATVSF 
Subjt:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF

Query:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGRVADSVPGERCAERRSSMRDGAAVVAV-
        CLS+FIRRYGLRRFLFFDKLCDESETVR+GYTIKLN +   +          E            + +  +G V  S     C E  S +     +  V 
Subjt:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGRVADSVPGERCAERRSSMRDGAAVVAV-

Query:  -----PDDGDIPGLHPVPSDLRPPDPTAAGLRDGVP-------GGLRCGVGAVGAPEDQTASEDHQPPV-----PRLHTMDPGPRHRQPIRL--------
               D  I  L    +  +     A+ L + +           R  V  +G+    T S+     +       L+    G      + L        
Subjt:  -----PDDGDIPGLHPVPSDLRPPDPTAAGLRDGVP-------GGLRCGVGAVGAPEDQTASEDHQPPV-----PRLHTMDPGPRHRQPIRL--------

Query:  ---PPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPAYAY
               H+          W  CA LD     S D+T            +GETPMA+     + Q FPT    G+ES+ DEGC +EDDLDNTKLIPAYAY
Subjt:  ---PPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPAYAY

Query:  STISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
        STISFQKRQALV YFENNRAGITIYGFTLDR+TLHTIFGIELSLVLWLLGKTIGFS
Subjt:  STISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

A0A6J1CCA0 uncharacterized protein LOC1110103611.1e-9653.35Show/hide
Query:  MGDNREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFFC
        MGDN EAL+NR    +KRSASHA DEL+SFRSYLRWMCVDQSDIWTAGLSWSMFF+FAVIVPATSHFLLACASCDSNHARPFDRVVQLSLS VATVSF C
Subjt:  MGDNREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFFC

Query:  LSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLN--------------ITEGAVDV--------RGPLLRGGERVQNMVVRVGRVADSVPGER------
        LSNFIRRYGLRRFLFFDKLCDESETVR+GYTIKLN                E A  +        + P L G   V ++V     +   +          
Subjt:  LSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLN--------------ITEGAVDV--------RGPLLRGGERVQNMVVRVGRVADSVPGER------

Query:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDP--------TAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLHTMDPGP
              C  +   ++D A+V  V  D G +   H  +   LR            T   +       L     +        A E     +  L ++    
Subjt:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDP--------TAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLHTMDPGP

Query:  RHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPC--GGDESDDDEGCYEEDDLDNTK
        R    I      H+          W  CA LD     S D+T            +GETPMAA   GG   VFP TP   G  E+D DEGC EEDDLDNTK
Subjt:  RHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPC--GGDESDDDEGCYEEDDLDNTK

Query:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
        LIPAYAYSTISFQKRQALV YFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
Subjt:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

A0A6J1FK66 uncharacterized protein LOC1114461734.6e-9551.4Show/hide
Query:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF
        MGD +REALI+RKA VF RSASHAQDEL SFRSYLRWMCVDQSDIW+AGLSWS+FF+FA++VPATSHF LAC+SCDS+HARPFDRVVQLSLSSVATVSF 
Subjt:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF

Query:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGR--VADSVPGER----------------
        CLSNFIRRYGLRRFLFFD+LCDESETVR+GYT K N +   +          E            + +  +G   V+D+V                    
Subjt:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGR--VADSVPGER----------------

Query:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQT----------ASEDHQPPVPRLHTMDP
              C  +   ++D A V  V  D G +   H  +   LR     +   R  +   L    G+       T          A E     +  L ++  
Subjt:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQT----------ASEDHQPPVPRLHTMDP

Query:  GPRHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTK
          R    I      H+          W  CA LD     S D+T            +GETPMAAAA      +FP TP G +ES+ +EGC EED+LDNTK
Subjt:  GPRHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTK

Query:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
        LIPAYAYSTISFQKRQALV YFENNRAGITIYGFTLDR+TLHTIFGIELSLVLWLLGKTIGFS
Subjt:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

A0A6J1IU24 uncharacterized protein LOC1114805839.2e-9651.62Show/hide
Query:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF
        MGD +REALI+RKA VF RSASHAQDEL SFRSYLRWMCVDQSDIW+AGLSWS+FF+FA++VPATSHF LAC+SCDS+HARPFDRVVQLSLSSVATVSF 
Subjt:  MGD-NREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF

Query:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGR--VADSVPGER----------------
        CLSNFIRRYGLRRFLFFD+LCDESETVR+GYT K N +   +          E            + +  +G   V+D+V                    
Subjt:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERV---------QNMVVRVGR--VADSVPGER----------------

Query:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQT----------ASEDHQPPVPRLHTMDP
              C  +   ++D A V  V  D G +   H  +   LR     +   R  +   L    G+       T          A E     +  L ++  
Subjt:  ------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQT----------ASEDHQPPVPRLHTMDP

Query:  GPRHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTK
          R    I      H+          W  CA LD     S D+T            +GETPMAAAA      +FP TP G +ESD +EGC EED+LDNTK
Subjt:  GPRHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTK

Query:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
        LIPAYAYSTISFQKRQALV YFENNRAGITIYGFTLDR+TLHTIFGIELSLVLWLLGKTIGFS
Subjt:  LIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)1.5e-6942.24Show/hide
Query:  REALINRKAN-----VFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF
        +E LIN + N     +F R  SH QDEL SFR YLRWMCVD S  WTA LSW+MF VF ++VPA SHFLLACA CDS H+RP+D VVQLSLSSVATVSF 
Subjt:  REALINRKAN-----VFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF

Query:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDV----------------------RGPLLRGGERVQNMVVRVGRVADSVPGER-----
        CL+ F+ +YGLRRFLFFDKL DESETVR+ YT +LN +   V                        R P L G   + + V  +  +   +         
Subjt:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDV----------------------RGPLLRGGERVQNMVVRVGRVADSVPGER-----

Query:  -------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLH--TMDPGPRHRQP
               C  +   ++D A +  +  D G I   H  +   LR     +   R  +   L    G+  +    T     +  + R     +         
Subjt:  -------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLH--TMDPGPRHRQP

Query:  IRLPPHHHQVPFQPQYLHC----WRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAA----QGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNT
        + L     ++  + Q + C    W  CA L+     S D T E  +         ETP   A        V  V   T     ESD DE   EEDDLDN 
Subjt:  IRLPPHHHQVPFQPQYLHC----WRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAA----QGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNT

Query:  KLIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
         +IP YA+ST+SFQKRQALV YFENN AGIT+YGFTLDR TLHTIFG+ELSLVLWLLGKTIG S
Subjt:  KLIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

AT1G50630.2 Protein of unknown function (DUF3537)5.1e-4637.18Show/hide
Query:  REALINRKAN-----VFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF
        +E LIN + N     +F R  SH QDEL SFR YLRWMCVD S  WTA LSW+MF VF ++VPA SHFLLACA CDS H+RP+D VVQLSLSSVATVSF 
Subjt:  REALINRKAN-----VFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFF

Query:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDV----------------------RGPLLRGGERVQNMVVRVGRVADSVPGER-----
        CL+ F+ +YGLRRFLFFDKL DESETVR+ YT +LN +   V                        R P L G   + + V  +  +   +         
Subjt:  CLSNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLNITEGAVDV----------------------RGPLLRGGERVQNMVVRVGRVADSVPGER-----

Query:  -------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLH--TMDPGPRHRQP
               C  +   ++D A +  +  D G I   H  +   LR     +   R  +   L    G+  +    T     +  + R     +         
Subjt:  -------CAERRSSMRDGAAVVAVPDD-GDIPGLH-PVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLH--TMDPGPRHRQP

Query:  IRLPPHHHQVPFQPQYLHC----WRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAA----QGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNT
        + L     ++  + Q + C    W  CA L+     S D T E  +         ETP   A        V  V   T     ESD DE   EEDDLDN 
Subjt:  IRLPPHHHQVPFQPQYLHC----WRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAA----QGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNT

Query:  KLIPAYAYSTISFQKRQALVMYFEN
         +IP YA+ST+SFQKRQAL    +N
Subjt:  KLIPAYAYSTISFQKRQALVMYFEN

AT3G20300.1 Protein of unknown function (DUF3537)1.4e-7242.92Show/hide
Query:  GDNREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFFCL
        G  RE LINR+ N F RS SHAQDEL SFR YLRWMCVDQS  WTA LSWSMF VF ++VPATSHF+LAC+ CDS+H+RP+D VVQLSLSS A +SF CL
Subjt:  GDNREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFFCL

Query:  SNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLN---------ITEGAVDVRGPLLRGGERVQNMVVRVGRV--ADSVPGERCAERRSSMRDGAAVVAVP
        S F+ +YGLRRFLFFDKL DESETVR GYT +LN         ++   + +    +       + +  +G V  +D+V    C     S      V+ + 
Subjt:  SNFIRRYGLRRFLFFDKLCDESETVRKGYTIKLN---------ITEGAVDVRGPLLRGGERVQNMVVRVGRV--ADSVPGERCAERRSSMRDGAAVVAVP

Query:  DDGDIPGLHPVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLHTMDPGPRHRQPIRL-------PPHHHQVPFQPQY--LHCW
            +  L  +   L+        L+D            V   +    S   +    R H      R+R  I L          +  +     Y  L+ +
Subjt:  DDGDIPGLHPVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLHTMDPGPRHRQPIRL-------PPHHHQVPFQPQY--LHCW

Query:  RT-----CAV------------LDDAAYKSDDITT------ECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPA
        R      C++                 +K+  +T        C    +   ++GETP       G    +PT    G+   +D G  EEDD DN  LIPA
Subjt:  RT-----CAV------------LDDAAYKSDDITT------ECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPA

Query:  YAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS
        YAYSTISFQKRQALV YFENNR+GIT++GFTLDRSTLHTIFGIE+SLVLWLLGKTIG S
Subjt:  YAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS

AT4G03820.1 Protein of unknown function (DUF3537)9.6e-2930.99Show/hide
Query:  SFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFFCLSNFIRRYGLRRFLFFDKLCDESETVRK
        SF     W   DQS+     LSWS+FF+ AVIVP  SHF+L CA CD  H RP+D +VQLSLS  A +SF  LS++ ++YG+RRFLFFDKL D S+ VR 
Subjt:  SFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFFCLSNFIRRYGLRRFLFFDKLCDESETVRK

Query:  GYTIKLNITEGAVDV---RGPLLRGGERVQNMVVRVGRVADSVPGE-----RCAERRSS--MRDGAAVVAVPDDGDIPGLHPV---------PSDLRPPD
        GY  K+  +   + +       L+   R+        ++   +         C  + SS   R    ++A     +I  L  +          S+++   
Subjt:  GYTIKLNITEGAVDV---RGPLLRGGERVQNMVVRVGRVADSVPGE-----RCAERRSS--MRDGAAVVAVPDDGDIPGLHPV---------PSDLRPPD

Query:  PTAAG----------LRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLHTMDPGPRHRQPIRL-----------PPHHHQVPFQPQYLHCWRTCAVLD
           A           +       +   +  V A +        +  VP  +  + G        L               H+          W  CA LD
Subjt:  PTAAG----------LRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLHTMDPGPRHRQPIRL-----------PPHHHQVPFQPQYLHCWRTCAVLD

Query:  DAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPAYAYSTISFQKRQALVMYFENNRAGITIYGF
              D  T +C        I        + +  VVQ            DD+EG  +++DL+   + P +A   IS QKRQALV Y ENNRAGIT+YGF
Subjt:  DAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPAYAYSTISFQKRQALVMYFENNRAGITIYGF

Query:  TLDRSTLHTIFGIELSLVLWLLGKTI
         +D++ L  IF IEL+L+LWLL KTI
Subjt:  TLDRSTLHTIFGIELSLVLWLLGKTI

AT4G22270.1 Protein of unknown function (DUF3537)8.7e-3832.29Show/hide
Query:  ALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFFCLSNFIR
        A+IN++   F  S         +F S + W   DQS+  TA LSWS+FF+  VIVP  SHFLL C+ CD +H RP+D +VQLSLS  A +SF  LS + R
Subjt:  ALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFFCLSNFIR

Query:  RYGLRRFLFFDKLCDESETVRKGY--TIKLNITEGAVDVRGPL-LRGGERVQNMVVRVGRVADSVPG-----ERCAERRSSMRDGAAVVAVPDDGDIPGL
        ++G+RRFLF DKL D S+ VR  Y   I+ ++    + V   L L    R+   +    ++   +         C  + SS     ++  +     +  L
Subjt:  RYGLRRFLFFDKLCDESETVRKGY--TIKLNITEGAVDVRGPL-LRGGERVQNMVVRVGRVADSVPG-----ERCAERRSSMRDGAAVVAVPDDGDIPGL

Query:  HPVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLHTMDPGPRH---------------------RQPIRLPPHH------HQV
        + +   L+        LR  +    RC    +   + ++A  +HQ     L  +    R                      R  + +  +         +
Subjt:  HPVPSDLRPPDPTAAGLRDGVPGGLRCGVGAVGAPEDQTASEDHQPPVPRLHTMDPGPRH---------------------RQPIRLPPHH------HQV

Query:  PFQPQYLHCWRTCAVLDDAAYKSDDITTE---CDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPAYAYSTISFQK
                C R+   +   A     +  +   C    +   ++GETP      G +++   +      E+ DDE    +DDLDNTK+ P YA +TIS+QK
Subjt:  PFQPQYLHCWRTCAVLDDAAYKSDDITTE---CDENYTQGAINGETPMAAAAQGGVVQVFPTTPCGGDESDDDEGCYEEDDLDNTKLIPAYAYSTISFQK

Query:  RQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTI
        RQALV Y ENN+AGIT+YGF +DRS L+TIFGIEL+L+LWLL KTI
Subjt:  RQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACAATAGAGAGGCGTTGATAAACAGAAAGGCGAACGTATTCAAGCGCTCCGCCTCTCACGCTCAAGACGAGTTGCAGAGCTTCAGATCGTACCTGCGGTGGAT
GTGTGTGGACCAATCGGACATCTGGACGGCCGGACTGTCGTGGTCGATGTTCTTCGTTTTCGCCGTCATCGTTCCGGCGACGTCACATTTCCTGCTGGCCTGCGCTTCCT
GCGACAGCAACCACGCCAGGCCGTTCGATCGCGTCGTCCAATTGTCGCTCAGCAGTGTGGCGACGGTTTCGTTTTTCTGTCTTTCGAACTTCATCAGACGGTACGGGCTG
AGGAGATTCTTGTTCTTCGATAAGCTCTGCGATGAAAGCGAAACTGTGAGGAAAGGATACACGATCAAGCTCAATATCACTGAGGGTGCTGTCGACGTTCGTGGTCCCCT
GCTTCGCGGCGGAGAGCGCGTACAAAATATGGTGGTACGCGTCGGGCGCGTCGCAGATTCCGTTCCTGGGGAACGTTGTGCTGAGCGACGCAGTAGCATGCGCGATGGAG
CTGCTGTCGTGGCTGTACCGGACGACGGTGATATTCCTGGTCTGCATCCTGTTCCGTCTGATCTGCGACCTCCAGATCCTACGGCTGCAGGACTTCGCGACGGTGTTCCA
GGTGGACTCAGATGTGGGGTCGGTGCTGTCGGAGCACCTGAGGATCAGACGGCATCTGAGGATCATCAGCCACCGGTACCGCGCCTTCATACTATGGACCCTGGTCCTCG
TCACAGGCAGCCAATTCGCCTCCCTCCTCATCACCACCAAGTCCCCTTCCAACCTCAATATTTACATTGCTGGCGAACTTGCGCTGTGCTCGATGACGCTGCTTACAAGT
CTGATGATATTACTACGGAGTGCGACGAAAATTACACACAAGGCGCAATCAACGGCGAGACACCCATGGCTGCGGCTGCCCAGGGCGGCGTAGTCCAAGTGTTTCCGACC
ACCCCATGCGGCGGAGATGAATCGGACGACGACGAAGGTTGCTACGAAGAAGATGATCTGGACAACACCAAGTTGATCCCAGCCTACGCTTACAGCACCATCTCCTTCCA
AAAGAGACAGGCCTTAGTGATGTATTTCGAGAACAACAGAGCGGGGATAACGATATACGGGTTTACCCTGGATAGGAGTACGCTCCACACCATCTTTGGAATAGAGTTAT
CCTTGGTTCTTTGGCTGCTTGGCAAGACAATCGGTTTTTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACAATAGAGAGGCGTTGATAAACAGAAAGGCGAACGTATTCAAGCGCTCCGCCTCTCACGCTCAAGACGAGTTGCAGAGCTTCAGATCGTACCTGCGGTGGAT
GTGTGTGGACCAATCGGACATCTGGACGGCCGGACTGTCGTGGTCGATGTTCTTCGTTTTCGCCGTCATCGTTCCGGCGACGTCACATTTCCTGCTGGCCTGCGCTTCCT
GCGACAGCAACCACGCCAGGCCGTTCGATCGCGTCGTCCAATTGTCGCTCAGCAGTGTGGCGACGGTTTCGTTTTTCTGTCTTTCGAACTTCATCAGACGGTACGGGCTG
AGGAGATTCTTGTTCTTCGATAAGCTCTGCGATGAAAGCGAAACTGTGAGGAAAGGATACACGATCAAGCTCAATATCACTGAGGGTGCTGTCGACGTTCGTGGTCCCCT
GCTTCGCGGCGGAGAGCGCGTACAAAATATGGTGGTACGCGTCGGGCGCGTCGCAGATTCCGTTCCTGGGGAACGTTGTGCTGAGCGACGCAGTAGCATGCGCGATGGAG
CTGCTGTCGTGGCTGTACCGGACGACGGTGATATTCCTGGTCTGCATCCTGTTCCGTCTGATCTGCGACCTCCAGATCCTACGGCTGCAGGACTTCGCGACGGTGTTCCA
GGTGGACTCAGATGTGGGGTCGGTGCTGTCGGAGCACCTGAGGATCAGACGGCATCTGAGGATCATCAGCCACCGGTACCGCGCCTTCATACTATGGACCCTGGTCCTCG
TCACAGGCAGCCAATTCGCCTCCCTCCTCATCACCACCAAGTCCCCTTCCAACCTCAATATTTACATTGCTGGCGAACTTGCGCTGTGCTCGATGACGCTGCTTACAAGT
CTGATGATATTACTACGGAGTGCGACGAAAATTACACACAAGGCGCAATCAACGGCGAGACACCCATGGCTGCGGCTGCCCAGGGCGGCGTAGTCCAAGTGTTTCCGACC
ACCCCATGCGGCGGAGATGAATCGGACGACGACGAAGGTTGCTACGAAGAAGATGATCTGGACAACACCAAGTTGATCCCAGCCTACGCTTACAGCACCATCTCCTTCCA
AAAGAGACAGGCCTTAGTGATGTATTTCGAGAACAACAGAGCGGGGATAACGATATACGGGTTTACCCTGGATAGGAGTACGCTCCACACCATCTTTGGAATAGAGTTAT
CCTTGGTTCTTTGGCTGCTTGGCAAGACAATCGGTTTTTCTTAG
Protein sequenceShow/hide protein sequence
MGDNREALINRKANVFKRSASHAQDELQSFRSYLRWMCVDQSDIWTAGLSWSMFFVFAVIVPATSHFLLACASCDSNHARPFDRVVQLSLSSVATVSFFCLSNFIRRYGL
RRFLFFDKLCDESETVRKGYTIKLNITEGAVDVRGPLLRGGERVQNMVVRVGRVADSVPGERCAERRSSMRDGAAVVAVPDDGDIPGLHPVPSDLRPPDPTAAGLRDGVP
GGLRCGVGAVGAPEDQTASEDHQPPVPRLHTMDPGPRHRQPIRLPPHHHQVPFQPQYLHCWRTCAVLDDAAYKSDDITTECDENYTQGAINGETPMAAAAQGGVVQVFPT
TPCGGDESDDDEGCYEEDDLDNTKLIPAYAYSTISFQKRQALVMYFENNRAGITIYGFTLDRSTLHTIFGIELSLVLWLLGKTIGFS