; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014590 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014590
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF1997)
Genome locationtig00000729:867403..870622
RNA-Seq ExpressionSgr014590
SyntenySgr014590
Gene Ontology termsNA
InterPro domainsIPR018971 - Protein of unknown function DUF1997


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131393.1 uncharacterized protein LOC111004622 isoform X1 [Momordica charantia]2.4e-10088.89Show/hide
Query:  KKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTLDLRV
        +KKFEVIKLSKATNS+TNTKRANL V RKEKIKLP+YSD RGGRTYHISEFL HPSGIEAMLNKNAL+SFQLLDANTYRCTLP+LQLLNFEAAPTLDLRV
Subjt:  KKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTLDLRV

Query:  IATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDY
        I TD+D  VEMLSCKFEGSELVERQN HFSALMINHLTW+TVDSNSFL VDVKL+LSLEIYTLPFTLMPTAAVENPGNLMLQAL+D LVPLLLRQ+VQDY
Subjt:  IATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDY

Query:  EKWIRQQLDHSHVSMS
        EKWIRQQ DHS +S S
Subjt:  EKWIRQQLDHSHVSMS

XP_022131395.1 uncharacterized protein LOC111004622 isoform X2 [Momordica charantia]9.0e-10089.25Show/hide
Query:  KFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTLDLRVIA
        KFEVIKLSKATNS+TNTKRANL V RKEKIKLP+YSD RGGRTYHISEFL HPSGIEAMLNKNAL+SFQLLDANTYRCTLP+LQLLNFEAAPTLDLRVI 
Subjt:  KFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTLDLRVIA

Query:  TDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDYEK
        TD+D  VEMLSCKFEGSELVERQN HFSALMINHLTW+TVDSNSFL VDVKL+LSLEIYTLPFTLMPTAAVENPGNLMLQAL+D LVPLLLRQ+VQDYEK
Subjt:  TDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDYEK

Query:  WIRQQLDHSHVSMS
        WIRQQ DHS +S S
Subjt:  WIRQQLDHSHVSMS

XP_022929609.1 uncharacterized protein LOC111436145 [Cucurbita moschata]6.0e-9685.07Show/hide
Query:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL
        F   +KKFE IK+SKATNS+TNTKRANLSV RKEKIKLP+YS GRGGRTYHI EFL HPSGIEAM+NKNAL+SFQ LDANTYRCTL  LQLLNFEAAPTL
Subjt:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL

Query:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL
        DLRVI T+ED  VEMLSCKFEGSELVERQN+HFSALMINHLTW++V SNS+L VDVKL+LSLEIYTLPFTLMPTAAVENPGNLMLQALLD LVPLLLRQL
Subjt:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL

Query:  VQDYEKWI-RQQLDHSHVSMS
        VQDYEKWI +QQ+DHSHVS+S
Subjt:  VQDYEKWI-RQQLDHSHVSMS

XP_022997460.1 uncharacterized protein LOC111492370 [Cucurbita maxima]4.6e-9685.52Show/hide
Query:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL
        F   +KKFE IKLSKATNS+TNTKRANLSV RKEKIKLP+YS GRGGRTYHI EFL HPSGIEAM+NKNAL+SFQ LDANTYRCTL  LQLLNFEAAPTL
Subjt:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL

Query:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL
        DLRVI T+ED  VEMLSCKFEGSELVERQN+HFSALMINHLTW++V SNS+L VDVKL LSLEIYTLPFTLMPTAAVENPGNLMLQALLD LVPLLLRQL
Subjt:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL

Query:  VQDYEKWI-RQQLDHSHVSMS
        VQDYEKWI +QQ+DHSHVS+S
Subjt:  VQDYEKWI-RQQLDHSHVSMS

XP_038885685.1 uncharacterized protein LOC120075989 isoform X1 [Benincasa hispida]4.9e-9886.36Show/hide
Query:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL
        F   +KKFEVIKLSKATNS+TNTKRANLSV R+EKI+LP+YS  R GR YHI EFL HPSGIEAMLNKNALRSFQLLDANTYRCTLP LQLLNFEAAPTL
Subjt:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL

Query:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL
        DLRVI TD+D  VEMLSCKFEGSELVERQN+HFSALMINHLTW+TVDSNS+L VDVKL+LSLEIYTLPFTLMPTAAVENPGNLMLQALLD LVPLLLRQL
Subjt:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL

Query:  VQDYEKWIRQQLDHSHVSMS
        VQDYEKWI QQLDHS +S+S
Subjt:  VQDYEKWIRQQLDHSHVSMS

TrEMBL top hitse value%identityAlignment
A0A0A0LSA8 Uncharacterized protein6.5e-9683.64Show/hide
Query:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL
        F   +KKFE  KLSKATNS+TNTK+ANLSV ++EKI+LP+YS    GRTYHI EFL HPSGIEAMLNKNAL+SFQLLDANTYRCTLP LQLLNFEAAPTL
Subjt:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL

Query:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL
        DLRVI TDED  VEMLSCKFEGSELVERQN+HFSALMINHLTW+T+DSNS+L VDVKL+LSLEIYTLPFTLMPTAAVENPGNLMLQALLD LVPLLLRQL
Subjt:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL

Query:  VQDYEKWIRQQLDHSHVSMS
        +QDYEKWI QQLDHS +S+S
Subjt:  VQDYEKWIRQQLDHSHVSMS

A0A6J1BPL2 uncharacterized protein LOC111004622 isoform X24.4e-10089.25Show/hide
Query:  KFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTLDLRVIA
        KFEVIKLSKATNS+TNTKRANL V RKEKIKLP+YSD RGGRTYHISEFL HPSGIEAMLNKNAL+SFQLLDANTYRCTLP+LQLLNFEAAPTLDLRVI 
Subjt:  KFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTLDLRVIA

Query:  TDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDYEK
        TD+D  VEMLSCKFEGSELVERQN HFSALMINHLTW+TVDSNSFL VDVKL+LSLEIYTLPFTLMPTAAVENPGNLMLQAL+D LVPLLLRQ+VQDYEK
Subjt:  TDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDYEK

Query:  WIRQQLDHSHVSMS
        WIRQQ DHS +S S
Subjt:  WIRQQLDHSHVSMS

A0A6J1BT83 uncharacterized protein LOC111004622 isoform X11.1e-10088.89Show/hide
Query:  KKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTLDLRV
        +KKFEVIKLSKATNS+TNTKRANL V RKEKIKLP+YSD RGGRTYHISEFL HPSGIEAMLNKNAL+SFQLLDANTYRCTLP+LQLLNFEAAPTLDLRV
Subjt:  KKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTLDLRV

Query:  IATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDY
        I TD+D  VEMLSCKFEGSELVERQN HFSALMINHLTW+TVDSNSFL VDVKL+LSLEIYTLPFTLMPTAAVENPGNLMLQAL+D LVPLLLRQ+VQDY
Subjt:  IATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDY

Query:  EKWIRQQLDHSHVSMS
        EKWIRQQ DHS +S S
Subjt:  EKWIRQQLDHSHVSMS

A0A6J1EP97 uncharacterized protein LOC1114361452.9e-9685.07Show/hide
Query:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL
        F   +KKFE IK+SKATNS+TNTKRANLSV RKEKIKLP+YS GRGGRTYHI EFL HPSGIEAM+NKNAL+SFQ LDANTYRCTL  LQLLNFEAAPTL
Subjt:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL

Query:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL
        DLRVI T+ED  VEMLSCKFEGSELVERQN+HFSALMINHLTW++V SNS+L VDVKL+LSLEIYTLPFTLMPTAAVENPGNLMLQALLD LVPLLLRQL
Subjt:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL

Query:  VQDYEKWI-RQQLDHSHVSMS
        VQDYEKWI +QQ+DHSHVS+S
Subjt:  VQDYEKWI-RQQLDHSHVSMS

A0A6J1K9Q3 uncharacterized protein LOC1114923702.2e-9685.52Show/hide
Query:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL
        F   +KKFE IKLSKATNS+TNTKRANLSV RKEKIKLP+YS GRGGRTYHI EFL HPSGIEAM+NKNAL+SFQ LDANTYRCTL  LQLLNFEAAPTL
Subjt:  FQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTL

Query:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL
        DLRVI T+ED  VEMLSCKFEGSELVERQN+HFSALMINHLTW++V SNS+L VDVKL LSLEIYTLPFTLMPTAAVENPGNLMLQALLD LVPLLLRQL
Subjt:  DLRVIATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQL

Query:  VQDYEKWI-RQQLDHSHVSMS
        VQDYEKWI +QQ+DHSHVS+S
Subjt:  VQDYEKWI-RQQLDHSHVSMS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G31115.1 Protein of unknown function (DUF1997)5.2e-5352.79Show/hide
Query:  TNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLD--ANTYRCTLPALQLLNFEAAPTLDLRVIATDEDCRVEMLSC
        ++ K+AN+S +RK++IKL    +  G +    SEFL HPSG+EA++N  AL+S+ L+D   +TYRCTLP +QL++FE  P L LRV  T EDC VE+LSC
Subjt:  TNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLD--ANTYRCTLPALQLLNFEAAPTLDLRVIATDEDCRVEMLSC

Query:  KFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDYEKWIRQQLDHS
        K EGSEL+E Q++ FSA+M N +TW       FL VDV+L+++LEI T PFT++P +AVE PGNL++Q L+D LVPLLL+QL++DY++WI++Q  +S
Subjt:  KFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDYEKWIRQQLDHS

AT4G31115.2 Protein of unknown function (DUF1997)5.2e-5352.79Show/hide
Query:  TNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLD--ANTYRCTLPALQLLNFEAAPTLDLRVIATDEDCRVEMLSC
        ++ K+AN+S +RK++IKL    +  G +    SEFL HPSG+EA++N  AL+S+ L+D   +TYRCTLP +QL++FE  P L LRV  T EDC VE+LSC
Subjt:  TNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLD--ANTYRCTLPALQLLNFEAAPTLDLRVIATDEDCRVEMLSC

Query:  KFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDYEKWIRQQLDHS
        K EGSEL+E Q++ FSA+M N +TW       FL VDV+L+++LEI T PFT++P +AVE PGNL++Q L+D LVPLLL+QL++DY++WI++Q  +S
Subjt:  KFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDYEKWIRQQLDHS

AT5G04440.1 Protein of unknown function (DUF1997)1.3e-1629.29Show/hide
Query:  SKATNSQTNTKRANLSVARKEK-IKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTLDLRVIATDEDCR
        S AT+S T+  R + S   K + I     S         + E+++ P+   ++L+   +   + +D NT+RC +   +  NFE  P L +RV      C 
Subjt:  SKATNSQTNTKRANLSVARKEK-IKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTLDLRVIATDEDCR

Query:  VEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSF---LGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDYEKW
        +++LSCK EGS +V  QN  F A M+N ++ ++    S    +  D  +++++EI    F + P  A+E  G  +L  +L  ++P  L QL +DY  W
Subjt:  VEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSF---LGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDYEKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGATTAGAATGTTTCAGTTTCTGAAGAAGAAATTTGAAGTGATTAAGCTATCTAAGGCCACCAATTCTCAGACTAATACCAAGAGGGCAAACTTATCCGTCGCAAG
GAAGGAAAAGATCAAATTACCCAATTACAGTGACGGCCGCGGAGGCAGGACATATCATATCAGCGAATTCTTGACTCACCCTTCAGGAATTGAAGCAATGCTTAACAAAA
ATGCCTTGCGAAGTTTCCAGTTGCTTGATGCTAACACATACAGATGCACTCTGCCAGCATTACAACTTTTGAACTTTGAAGCTGCCCCTACACTGGATCTACGAGTGATC
GCGACAGACGAAGATTGTAGAGTTGAGATGCTTTCGTGCAAGTTTGAAGGTTCAGAATTGGTGGAACGCCAAAACAAACATTTTTCGGCCTTGATGATTAATCACTTGAC
ATGGGAGACAGTTGATTCGAATTCGTTTCTGGGAGTTGATGTGAAGTTGGATCTGTCTCTGGAGATTTATACACTTCCCTTCACCCTGATGCCTACAGCTGCTGTGGAGA
ATCCAGGGAATTTGATGCTACAAGCTCTCCTGGACAGGCTTGTACCTCTGCTGCTGCGGCAATTAGTGCAAGATTATGAAAAATGGATCCGTCAGCAGCTCGATCATTCT
CATGTCTCTATGTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGATTAGAATGTTTCAGTTTCTGAAGAAGAAATTTGAAGTGATTAAGCTATCTAAGGCCACCAATTCTCAGACTAATACCAAGAGGGCAAACTTATCCGTCGCAAG
GAAGGAAAAGATCAAATTACCCAATTACAGTGACGGCCGCGGAGGCAGGACATATCATATCAGCGAATTCTTGACTCACCCTTCAGGAATTGAAGCAATGCTTAACAAAA
ATGCCTTGCGAAGTTTCCAGTTGCTTGATGCTAACACATACAGATGCACTCTGCCAGCATTACAACTTTTGAACTTTGAAGCTGCCCCTACACTGGATCTACGAGTGATC
GCGACAGACGAAGATTGTAGAGTTGAGATGCTTTCGTGCAAGTTTGAAGGTTCAGAATTGGTGGAACGCCAAAACAAACATTTTTCGGCCTTGATGATTAATCACTTGAC
ATGGGAGACAGTTGATTCGAATTCGTTTCTGGGAGTTGATGTGAAGTTGGATCTGTCTCTGGAGATTTATACACTTCCCTTCACCCTGATGCCTACAGCTGCTGTGGAGA
ATCCAGGGAATTTGATGCTACAAGCTCTCCTGGACAGGCTTGTACCTCTGCTGCTGCGGCAATTAGTGCAAGATTATGAAAAATGGATCCGTCAGCAGCTCGATCATTCT
CATGTCTCTATGTCTTGA
Protein sequenceShow/hide protein sequence
MLIRMFQFLKKKFEVIKLSKATNSQTNTKRANLSVARKEKIKLPNYSDGRGGRTYHISEFLTHPSGIEAMLNKNALRSFQLLDANTYRCTLPALQLLNFEAAPTLDLRVI
ATDEDCRVEMLSCKFEGSELVERQNKHFSALMINHLTWETVDSNSFLGVDVKLDLSLEIYTLPFTLMPTAAVENPGNLMLQALLDRLVPLLLRQLVQDYEKWIRQQLDHS
HVSMS