; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025036 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025036
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionArabidopsis protein of unknown function (DUF241)
Genome locationtig00003063:999122..999724
RNA-Seq ExpressionSgr025036
SyntenySgr025036
Gene Ontology termsNA
InterPro domainsIPR004320 - Protein of unknown function DUF241, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144941.1 uncharacterized protein LOC101220720 [Cucumis sativus]6.4e-8180Show/hide
Query:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD
        MDS A+N KK+LH RSNSLPSK HPIV QVDEHLCRLKSSEAT STSSLCH+L  LQDLHDCIDKLLLLPFTQQT++N SDNKW DD LEGSL++L+LCD
Subjt:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD

Query:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
        IAKD+LLQTKEC +ELESVLRRRR EAVI+ DLQKCL+SRKMIKK+V KALKGI S CSQ+ EE+SAT+SLLKEVEA+T+S++ESVLSFIAGPKLPS+ S
Subjt:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS

XP_008448106.1 PREDICTED: uncharacterized protein LOC103490393 [Cucumis melo]1.2e-7980.5Show/hide
Query:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD
        MDS A+N KK+LH RSNSLPSK HPIV QVDEHLCRLKSSEAT STSSLCH+L  LQDLHDCIDKLLLLPFTQQT++N SDNKW DD LEGSL++L+LCD
Subjt:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD

Query:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
        IAKD+LLQTKEC +ELESVLRRRR EAVI+ DLQKCL+SRKMIKK V KALKGI S CSQ+ EE+SAT+SLLKEVEAVT+S+VESVLSFIAG KLPS+ S
Subjt:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS

XP_022135909.1 uncharacterized protein LOC111007747 [Momordica charantia]9.8e-8283.58Show/hide
Query:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD
        MDS A+ PKKSLH RSNSLPSK HPIVAQVDEHLCRLKSSEAT STSSL  +L  LQDL  CIDKLLLLPFTQQT++N SDNKWVDDLLEGSLRLLDLCD
Subjt:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD

Query:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
        IAKD+LLQTKECA+ELESVLRRRRGE V ASDL+KCL+SR MIKK++HKALKGI  KCSQK EESSAT+ LLKEVEA+TYS+VESVLSFIAG KLPSKLS
Subjt:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS

Query:  R
        R
Subjt:  R

XP_022953010.1 uncharacterized protein LOC111455527 [Cucurbita moschata]4.6e-7979Show/hide
Query:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDI
        MDS A+NPKKSLH RSNSLPSK HPIV+QV+EHLCRLKSSEATS+SSLCH+L  LQDLHD I+ LLLLP TQQT+++ + NKW DDLLEGSLRLLDLCDI
Subjt:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDI

Query:  AKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLSR
        AKD+LLQT+EC+ ELESVLRRR+ E VI S +QKCLNSRKMIKK+VHKALKGI S+ SQK EESSA +SLL+EVEAVTYS+VESVLSFIAGPK+PSK+SR
Subjt:  AKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLSR

XP_038888982.1 uncharacterized protein LOC120078746 [Benincasa hispida]9.2e-8080.6Show/hide
Query:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD
        MDS A+N KK+LH RSNSLPSK HPIV QVDEHLCRLKSSEAT STSSLC +L  L DLHDCID LLLLPFTQQT++N SDNKW DDLLEGSLR+L+LCD
Subjt:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD

Query:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
        IAKD+LLQTKEC  ELESVLRRRRGE VIASDLQKCL+SRKMIKK+VHK LKGI S  SQ+ EE+SAT++LLKEVEA+T+S+VESVLSFIAG KLPSKLS
Subjt:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS

Query:  R
        R
Subjt:  R

TrEMBL top hitse value%identityAlignment
A0A0A0K2S2 Uncharacterized protein3.1e-8180Show/hide
Query:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD
        MDS A+N KK+LH RSNSLPSK HPIV QVDEHLCRLKSSEAT STSSLCH+L  LQDLHDCIDKLLLLPFTQQT++N SDNKW DD LEGSL++L+LCD
Subjt:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD

Query:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
        IAKD+LLQTKEC +ELESVLRRRR EAVI+ DLQKCL+SRKMIKK+V KALKGI S CSQ+ EE+SAT+SLLKEVEA+T+S++ESVLSFIAGPKLPS+ S
Subjt:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS

A0A1S3BJU7 uncharacterized protein LOC1034903935.8e-8080.5Show/hide
Query:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD
        MDS A+N KK+LH RSNSLPSK HPIV QVDEHLCRLKSSEAT STSSLCH+L  LQDLHDCIDKLLLLPFTQQT++N SDNKW DD LEGSL++L+LCD
Subjt:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD

Query:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
        IAKD+LLQTKEC +ELESVLRRRR EAVI+ DLQKCL+SRKMIKK V KALKGI S CSQ+ EE+SAT+SLLKEVEAVT+S+VESVLSFIAG KLPS+ S
Subjt:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS

A0A5D3DI43 Uncharacterized protein5.8e-8080.5Show/hide
Query:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD
        MDS A+N KK+LH RSNSLPSK HPIV QVDEHLCRLKSSEAT STSSLCH+L  LQDLHDCIDKLLLLPFTQQT++N SDNKW DD LEGSL++L+LCD
Subjt:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD

Query:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
        IAKD+LLQTKEC +ELESVLRRRR EAVI+ DLQKCL+SRKMIKK V KALKGI S CSQ+ EE+SAT+SLLKEVEAVT+S+VESVLSFIAG KLPS+ S
Subjt:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS

A0A6J1C2T6 uncharacterized protein LOC1110077474.8e-8283.58Show/hide
Query:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD
        MDS A+ PKKSLH RSNSLPSK HPIVAQVDEHLCRLKSSEAT STSSL  +L  LQDL  CIDKLLLLPFTQQT++N SDNKWVDDLLEGSLRLLDLCD
Subjt:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEAT-STSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCD

Query:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
        IAKD+LLQTKECA+ELESVLRRRRGE V ASDL+KCL+SR MIKK++HKALKGI  KCSQK EESSAT+ LLKEVEA+TYS+VESVLSFIAG KLPSKLS
Subjt:  IAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS

Query:  R
        R
Subjt:  R

A0A6J1GNH6 uncharacterized protein LOC1114555272.2e-7979Show/hide
Query:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDI
        MDS A+NPKKSLH RSNSLPSK HPIV+QV+EHLCRLKSSEATS+SSLCH+L  LQDLHD I+ LLLLP TQQT+++ + NKW DDLLEGSLRLLDLCDI
Subjt:  MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDI

Query:  AKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLSR
        AKD+LLQT+EC+ ELESVLRRR+ E VI S +QKCLNSRKMIKK+VHKALKGI S+ SQK EESSA +SLL+EVEAVTYS+VESVLSFIAGPK+PSK+SR
Subjt:  AKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G17070.1 Arabidopsis protein of unknown function (DUF241)1.5e-3544.27Show/hide
Query:  SLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEATST---SSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDIAKDSLLQ
        S H RS+S PS  HP  A VDE L RL+SSE TST   SS+C +L  LQ+LH+ +DKL+ LP TQQ +    + K V+ LL+GSL++LD+C+I+KD+L Q
Subjt:  SLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEATST---SSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDIAKDSLLQ

Query:  TKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
         KE   E++S+LRR+RG+  ++ +++K L SRK  KK+  K  K +  K +Q  +    ++++  E EAVT +  +S+ S+++G K  SK S
Subjt:  TKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS

AT2G17080.1 Arabidopsis protein of unknown function (DUF241)2.4e-3845.83Show/hide
Query:  SLHTRSNSLPSKSHPIVAQVDEHLCRLKSSE---ATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDIAKDSLLQ
        S H RSNS PS+SHP  A VDE L RL+SSE   ++S+SS+C +L  LQ+LH+ +DKL+  P TQQ +    + K V+ LL+GSLR+LDLC+I+KD+L +
Subjt:  SLHTRSNSLPSKSHPIVAQVDEHLCRLKSSE---ATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDIAKDSLLQ

Query:  TKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
         KE   E++S+LRR+RG+  ++ +++K L SRK +KKS  K  K +  K +Q  + +  T+++  E EA+T S  +S+LS+++G K  SK S
Subjt:  TKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS

AT2G17680.1 Arabidopsis protein of unknown function (DUF241)2.9e-1530.88Show/hide
Query:  HTRSNSLPSKSHPIVAQVDEHLCR----LKSSEATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNK----------WVDDLLEGSLRLLDLC
        H RS SL S+SHP  A ++E L +    + +S   S+ S+   L GL+DL+DC + LL +  TQ+ +++ SD K          +++++L+GSLRL+D+C
Subjt:  HTRSNSLPSKSHPIVAQVDEHLCR----LKSSEATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNK----------WVDDLLEGSLRLLDLC

Query:  DIAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHK---ALKGINSKCSQKIE--------ESSATISLLKEVEAVTYSSVESVLS
        ++++D +++T E    L+S +RRR+       D+   +  RK ++K V K   +LK IN     +             A I  ++ V  +T S ++S   
Subjt:  DIAKDSLLQTKECAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHK---ALKGINSKCSQKIE--------ESSATISLLKEVEAVTYSSVESVLS

Query:  FIAG
        F++G
Subjt:  FIAG

AT4G35200.1 Arabidopsis protein of unknown function (DUF241)2.8e-3443.39Show/hide
Query:  SLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDIAKDSLLQTKE
        S H RSNS PS+ HP  A VDE L RL+SS++ S+SS+C +L  LQDLHD ++K++ L  T      A     ++ LL+GSLR+LDLC+IAKD++ Q KE
Subjt:  SLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDIAKDSLLQTKE

Query:  CAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
           E++S+LRR+ G+  ++ +++K L SRK +KKS+ K +K +  K  Q  + ++A++ +    EAVT +  ES+ SF++G K   K S
Subjt:  CAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS

AT4G35210.1 Arabidopsis protein of unknown function (DUF241)6.9e-3341.27Show/hide
Query:  SLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDIAKDSLLQTKE
        S H RS+S PS+ HP  A VDE L RL+SS   S+SS+C +L  LQDLHD ++K++ L  T Q    A     ++ LL+GS+++LDLC I+KD L Q KE
Subjt:  SLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDIAKDSLLQTKE

Query:  CAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS
          +E++S++RR+RG+  ++++++K L SRK +KKS  K LK + +      +  +  +++  E E VT +  ES+ SF++G K   K S
Subjt:  CAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCTCTGCTTTGAATCCAAAGAAATCACTCCACACTCGCTCAAATAGCTTGCCCTCAAAGTCACATCCAATCGTTGCCCAAGTTGATGAACATTTGTGCAGATT
GAAGTCCTCTGAAGCTACTTCCACTTCTTCTTTATGCCACAAACTCAGAGGCCTTCAAGATTTGCATGATTGCATTGATAAGTTACTTCTTCTACCATTCACCCAACAGA
CTATTATCAATGCGAGTGATAATAAATGGGTTGATGACTTGCTAGAAGGATCTCTAAGGCTCTTAGATTTGTGTGATATTGCTAAAGATTCATTGTTGCAGACAAAAGAA
TGTGCACAGGAACTAGAATCAGTTTTGCGCAGAAGAAGAGGTGAGGCAGTAATTGCTAGTGATCTTCAGAAATGCTTAAATTCAAGGAAAATGATAAAGAAGTCGGTCCA
CAAGGCATTGAAGGGAATCAACAGCAAGTGTTCTCAGAAAATTGAAGAAAGTTCAGCAACCATTAGTTTGCTAAAAGAAGTAGAAGCAGTCACATATAGCAGTGTTGAAT
CAGTATTGTCCTTCATAGCAGGACCAAAGTTGCCGTCAAAGTTGAGTCGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCTCTGCTTTGAATCCAAAGAAATCACTCCACACTCGCTCAAATAGCTTGCCCTCAAAGTCACATCCAATCGTTGCCCAAGTTGATGAACATTTGTGCAGATT
GAAGTCCTCTGAAGCTACTTCCACTTCTTCTTTATGCCACAAACTCAGAGGCCTTCAAGATTTGCATGATTGCATTGATAAGTTACTTCTTCTACCATTCACCCAACAGA
CTATTATCAATGCGAGTGATAATAAATGGGTTGATGACTTGCTAGAAGGATCTCTAAGGCTCTTAGATTTGTGTGATATTGCTAAAGATTCATTGTTGCAGACAAAAGAA
TGTGCACAGGAACTAGAATCAGTTTTGCGCAGAAGAAGAGGTGAGGCAGTAATTGCTAGTGATCTTCAGAAATGCTTAAATTCAAGGAAAATGATAAAGAAGTCGGTCCA
CAAGGCATTGAAGGGAATCAACAGCAAGTGTTCTCAGAAAATTGAAGAAAGTTCAGCAACCATTAGTTTGCTAAAAGAAGTAGAAGCAGTCACATATAGCAGTGTTGAAT
CAGTATTGTCCTTCATAGCAGGACCAAAGTTGCCGTCAAAGTTGAGTCGATAG
Protein sequenceShow/hide protein sequence
MDSSALNPKKSLHTRSNSLPSKSHPIVAQVDEHLCRLKSSEATSTSSLCHKLRGLQDLHDCIDKLLLLPFTQQTIINASDNKWVDDLLEGSLRLLDLCDIAKDSLLQTKE
CAQELESVLRRRRGEAVIASDLQKCLNSRKMIKKSVHKALKGINSKCSQKIEESSATISLLKEVEAVTYSSVESVLSFIAGPKLPSKLSR