; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr001808 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr001808
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationtig00001138:58088..65332
RNA-Seq ExpressionSgr001808
SyntenySgr001808
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR016969 - Uncharacterised conserved protein UCP031088, alpha/beta hydrolase, At1g15070
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152293.1 uncharacterized protein LOC111020046 [Momordica charantia]1.2e-26584.99Show/hide
Query:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
        MATVQSD+RCAYHIASSTFRSL+PNL+LRR AAVP  +FS SFKLRAFST AA    V+VSEKPSICTADELHYVSVPNSDWRLALWRYH SPQAPPRNH
Subjt:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH

Query:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK
        PLLLLSGVGTNAIGYDLAPGCSFAR+MSGQGYDTWILEVRGAGLSLQEP+ KEIEHSANVKSE+MEA SE+K+NGTL MAE STKILND+SKS+SC NGK
Subjt:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK

Query:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ
        ESD S+VEEE F GI TIWDESSLVT+LTETFMRLSERLSGFLSEGQSKIM AKLFDQ+SKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRD SQ
Subjt:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ

Query:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA
        RLVEIIEEGQRSVSPPLFNLQDRFSSTI+DFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEG+DPRLA
Subjt:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA

Query:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------
        AIVTLASSLDYTSSKS LKLLLPLADPAQALNVPVVPLGALLSASYPLSS PPYVLSWLNNLISAEDMM PEMLKKLVLNNF                  
Subjt:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------

Query:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL
                              LALAGDQDLICPPEAVE TAKLIP+HL+TYKV GE GGPHYAHYDLVGGRL
Subjt:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL

XP_022939626.1 uncharacterized protein LOC111445461 isoform X1 [Cucurbita moschata]4.2e-25880.92Show/hide
Query:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
        MATVQSD+  AYHIASSTF S + NL  RR AAVP KL   SFKLRAFSTGA   AAVRV +KPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
Subjt:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH

Query:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK
        P+LLLSGVGTNAIGYDLAP CSFARYMSGQGYDTWILEVRGAGLSLQEPN KEIEHSA VKSE MEA S++KING+L++A  STK  N+++KS+S INGK
Subjt:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK

Query:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ
        ESDFSIVEEEDFIGITTIWDESSLV++LTETFMRLSERLSGFLSEGQS+IM AKLFDQISKLLV+SQLSERFNEVRG L NLLETGQTSVI  QIRD SQ
Subjt:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ

Query:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA
        RLVEIIE+GQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDH+LLED+P AIDYIRA+ KPKDGKLLA+GHSMGGILLYAKLSR GFEGRDPRLA
Subjt:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA

Query:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------
        AIVT+ASSLDYTSS S LKLLLPLADPAQALNVPVVPLGALLSASYPLSS PPY LSWLNNLISAEDMMHPEMLKKLVLNNF                  
Subjt:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------

Query:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRLVGFSFPSIYSLSWT
                              LALAGDQDLICPP AVE TAKLIP+HL+TYK  GEPGGPHYAHYDLVGGRLV FSFP  YS SWT
Subjt:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRLVGFSFPSIYSLSWT

XP_022993272.1 uncharacterized protein LOC111489336 isoform X1 [Cucurbita maxima]5.0e-25981.6Show/hide
Query:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
        MATVQSD+  AYHIASSTF S + NL  RR AAVP KL   SFKLRAFSTGA   AAVRV +KPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
Subjt:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH

Query:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK
        P+LLLSGVGTNAIGYDLAP CSFARYMSGQGYDTWILEVRGAGLSLQEPN KEIEHSA VKSEKMEA SE KINGTL +A  STK  N+++KS+S INGK
Subjt:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK

Query:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ
        ESDFSIVEEEDFIGITTIWDESSLV++LTETFMRLSERLSGFLSEGQS+IM AKLFDQISKLLV+SQLSERFNEVRG L NLLETGQTSVI  QIRD SQ
Subjt:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ

Query:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA
        RLVEIIE+GQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDH+LLED+P AIDYIRA+ KPKDGKLLAIGHSMGGILLYAKLSR GFEGRDPRLA
Subjt:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA

Query:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------
        AIVT+ASSLDYTSS S LKLLLPLADPAQALNVPVVPLGALLSASYPLSS PPY LSWLNNLISAEDMMHPEMLKKLVLNNF                  
Subjt:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------

Query:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRLVGFSFPSIYSLSWT
                              LALAGDQDLICPP AVE TAKLIP+HL+TYK  GEPGGPHYAHYDLVGGRLV FSFP  YS SWT
Subjt:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRLVGFSFPSIYSLSWT

XP_023550263.1 uncharacterized protein LOC111808492 isoform X1 [Cucurbita pepo subsp. pepo]1.9e-25881.09Show/hide
Query:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
        MATVQSD+  AYHIASSTF S + NL  +R AAVP KL   SFKLRAFSTGA   AAVRV +K SICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
Subjt:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH

Query:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK
        P+LLLSGVGTNAIGYDLAP CSFARYMSGQGYDTWILEVRGAGLSLQEPN KEIEHSA VKSEKMEA SE+KING+L++A  STK  N+++KS+S INGK
Subjt:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK

Query:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ
        ESDFSIVEEEDFIGITTIWDESSLV++LTETFMRLSERLSGFLSEGQS+IM AKLFDQISKLLV+SQLSERFNEVRG L NLLETGQTSVI  QIRD SQ
Subjt:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ

Query:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA
        RLVEIIE+GQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDH+LLED+P AIDYIRA+ KPKDGKLLA+GHSMGGILLYAKLSR GFEGRDPRLA
Subjt:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA

Query:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------
        AIVT+ASSLDYTSS S LKLLLPLADPAQALNVPVVPLGALLSASYPLSS PPY LSWLNNLISAEDMMHPEMLKKLVLNNF                  
Subjt:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------

Query:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRLVGFSFPSIYSLSWT
                              LALAGDQDLICPP AVE TAKLIP+HL+TYK  GEPGGPHYAHYDLVGGRLV FSFP  YSLSWT
Subjt:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRLVGFSFPSIYSLSWT

XP_038907162.1 uncharacterized protein LOC120092965 [Benincasa hispida]1.0e-26483.94Show/hide
Query:  MATVQSDIRCAYHIASSTFRSLAPNLMLRRAAV-PAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
        MATVQSDIRC YHIASSTFRSL+PNLM RR A    KL   SFKLRAFSTG     AVR+ EKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
Subjt:  MATVQSDIRCAYHIASSTFRSLAPNLMLRRAAV-PAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH

Query:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK
        PLLLLSGVGTNAIGYDLAPGCSFARYMSGQG+DTWILEVRGAGLSL+EPN K IEHSA VKS+KMEAASE+KINGTLH+AE STKILND++KSDSCINGK
Subjt:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK

Query:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ
        ESDFS+VEEE+FIGITTIWDESSLV++LTETFMRLSERLSGFLSEGQS++M AKLFDQISKLLVDSQLSERFNEVRGRL NLLETGQTSVIAGQIRD SQ
Subjt:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ

Query:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA
        RLVEIIE+GQRSVSPPLFNLQDRFSSTI+DFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRA+SKPKDGKLLAIGHSMGGILLYA+LSRCGFEGRDPR A
Subjt:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA

Query:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------
        AIVTLASSLDYTSSKS LKLLLPLADPAQALNVPVVPLGALLSASYPLSS PPYV SWLNNLISAEDMM PEMLKKLVLNNF                  
Subjt:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------

Query:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL
                              LALAGDQDLICPP AVEETAKLIP+HL+TYK  GEPGGPHYAHYDLVGGRL
Subjt:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL

TrEMBL top hitse value%identityAlignment
A0A6J1DEH2 uncharacterized protein LOC1110200466.0e-26684.99Show/hide
Query:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
        MATVQSD+RCAYHIASSTFRSL+PNL+LRR AAVP  +FS SFKLRAFST AA    V+VSEKPSICTADELHYVSVPNSDWRLALWRYH SPQAPPRNH
Subjt:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH

Query:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK
        PLLLLSGVGTNAIGYDLAPGCSFAR+MSGQGYDTWILEVRGAGLSLQEP+ KEIEHSANVKSE+MEA SE+K+NGTL MAE STKILND+SKS+SC NGK
Subjt:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK

Query:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ
        ESD S+VEEE F GI TIWDESSLVT+LTETFMRLSERLSGFLSEGQSKIM AKLFDQ+SKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRD SQ
Subjt:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ

Query:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA
        RLVEIIEEGQRSVSPPLFNLQDRFSSTI+DFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEG+DPRLA
Subjt:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA

Query:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------
        AIVTLASSLDYTSSKS LKLLLPLADPAQALNVPVVPLGALLSASYPLSS PPYVLSWLNNLISAEDMM PEMLKKLVLNNF                  
Subjt:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------

Query:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL
                              LALAGDQDLICPPEAVE TAKLIP+HL+TYKV GE GGPHYAHYDLVGGRL
Subjt:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL

A0A6J1E189 uncharacterized protein LOC1114296736.6e-25782.12Show/hide
Query:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
        MATVQSDIRCAYHIASSTFRSL+P    RR AAVP KL   SFKLRAFST     AAVRV EKPSICTADELHY SVPNSDWRLALWRY PSPQAPPRNH
Subjt:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH

Query:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSC---I
        PLLLLSGVGTNAIGYDLAPGCSFAR+MSGQGYDTWILEVRGAGLSLQEPN KEIEHSA VKSEKMEA SE+KINGT+H+ + STKIL+D+SKSD+C   I
Subjt:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSC---I

Query:  NGKESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRD
        NGKESDFS+VEEEDFIGI TIWDESS+V++LTETFMRLSERLSGFLSEGQS+IM AKLFDQISKLLVDSQLSERFNEVR +L  LLETGQTSVIAGQIRD
Subjt:  NGKESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRD

Query:  SSQRLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDP
         SQRLVEIIEEGQRSVSPPLFNLQDRFSSTI+DFQKQLDLIVKYDWDFDHYLLEDVPAA+DYI A+SKPKDGKLLAIGHSMGGILLYA+LSRCGFEGRDP
Subjt:  SSQRLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDP

Query:  RLAAIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF---------------
        RLAAIVTLASSLDYTSSKS LK+LLPLADPAQALNVPVVPLGALLSASYPLSS  PYVLSWLN+LISAEDMM PEMLKKLVLNNF               
Subjt:  RLAAIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF---------------

Query:  ------------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL
                                 LALAGDQDLICPP AVEETAKLIP+HL++YK  GEPGGPHYAHYDLVGGRL
Subjt:  ------------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL

A0A6J1FGG5 uncharacterized protein LOC111445461 isoform X12.1e-25880.92Show/hide
Query:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
        MATVQSD+  AYHIASSTF S + NL  RR AAVP KL   SFKLRAFSTGA   AAVRV +KPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
Subjt:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH

Query:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK
        P+LLLSGVGTNAIGYDLAP CSFARYMSGQGYDTWILEVRGAGLSLQEPN KEIEHSA VKSE MEA S++KING+L++A  STK  N+++KS+S INGK
Subjt:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK

Query:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ
        ESDFSIVEEEDFIGITTIWDESSLV++LTETFMRLSERLSGFLSEGQS+IM AKLFDQISKLLV+SQLSERFNEVRG L NLLETGQTSVI  QIRD SQ
Subjt:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ

Query:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA
        RLVEIIE+GQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDH+LLED+P AIDYIRA+ KPKDGKLLA+GHSMGGILLYAKLSR GFEGRDPRLA
Subjt:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA

Query:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------
        AIVT+ASSLDYTSS S LKLLLPLADPAQALNVPVVPLGALLSASYPLSS PPY LSWLNNLISAEDMMHPEMLKKLVLNNF                  
Subjt:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------

Query:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRLVGFSFPSIYSLSWT
                              LALAGDQDLICPP AVE TAKLIP+HL+TYK  GEPGGPHYAHYDLVGGRLV FSFP  YS SWT
Subjt:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRLVGFSFPSIYSLSWT

A0A6J1I6P7 uncharacterized protein LOC1114704637.8e-25882.37Show/hide
Query:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
        M TVQSDIRCAYHIASSTFRSL+P+   RR AA P KL   SFKLRAFST     AAVRV EKPSICTADELHY SVPNSDWRLALWRY PSPQAPPRNH
Subjt:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH

Query:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK
        PLLLLSGVGTNAIGYDLAPGCSFAR+MSGQGYDTWILEVRGAGLSLQEPN KEIEHSA VKSEKMEA SE+KINGT+H+ E STKIL D+SKSD+CINGK
Subjt:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK

Query:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ
        ESDFS+VEEEDFIGI TIWDESS+V++LTETFMRLSERLSGFLSEGQS+IM AKLFDQISKLLVDSQLSERFNEVR +L  LLETGQTSVIAGQIRD SQ
Subjt:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ

Query:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA
        RLVEIIEEGQRSVSPPLFNLQDRFSSTI+DFQKQLDLIVKYDWDFDHYLLEDVPAA+DYI A+SKPKDGKLLAIGHSMGGILLYA+LSRCGFEGRDPRLA
Subjt:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA

Query:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------
        AIVTLASSLDYTSSKS LK+LLPLADPAQALNVPVVPLGALLSASYPLSS  PYVLSWLN+LISAEDMM PEMLKKLVLNNF                  
Subjt:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------

Query:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL
                              LALAGDQDLICPP AVEETAKLIP+HL++YK  GEPGGPHYAHYDLVGGRL
Subjt:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL

A0A6J1JSB9 uncharacterized protein LOC111489336 isoform X12.4e-25981.6Show/hide
Query:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
        MATVQSD+  AYHIASSTF S + NL  RR AAVP KL   SFKLRAFSTGA   AAVRV +KPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH
Subjt:  MATVQSDIRCAYHIASSTFRSLAPNLMLRR-AAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNH

Query:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK
        P+LLLSGVGTNAIGYDLAP CSFARYMSGQGYDTWILEVRGAGLSLQEPN KEIEHSA VKSEKMEA SE KINGTL +A  STK  N+++KS+S INGK
Subjt:  PLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGK

Query:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ
        ESDFSIVEEEDFIGITTIWDESSLV++LTETFMRLSERLSGFLSEGQS+IM AKLFDQISKLLV+SQLSERFNEVRG L NLLETGQTSVI  QIRD SQ
Subjt:  ESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQ

Query:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA
        RLVEIIE+GQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDH+LLED+P AIDYIRA+ KPKDGKLLAIGHSMGGILLYAKLSR GFEGRDPRLA
Subjt:  RLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA

Query:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------
        AIVT+ASSLDYTSS S LKLLLPLADPAQALNVPVVPLGALLSASYPLSS PPY LSWLNNLISAEDMMHPEMLKKLVLNNF                  
Subjt:  AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------

Query:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRLVGFSFPSIYSLSWT
                              LALAGDQDLICPP AVE TAKLIP+HL+TYK  GEPGGPHYAHYDLVGGRLV FSFP  YS SWT
Subjt:  ---------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRLVGFSFPSIYSLSWT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15060.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase2.2e-18059.06Show/hide
Query:  DIRCAYHIASST---FRSLA-----PNLMLRRAAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRN
        +IR A   ASST    RS++     P+   R   +  + FSSS              +V++  KPS+CTADELHYVSVPN+DWRLALWRY P PQAP RN
Subjt:  DIRCAYHIASST---FRSLA-----PNLMLRRAAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRN

Query:  HPLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCING
        HPLLLLSGVGTNAIGYDL+PGCSFAR+MSGQG++TWILEVRGAGLS +  + K++E SA+  S ++E+ +           E  T    D+   DS    
Subjt:  HPLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCING

Query:  KESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSS
          SD S+V      G  + WDES LV +LT TFM LSERLSGFLSEGQS  M AKLFD+I+ L+ D+QL ERFN++R +LL+L+E+ Q S +  Q+RD +
Subjt:  KESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSS

Query:  QRLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRL
        QRLV + ++GQRSVSPPL +LQ+R ++TIEDFQKQLDLIVKYDWDFDHYL EDVPAAI+Y+RA SKPKDGKL AIGHSMGGILLYA LSRC FEGR+P +
Subjt:  QRLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRL

Query:  AAIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF-----------------
        AA+ TLASS+DYT+S S LKLL+PLA+PA+AL+VPVVPLGALL+A++PLS+ PPYVLSWLN+LIS+ DMMHPEML+KLVLNNF                 
Subjt:  AAIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF-----------------

Query:  ----------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL
                               LALAGD+DLICPP AVE+T KL P++L+TYK++GEP GPHYAHYDLVGGRL
Subjt:  ----------------------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGRL

AT1G73750.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase8.3e-10339.64Show/hide
Query:  YHIASSTFRSLAPNLMLRRAAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNHPLLLLSGVGTNA
        + ++S++F S      L      ++ F SS +L    + +  I +V  +    ICTADELHYV VPNSDWR+ALWRY PSP+AP RNHPLLLLSG+GTNA
Subjt:  YHIASSTFRSLAPNLMLRRAAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTADELHYVSVPNSDWRLALWRYHPSPQAPPRNHPLLLLSGVGTNA

Query:  IGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGKESDFSIVEEEDF
        + YDL+P CSFAR MSG G+DTWILE+RGAGLS                                     S  +  ++ K +                  
Subjt:  IGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMAEVSTKILNDVSKSDSCINGKESDFSIVEEEDF

Query:  IGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQRLVEIIEEGQRS
               ++  +V+ L E F+ +SERL   L +G SKI+                                                             
Subjt:  IGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSVIAGQIRDSSQRLVEIIEEGQRS

Query:  VSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLAAIVTLASSLDYT
               +QDR S    DF+++ +LI  Y+WDFD+YL EDVP+A+DY+R  +K KDGKLLA+GHSMGGILLYA LSRCGF+G D  LA + TLAS+ DY+
Subjt:  VSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLAAIVTLASSLDYT

Query:  SSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------------------
        SS + LK LLP+ +PAQA+N+P++P+  +L+ ++PL   PPY LSWL   ISA  MM PE+++KLVLN+                               
Subjt:  SSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNF------------------------------

Query:  ---------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGR
                  LALAGD D+ICPP+AV +T KLIP+HL TYKVVG PGGPHY H DL+ GR
Subjt:  ---------FLALAGDQDLICPPEAVEETAKLIPKHLITYKVVGEPGGPHYAHYDLVGGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AAAAAACAAAGGGAAAAAAAAGGGAAGAAGAAAAAGAAAAAAAGAGGATTCTCCGTTCTTCATTTACGCAACGGAAAAAAGAGCACATTGGTCGGTGACGCTCTCGCGGA
ACCGCCTCCGGAGATGGCGACGGTGCAATCTGATATCCGCTGTGCTTATCACATAGCTTCCTCCACGTTCCGCTCCCTCGCCCCCAATCTCATGCTCCGCCGTGCCGCCG
TTCCAGCTAAACTTTTCTCCTCGTCCTTCAAGTTGAGAGCCTTCTCCACCGGTGCTGCCGGCATAGCTGCCGTTAGGGTATCCGAGAAGCCGTCGATCTGTACGGCAGAT
GAGCTTCATTATGTCTCCGTACCAAATTCCGACTGGAGGCTCGCACTCTGGCGCTACCATCCCTCCCCTCAGGCGCCTCCGAGGAATCACCCGCTGTTGCTGTTATCAGG
AGTCGGGACTAATGCCATTGGTTATGATCTCGCCCCTGGGTGTTCTTTTGCACGGTATATGTCTGGCCAGGGGTATGATACATGGATTCTTGAAGTTCGAGGTGCAGGAC
TAAGCTTGCAGGAACCAAATTTCAAAGAAATTGAACATTCAGCTAATGTTAAATCTGAGAAAATGGAAGCAGCCTCTGAGAACAAAATTAATGGAACTTTACATATGGCA
GAAGTGTCAACAAAAATTCTAAATGACGTCTCAAAGTCGGATAGTTGCATCAATGGGAAAGAATCTGACTTTTCTATAGTTGAAGAAGAAGACTTCATAGGAATAACAAC
AATTTGGGATGAGTCAAGTCTGGTGACAAAGTTAACAGAAACTTTTATGCGTTTGTCAGAACGGCTATCTGGCTTTCTAAGTGAAGGTCAATCAAAGATTATGTTTGCCA
AACTATTTGATCAAATTTCAAAACTATTAGTTGATTCCCAATTATCTGAACGTTTTAATGAGGTAAGGGGAAGACTTTTAAATTTGCTGGAAACAGGGCAGACCTCAGTA
ATTGCTGGCCAGATCAGGGATTCGAGTCAAAGGCTTGTAGAAATTATTGAAGAAGGTCAACGATCTGTTTCACCTCCATTGTTCAATTTGCAAGATCGGTTTTCTTCAAC
GATCGAAGATTTTCAGAAACAACTTGATTTAATAGTAAAGTATGACTGGGACTTTGATCATTACCTGCTGGAGGACGTTCCTGCTGCGATTGATTATATAAGGGCTGTAA
GCAAGCCAAAGGATGGTAAATTGCTTGCTATTGGGCACTCCATGGGTGGTATTTTGCTTTATGCAAAACTTTCTCGTTGTGGTTTTGAGGGACGAGACCCCAGATTGGCT
GCTATTGTTACTTTGGCATCATCCCTTGACTATACTTCTTCAAAATCAACCCTGAAATTGCTATTGCCCCTTGCAGACCCTGCACAGGCTCTTAATGTTCCTGTTGTTCC
ATTGGGAGCATTGCTATCAGCTTCATATCCTCTTTCATCCCATCCTCCTTATGTCTTATCTTGGCTAAATAATCTGATTTCGGCAGAAGATATGATGCATCCTGAAATGT
TAAAAAAGCTTGTCTTGAACAACTTCTTCTTAGCACTTGCTGGAGACCAGGATCTAATCTGTCCACCTGAAGCTGTAGAAGAAACGGCAAAGCTCATTCCCAAGCACCTG
ATTACTTATAAAGTTGTTGGAGAACCAGGAGGTCCACATTATGCACATTACGATTTAGTTGGAGGCCGATTGGTAGGATTTTCGTTCCCTTCCATCTATTCATTGTCATG
GACTTCATAG
mRNA sequenceShow/hide mRNA sequence
AAAAAACAAAGGGAAAAAAAAGGGAAGAAGAAAAAGAAAAAAAGAGGATTCTCCGTTCTTCATTTACGCAACGGAAAAAAGAGCACATTGGTCGGTGACGCTCTCGCGGA
ACCGCCTCCGGAGATGGCGACGGTGCAATCTGATATCCGCTGTGCTTATCACATAGCTTCCTCCACGTTCCGCTCCCTCGCCCCCAATCTCATGCTCCGCCGTGCCGCCG
TTCCAGCTAAACTTTTCTCCTCGTCCTTCAAGTTGAGAGCCTTCTCCACCGGTGCTGCCGGCATAGCTGCCGTTAGGGTATCCGAGAAGCCGTCGATCTGTACGGCAGAT
GAGCTTCATTATGTCTCCGTACCAAATTCCGACTGGAGGCTCGCACTCTGGCGCTACCATCCCTCCCCTCAGGCGCCTCCGAGGAATCACCCGCTGTTGCTGTTATCAGG
AGTCGGGACTAATGCCATTGGTTATGATCTCGCCCCTGGGTGTTCTTTTGCACGGTATATGTCTGGCCAGGGGTATGATACATGGATTCTTGAAGTTCGAGGTGCAGGAC
TAAGCTTGCAGGAACCAAATTTCAAAGAAATTGAACATTCAGCTAATGTTAAATCTGAGAAAATGGAAGCAGCCTCTGAGAACAAAATTAATGGAACTTTACATATGGCA
GAAGTGTCAACAAAAATTCTAAATGACGTCTCAAAGTCGGATAGTTGCATCAATGGGAAAGAATCTGACTTTTCTATAGTTGAAGAAGAAGACTTCATAGGAATAACAAC
AATTTGGGATGAGTCAAGTCTGGTGACAAAGTTAACAGAAACTTTTATGCGTTTGTCAGAACGGCTATCTGGCTTTCTAAGTGAAGGTCAATCAAAGATTATGTTTGCCA
AACTATTTGATCAAATTTCAAAACTATTAGTTGATTCCCAATTATCTGAACGTTTTAATGAGGTAAGGGGAAGACTTTTAAATTTGCTGGAAACAGGGCAGACCTCAGTA
ATTGCTGGCCAGATCAGGGATTCGAGTCAAAGGCTTGTAGAAATTATTGAAGAAGGTCAACGATCTGTTTCACCTCCATTGTTCAATTTGCAAGATCGGTTTTCTTCAAC
GATCGAAGATTTTCAGAAACAACTTGATTTAATAGTAAAGTATGACTGGGACTTTGATCATTACCTGCTGGAGGACGTTCCTGCTGCGATTGATTATATAAGGGCTGTAA
GCAAGCCAAAGGATGGTAAATTGCTTGCTATTGGGCACTCCATGGGTGGTATTTTGCTTTATGCAAAACTTTCTCGTTGTGGTTTTGAGGGACGAGACCCCAGATTGGCT
GCTATTGTTACTTTGGCATCATCCCTTGACTATACTTCTTCAAAATCAACCCTGAAATTGCTATTGCCCCTTGCAGACCCTGCACAGGCTCTTAATGTTCCTGTTGTTCC
ATTGGGAGCATTGCTATCAGCTTCATATCCTCTTTCATCCCATCCTCCTTATGTCTTATCTTGGCTAAATAATCTGATTTCGGCAGAAGATATGATGCATCCTGAAATGT
TAAAAAAGCTTGTCTTGAACAACTTCTTCTTAGCACTTGCTGGAGACCAGGATCTAATCTGTCCACCTGAAGCTGTAGAAGAAACGGCAAAGCTCATTCCCAAGCACCTG
ATTACTTATAAAGTTGTTGGAGAACCAGGAGGTCCACATTATGCACATTACGATTTAGTTGGAGGCCGATTGGTAGGATTTTCGTTCCCTTCCATCTATTCATTGTCATG
GACTTCATAG
Protein sequenceShow/hide protein sequence
KKQREKKGKKKKKKRGFSVLHLRNGKKSTLVGDALAEPPPEMATVQSDIRCAYHIASSTFRSLAPNLMLRRAAVPAKLFSSSFKLRAFSTGAAGIAAVRVSEKPSICTAD
ELHYVSVPNSDWRLALWRYHPSPQAPPRNHPLLLLSGVGTNAIGYDLAPGCSFARYMSGQGYDTWILEVRGAGLSLQEPNFKEIEHSANVKSEKMEAASENKINGTLHMA
EVSTKILNDVSKSDSCINGKESDFSIVEEEDFIGITTIWDESSLVTKLTETFMRLSERLSGFLSEGQSKIMFAKLFDQISKLLVDSQLSERFNEVRGRLLNLLETGQTSV
IAGQIRDSSQRLVEIIEEGQRSVSPPLFNLQDRFSSTIEDFQKQLDLIVKYDWDFDHYLLEDVPAAIDYIRAVSKPKDGKLLAIGHSMGGILLYAKLSRCGFEGRDPRLA
AIVTLASSLDYTSSKSTLKLLLPLADPAQALNVPVVPLGALLSASYPLSSHPPYVLSWLNNLISAEDMMHPEMLKKLVLNNFFLALAGDQDLICPPEAVEETAKLIPKHL
ITYKVVGEPGGPHYAHYDLVGGRLVGFSFPSIYSLSWTS