; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023610 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023610
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionLanC-like protein
Genome locationtig00000892:4986906..5003772
RNA-Seq ExpressionSgr023610
SyntenySgr023610
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007112 - Expansin/pollen allergen, DPBB domain
IPR007117 - Expansin, cellulose-binding-like domain
IPR007822 - Lanthionine synthetase C-like
IPR009009 - RlpA-like protein, double-psi beta-barrel domain
IPR012341 - Six-hairpin glycosidase-like superfamily
IPR018499 - Tetraspanin/Peripherin
IPR020464 - LanC-like protein, eukaryotic
IPR036749 - Expansin, cellulose-binding-like domain superfamily
IPR036908 - RlpA-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010366.1 LanC-like protein GCL2 [Cucurbita argyrosperma subsp. argyrosperma]2.9e-13664.96Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSEST---------------------------------
        A+ +KE IVLETWGVTGQ+VRDFTLY+GALGTAFLLLKA+E+TSNHIDLSLCAQIVKACDQASS ST                                 
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSEST---------------------------------

Query:  --------DVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLGQFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEER
                DVTFICGRAG CAIGAVAAKRAGDEQLL YYLGQF EIKLPRNLPDELL G+VGFLWACL+LNK I EGT    H         ++G    +
Subjt:  --------DVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLGQFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEER

Query:  G--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFC
        G   PLMFEWYGERYWG AHGLAGI+H+LM+MELKP+E +DVKGTLR    N FPSGNYPSSEEDR RD LVHWCHGAPG+ALTLVKAA +         
Subjt:  G--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFC

Query:  KLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYVFL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLD
           FGEEEF+QAAVDAGEVVWR GLLKRVGICHG+SGNSYVFL                         AHRLIAEGEMHGGDS +S+FEGVGGMA+LFLD
Subjt:  KLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYVFL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLD

Query:  MIMPSMARFPG
        MI PSMA+FPG
Subjt:  MIMPSMARFPG

XP_022140440.1 lanC-like protein GCL2 [Momordica charantia]3.1e-14673.78Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG
        A+ +KE IVLETWGVTGQ+VRDFTLYSGALGTA LLLKA+E+TSNHIDLSLCAQIVKACDQASS STDVTFICGRAG CAIGAVAAKRAGDEQLL YYLG
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG

Query:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKG--AGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED
        +F EIKLP NLPDELL GRVGFLWACLFLNK I  GT    H         R+G    +  G PLMFEWYGERYWGAAHGLAGI+HVLM+MELKP+E ED
Subjt:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKG--AGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED

Query:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV
        VKGT+RYMIQN FPSGNYPSSEEDRNRD LVHWCHGAPGIALTLVKAA +            FGE+EF++AAV+AGEVVWR GLLKRVGICHGISGNSYV
Subjt:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV

Query:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG
        FL                         AHRLIAEGEMHGGDS  S+FEG+GGMAYLFLDMI+P+MARFPG
Subjt:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG

XP_022943721.1 lanC-like protein GCL2 isoform X1 [Cucurbita moschata]2.0e-14572.7Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG
        A+ +KE IVLETWGVTGQ+VRDFTLY+GALGTAFLLLKA+E+TSNHIDLSLCAQIVKACDQASS STDVTFICGRAG CAIG VAAKRAGDEQLL YYL 
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG

Query:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEERG--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED
        QF EIKLPRNLPDELL G+VGFLWACL+LNK I EGT    H         ++G    +G   PLMFEWYGERYWG AHGLAGI+H+LM+MELKP+E +D
Subjt:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEERG--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED

Query:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV
        VKGTLRYMI+N FPSGNYPSSEEDR RD LVHWCHGAPG+ALTLVKAA +            FGEEEF+QAAVDAGEVVWR GLLKRVGICHG+SGNSYV
Subjt:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV

Query:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG
        FL                         AHRLIAEGEMHGGDS +S+FEGVGGMAYLFLDMI PSMA+FPG
Subjt:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG

XP_022985978.1 lanC-like protein GCL2 isoform X1 [Cucurbita maxima]1.3e-14472.16Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG
        A+ +KE IVLETWGVTGQ+VRDFTLYSGALGTAFLLLKA+E+T+N IDLSLCAQIVKACDQASS STDVTFICGRAG CAIGAVAA+RAGDEQLL YYLG
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG

Query:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEERG--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED
        QF EIKLPRNLPDELL G+VGFLWACLFLNK I EGT    H         ++G    +G   PLMFEWYGERYWG AHGLAGI+H+LM+MELKP+E +D
Subjt:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEERG--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED

Query:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV
        VKGT+RYMI+N FP+GNYPSSEEDR RD LVHWCHGAPG+ALTLVKAA +            FGEEEF+QAA DAGEVVWR GLLKRVGICHG+SGNSYV
Subjt:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV

Query:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG
        FL                         AHRLIAEGEMHGGDS +S+FEGVGGMAYLFLDMI PSMA+FPG
Subjt:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG

XP_023512306.1 lanC-like protein GCL2 isoform X1 [Cucurbita pepo subsp. pepo]1.4e-14673.24Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG
        A+ +KE IVLETWGVTGQ+VRDFTLYSGALGTAFLLLKA+E+TSNHIDLSLCAQI+KACDQASS STDVTFICGRAG CAIGAVAAKRAGDEQLL YYLG
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG

Query:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEERG--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED
        QF EIKLPRNLPDELL G+VGFLWACL+LNK I EGT    H         ++G    +G   PLMFEWYGERYWG AHGLAGI+H+LM+MELKP+E +D
Subjt:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEERG--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED

Query:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV
        VKGTLRYMI+N FPSGNYPSSEEDR RD LVHWCHGAPG+ALTLVKAA +            FGEEEF+QAAVDAGEVVWR GLLKRVGICHG+SGNSYV
Subjt:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV

Query:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG
        FL                         AHRLIAEGEMHGGDS +S+FEGVGGMAYLFLDMI PSMA+FPG
Subjt:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG

TrEMBL top hitse value%identityAlignment
A0A498HD34 Uncharacterized protein5.2e-13154.94Show/hide
Query:  CTQSLLKFVNSGNGMVGVAMILYGIWLMRAWQRQMGHLP---FEGRDDPVPW--------------------FMCSFLGLGILLCAVTCLGHIAAETANG
        C QSLLK VN   GMVG+AMI+Y +WL  AWQR M H P         P PW                    F+ +FLGLGI L  +TC GHIAA+TANG
Subjt:  CTQSLLKFVNSGNGMVGVAMILYGIWLMRAWQRQMGHLP---FEGRDDPVPW--------------------FMCSFLGLGILLCAVTCLGHIAAETANG

Query:  CCLHMYMVLVFVLFMMEAGVTTDVFLNRDWEEDFPEDPSGSFDQFKHFIRLNFNICKWIGISMVSIQGLSLLLAMVLKAIGTRRFYDSDDDYAPEKLPLL
        CCL++YMV VF+LFM+EAGVT D+FLNRDWEEDFP+DP+G FDQFK F+R NF+ICKWIG+S+V +QGLSLLLAM+LKA+G   +YDSDD+Y PE++PLL
Subjt:  CCLHMYMVLVFVLFMMEAGVTTDVFLNRDWEEDFPEDPSGSFDQFKHFIRLNFNICKWIGISMVSIQGLSLLLAMVLKAIGTRRFYDSDDDYAPEKLPLL

Query:  KNALHSPTTFVVGDP-----VFASKNDVWNKRNEGKLTLWAELLVSPPTVIAHRRRNAPPPPPWFWPLCGLFLCGGCAPVFHGVGSAPTPSAGSTLAPGH
        KNA+  P  +VVGDP     V+ SK+D WN R   KL      LV                      L  L L   C            P A +   P  
Subjt:  KNALHSPTTFVVGDP-----VFASKNDVWNKRNEGKLTLWAELLVSPPTVIAHRRRNAPPPPPWFWPLCGLFLCGGCAPVFHGVGSAPTPSAGSTLAPGH

Query:  RHLVWKPR------GRCGREAP------KGRVGAVSPVLFKNGEGCGACYKVKCLDQSICSRRAVTIIVTDECPGGYCSNGNTHFDLSGAAFGRMAIAGE
              P       G CG  +       + RVGAV+PVLFK+GEGCGACYK+KCLDQSICSRR VTIIVTDECPG  CS G   FDLSGAAFGRMA+AGE
Subjt:  RHLVWKPR------GRCGREAP------KGRVGAVSPVLFKNGEGCGACYKVKCLDQSICSRRAVTIIVTDECPGGYCSNGNTHFDLSGAAFGRMAIAGE

Query:  GGPLRNRGEIPVIYRRTPCKYPGKNIAFHVNEGSTDYWLSLLVEFEDGDGDVGAMQIKEIVLETWG
        GG LRN+GE+ V+YRRTPCKYPGK IAFHVNEGST+YWLSLLVEFEDGDGD  + +  E +   WG
Subjt:  GGPLRNRGEIPVIYRRTPCKYPGKNIAFHVNEGSTDYWLSLLVEFEDGDGDVGAMQIKEIVLETWG

A0A4D6LGL9 LanC-like protein6.1e-13244.73Show/hide
Query:  GRCG-----REAPKGRVGAVSPVLFKNGEGCGACYKVKCLDQ---SICSRRAVTIIVTDECP---------GGYCSNGNTHFDLSGAAFGRMAIAGEGGP
        G CG     +E+   R   +S +LF  G  CGACY+++C+D     +    +V + VTD C          GG+C+    HF++S  AF  +A       
Subjt:  GRCG-----REAPKGRVGAVSPVLFKNGEGCGACYKVKCLDQ---SICSRRAVTIIVTDECP---------GGYCSNGNTHFDLSGAAFGRMAIAGEGGP

Query:  LRNRGEI-PVIYRRTPCKYPGKNIAFHVNEGSTDYWLSLLVEFEDGDGDVGAMQIK--------------------------------------------
         +NR +I PV YRR  C+  G  + F ++ GS  ++  +L+     DG+V A+++K                                            
Subjt:  LRNRGEI-PVIYRRTPCKYPGKNIAFHVNEGSTDYWLSLLVEFEDGDGDVGAMQIK--------------------------------------------

Query:  ------------------------------------------------------EIVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLC
                                                               IV+ETWGV+G++  DF+LY G LGTAFLLLK+YE+T N  DLSLC
Subjt:  ------------------------------------------------------EIVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLC

Query:  AQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLGQFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHPAR-------
        +QIVKACD AS  S DVTFICGRAG C++GAVAAK AGD++ L YYL QF++IKL ++LPDELL GRVGFLWACLFLNK + +GT   ++ A        
Subjt:  AQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLGQFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHPAR-------

Query:  --KGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDF
          +  G +   PLM+EWYGE+YWGAAHGLAGI+HVLM+MELKP+E EDVKGTL+YMI N FPSGNYP+SE+DR  D LVHWCHGAPGIALTL KAA +  
Subjt:  --KGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDF

Query:  FHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYVFL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGG
                  FG++EFL AA++AGEVVW  GLLK+VGICHGISGN+YVFL                         AH+LI+EGEMHGGD  +S+FEG GG
Subjt:  FHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYVFL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGG

Query:  MAYLFLDMIMPSMARFP
        MAYLFLDMI PS+A+FP
Subjt:  MAYLFLDMIMPSMARFP

A0A6J1CF36 lanC-like protein GCL21.5e-14673.78Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG
        A+ +KE IVLETWGVTGQ+VRDFTLYSGALGTA LLLKA+E+TSNHIDLSLCAQIVKACDQASS STDVTFICGRAG CAIGAVAAKRAGDEQLL YYLG
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG

Query:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKG--AGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED
        +F EIKLP NLPDELL GRVGFLWACLFLNK I  GT    H         R+G    +  G PLMFEWYGERYWGAAHGLAGI+HVLM+MELKP+E ED
Subjt:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKG--AGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED

Query:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV
        VKGT+RYMIQN FPSGNYPSSEEDRNRD LVHWCHGAPGIALTLVKAA +            FGE+EF++AAV+AGEVVWR GLLKRVGICHGISGNSYV
Subjt:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV

Query:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG
        FL                         AHRLIAEGEMHGGDS  S+FEG+GGMAYLFLDMI+P+MARFPG
Subjt:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG

A0A6J1FSH6 lanC-like protein GCL2 isoform X19.7e-14672.7Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG
        A+ +KE IVLETWGVTGQ+VRDFTLY+GALGTAFLLLKA+E+TSNHIDLSLCAQIVKACDQASS STDVTFICGRAG CAIG VAAKRAGDEQLL YYL 
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG

Query:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEERG--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED
        QF EIKLPRNLPDELL G+VGFLWACL+LNK I EGT    H         ++G    +G   PLMFEWYGERYWG AHGLAGI+H+LM+MELKP+E +D
Subjt:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEERG--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED

Query:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV
        VKGTLRYMI+N FPSGNYPSSEEDR RD LVHWCHGAPG+ALTLVKAA +            FGEEEF+QAAVDAGEVVWR GLLKRVGICHG+SGNSYV
Subjt:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV

Query:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG
        FL                         AHRLIAEGEMHGGDS +S+FEGVGGMAYLFLDMI PSMA+FPG
Subjt:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG

A0A6J1JCS6 lanC-like protein GCL2 isoform X16.3e-14572.16Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG
        A+ +KE IVLETWGVTGQ+VRDFTLYSGALGTAFLLLKA+E+T+N IDLSLCAQIVKACDQASS STDVTFICGRAG CAIGAVAA+RAGDEQLL YYLG
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG

Query:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEERG--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED
        QF EIKLPRNLPDELL G+VGFLWACLFLNK I EGT    H         ++G    +G   PLMFEWYGERYWG AHGLAGI+H+LM+MELKP+E +D
Subjt:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI-EGTRGGNHP-------ARKGAGEERG--PPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENED

Query:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV
        VKGT+RYMI+N FP+GNYPSSEEDR RD LVHWCHGAPG+ALTLVKAA +            FGEEEF+QAA DAGEVVWR GLLKRVGICHG+SGNSYV
Subjt:  VKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYV

Query:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG
        FL                         AHRLIAEGEMHGGDS +S+FEGVGGMAYLFLDMI PSMA+FPG
Subjt:  FL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG

SwissProt top hitse value%identityAlignment
F4IEM5 LanC-like protein GCR21.5e-10654.13Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG
        A+ IK+ +V ETW  +G++VRD+ LY+G LGTA+LL K+Y++T N  DL LC + V+ACD AS +S  VTFICG AG CA+GAVAAK  GD+QL   YL 
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG

Query:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI---------------EGTRGGNHPARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKP
        +F+ I+LP +LP ELL GR G+LWACLFLNK I               E  R G     KG       PLM+EW+G+RYWGAAHGLAGI++VLM  EL+P
Subjt:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI---------------EGTRGGNHPARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKP

Query:  NENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGIS
        +E +DVKGTL YMIQN FPSGNY SSE  ++ D LVHWCHGAPG+ALTLVKAA +            +  +EF++AA++AGEVVW  GLLKRVGICHGIS
Subjt:  NENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGIS

Query:  GNSYVFL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG
        GN+YVFL                         + +LI+EG+MHGGD   S+FEG+GGMAY+ LDM  P+ A FPG
Subjt:  GNSYVFL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG

Q0DZ85 Expansin-B161.6e-6078.1Show/hide
Query:  KGRVGAVSPVLFKNGEGCGACYKVKCLDQSICSRRAVTIIVTDECPGGYCSNGNTHFDLSGAAFGRMAIAGEGGPLRNRGEIPVIYRRTPCKYPGKNIAF
        K RVGAVSPVLFK GEGCGACYKV+CLD SICSRRAVT+IVTDECPGG C+ G THFDLSGAAF R+A+AG GG L+NRGEI V+YRRT CKY GKNIAF
Subjt:  KGRVGAVSPVLFKNGEGCGACYKVKCLDQSICSRRAVTIIVTDECPGGYCSNGNTHFDLSGAAFGRMAIAGEGGPLRNRGEIPVIYRRTPCKYPGKNIAF

Query:  HVNEGSTDYWLSLLVEFEDGDGDVGAMQIKEIVLETW
        HVNEGST +WLSLLVEFEDGDGD+G+MQ+K+     W
Subjt:  HVNEGSTDYWLSLLVEFEDGDGDVGAMQIKEIVLETW

Q8VZQ6 LanC-like protein GCL23.8e-12359.73Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG
        A+ +KE +V+ETWG +GQ V DFTLYSG LG AFLL +AY++T N  DLSLC +IVKACD AS+ S DVTF+CGRAG C +GAVAAK +G+E LL YYLG
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG

Query:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI---------------EGTRGGNHPARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKP
        QF+ I+L  +LP+ELL GRVG+LWACLF+NK I               E  + G   A+KG+      PLMFEWYG+RYWGAAHGLAGI+HVLM+++LKP
Subjt:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI---------------EGTRGGNHPARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKP

Query:  NENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGIS
        +E EDVKGTL+YMI+N FPSGNYP+SEED+ +D LVHWCHGAPGIALTL KAA +            FGE EFL+A+  A EVVW  GLLKRVGICHGIS
Subjt:  NENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGIS

Query:  GNSYVFLA-------------------------HRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG
        GN+YVFLA                          +L+++GEMHGGDS +S+FEGV GMAYLFLDM+ PS ARFPG
Subjt:  GNSYVFLA-------------------------HRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG

Q940P5 Tetraspanin-192.9e-7058.57Show/hide
Query:  MLKVCTQSLLKFVNSGNGMVGVAMILYGIWLMRAWQRQMGHLPFEGRDDPVPWFMCSFLGLGILLCAVTCLGHIAAETANGCCLHMYMVLVFVLFMMEAG
        +++ C QS+LK VNS  GMVG+AMILY +WL+R WQ QMG+LPF   D PVPWF+ SFLGLG +LC VTC GHIAAET NGCCL++YM  + +L M+E G
Subjt:  MLKVCTQSLLKFVNSGNGMVGVAMILYGIWLMRAWQRQMGHLPFEGRDDPVPWFMCSFLGLGILLCAVTCLGHIAAETANGCCLHMYMVLVFVLFMMEAG

Query:  VTTDVFLNRDWEEDFPEDPSGSFDQFKHFIRLNFNICKWIGISMVSIQGLSLLLAMVLKAIG--TRRFYDSDDDYAPEKLPLLKNALHSPTTFVVGDPVF
        V  D+FLNRDW++DFPEDPSG+F QF  FI  NF ICKWIG+S+V +QGLS+L+AM+LKA+G    R YDSDD+Y    + LL++A   P  +VVG+P++
Subjt:  VTTDVFLNRDWEEDFPEDPSGSFDQFKHFIRLNFNICKWIGISMVSIQGLSLLLAMVLKAIG--TRRFYDSDDDYAPEKLPLLKNALHSPTTFVVGDPVF

Query:  ASKNDVWNKR
         +K   W  R
Subjt:  ASKNDVWNKR

Q9FJN7 LanC-like protein GCL11.1e-6640.11Show/hide
Query:  VRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSEST-DVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLGQFKEIKLPRNLPD-----
        V D T+Y+G LGTAF  LK+YE+T NH DL  CA+I+  C   +  +T  VTF+CGR G C +GA+ A   GD+    ++LG F E+   R LP      
Subjt:  VRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSEST-DVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLGQFKEIKLPRNLPD-----

Query:  ------ELLCGRVGFLWACLFLNKRIEGTRGGNH-----------PARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENEDVKGTLR
              +LL GR GFLWA LFLN+ +      +H             R GA +    PL++ ++G R+WGAA+GLAGI++VL+   L   + +DV+GTLR
Subjt:  ------ELLCGRVGFLWACLFLNKRIEGTRGGNH-----------PARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENEDVKGTLR

Query:  YMIQNCFP-SGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYVFLA-H
        YM+ N FP SGNYP S E   RD LV W HGA G+A+TL KA+ +       F K    E +F +AA++AGEVVW+ GL+K+VG+  G++GN+Y FL+ +
Subjt:  YMIQNCFP-SGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYVFLA-H

Query:  RLIAE---------------------GEMHGGDSHH--SMFEGVGGMAYLFLDMIMPSMARFPG
        RL  +                       M   ++ H  S+F G+ G   L+ D++ P  ++FPG
Subjt:  RLIAE---------------------GEMHGGDSHH--SMFEGVGGMAYLFLDMIMPSMARFPG

Arabidopsis top hitse value%identityAlignment
AT1G52920.1 G protein coupled receptor1.0e-10754.13Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG
        A+ IK+ +V ETW  +G++VRD+ LY+G LGTA+LL K+Y++T N  DL LC + V+ACD AS +S  VTFICG AG CA+GAVAAK  GD+QL   YL 
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG

Query:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI---------------EGTRGGNHPARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKP
        +F+ I+LP +LP ELL GR G+LWACLFLNK I               E  R G     KG       PLM+EW+G+RYWGAAHGLAGI++VLM  EL+P
Subjt:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI---------------EGTRGGNHPARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKP

Query:  NENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGIS
        +E +DVKGTL YMIQN FPSGNY SSE  ++ D LVHWCHGAPG+ALTLVKAA +            +  +EF++AA++AGEVVW  GLLKRVGICHGIS
Subjt:  NENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGIS

Query:  GNSYVFL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG
        GN+YVFL                         + +LI+EG+MHGGD   S+FEG+GGMAY+ LDM  P+ A FPG
Subjt:  GNSYVFL-------------------------AHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG

AT2G20740.1 Tetraspanin family protein2.0e-7158.57Show/hide
Query:  MLKVCTQSLLKFVNSGNGMVGVAMILYGIWLMRAWQRQMGHLPFEGRDDPVPWFMCSFLGLGILLCAVTCLGHIAAETANGCCLHMYMVLVFVLFMMEAG
        +++ C QS+LK VNS  GMVG+AMILY +WL+R WQ QMG+LPF   D PVPWF+ SFLGLG +LC VTC GHIAAET NGCCL++YM  + +L M+E G
Subjt:  MLKVCTQSLLKFVNSGNGMVGVAMILYGIWLMRAWQRQMGHLPFEGRDDPVPWFMCSFLGLGILLCAVTCLGHIAAETANGCCLHMYMVLVFVLFMMEAG

Query:  VTTDVFLNRDWEEDFPEDPSGSFDQFKHFIRLNFNICKWIGISMVSIQGLSLLLAMVLKAIG--TRRFYDSDDDYAPEKLPLLKNALHSPTTFVVGDPVF
        V  D+FLNRDW++DFPEDPSG+F QF  FI  NF ICKWIG+S+V +QGLS+L+AM+LKA+G    R YDSDD+Y    + LL++A   P  +VVG+P++
Subjt:  VTTDVFLNRDWEEDFPEDPSGSFDQFKHFIRLNFNICKWIGISMVSIQGLSLLLAMVLKAIG--TRRFYDSDDDYAPEKLPLLKNALHSPTTFVVGDPVF

Query:  ASKNDVWNKR
         +K   W  R
Subjt:  ASKNDVWNKR

AT2G20770.1 GCR2-like 22.7e-12459.73Show/hide
Query:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG
        A+ +KE +V+ETWG +GQ V DFTLYSG LG AFLL +AY++T N  DLSLC +IVKACD AS+ S DVTF+CGRAG C +GAVAAK +G+E LL YYLG
Subjt:  AMQIKE-IVLETWGVTGQQVRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLG

Query:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI---------------EGTRGGNHPARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKP
        QF+ I+L  +LP+ELL GRVG+LWACLF+NK I               E  + G   A+KG+      PLMFEWYG+RYWGAAHGLAGI+HVLM+++LKP
Subjt:  QFKEIKLPRNLPDELLCGRVGFLWACLFLNKRI---------------EGTRGGNHPARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKP

Query:  NENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGIS
        +E EDVKGTL+YMI+N FPSGNYP+SEED+ +D LVHWCHGAPGIALTL KAA +            FGE EFL+A+  A EVVW  GLLKRVGICHGIS
Subjt:  NENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGIS

Query:  GNSYVFLA-------------------------HRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG
        GN+YVFLA                          +L+++GEMHGGDS +S+FEGV GMAYLFLDM+ PS ARFPG
Subjt:  GNSYVFLA-------------------------HRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPG

AT4G28250.1 expansin B34.3e-6178.52Show/hide
Query:  RVGAVSPVLFKNGEGCGACYKVKCLDQSICSRRAVTIIVTDECPGGYCSNGNTHFDLSGAAFGRMAIAGEGGPLRNRGEIPVIYRRTPCKYPGKNIAFHV
        RVGAV+P+LFKNGEGCGACYKV+CLD+SICSRRAVT+I+TDECPG  CS  +THFDLSGA FGR+AIAGE GPLRNRG IPVIYRRT CKY GKNIAFHV
Subjt:  RVGAVSPVLFKNGEGCGACYKVKCLDQSICSRRAVTIIVTDECPGGYCSNGNTHFDLSGAAFGRMAIAGEGGPLRNRGEIPVIYRRTPCKYPGKNIAFHV

Query:  NEGSTDYWLSLLVEFEDGDGDVGAMQIKEIVLETW
        NEGSTD+WLSLLVEFEDG+GD+G+M I++     W
Subjt:  NEGSTDYWLSLLVEFEDGDGDVGAMQIKEIVLETW

AT5G65280.1 GCR2-like 18.0e-6840.11Show/hide
Query:  VRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSEST-DVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLGQFKEIKLPRNLPD-----
        V D T+Y+G LGTAF  LK+YE+T NH DL  CA+I+  C   +  +T  VTF+CGR G C +GA+ A   GD+    ++LG F E+   R LP      
Subjt:  VRDFTLYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSEST-DVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLGQFKEIKLPRNLPD-----

Query:  ------ELLCGRVGFLWACLFLNKRIEGTRGGNH-----------PARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENEDVKGTLR
              +LL GR GFLWA LFLN+ +      +H             R GA +    PL++ ++G R+WGAA+GLAGI++VL+   L   + +DV+GTLR
Subjt:  ------ELLCGRVGFLWACLFLNKRIEGTRGGNH-----------PARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENEDVKGTLR

Query:  YMIQNCFP-SGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYVFLA-H
        YM+ N FP SGNYP S E   RD LV W HGA G+A+TL KA+ +       F K    E +F +AA++AGEVVW+ GL+K+VG+  G++GN+Y FL+ +
Subjt:  YMIQNCFP-SGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFHGRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYVFLA-H

Query:  RLIAE---------------------GEMHGGDSHH--SMFEGVGGMAYLFLDMIMPSMARFPG
        RL  +                       M   ++ H  S+F G+ G   L+ D++ P  ++FPG
Subjt:  RLIAE---------------------GEMHGGDSHH--SMFEGVGGMAYLFLDMIMPSMARFPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAAAGGTCTGCACGCAATCTTTGTTGAAGTTCGTGAATTCTGGAAATGGGATGGTAGGAGTCGCCATGATTTTGTACGGAATTTGGTTGATGAGAGCTTGGCAGAG
GCAAATGGGTCATTTGCCCTTTGAAGGTCGCGATGACCCGGTTCCATGGTTCATGTGCAGTTTTCTTGGCCTAGGCATTCTTTTATGTGCGGTCACATGCTTGGGTCATA
TTGCTGCTGAAACTGCTAATGGTTGTTGCCTTCATATGTACATGGTGCTTGTCTTTGTGCTTTTTATGATGGAGGCTGGGGTGACTACTGATGTGTTTCTGAATCGTGAC
TGGGAGGAGGACTTTCCCGAGGATCCAAGTGGAAGTTTTGATCAGTTCAAACATTTCATCAGATTGAATTTCAATATCTGCAAATGGATAGGGATCTCAATGGTGTCTAT
TCAGGGTTTGTCTCTCTTGTTGGCAATGGTGCTAAAAGCCATTGGAACGCGTCGATTTTATGATAGTGATGATGACTATGCTCCTGAGAAGCTTCCACTTCTTAAGAATG
CTCTTCATTCACCAACTACGTTTGTTGTTGGTGACCCTGTCTTTGCATCTAAAAACGATGTGTGGAACAAGCGAAATGAGGGCAAGCTGACATTGTGGGCGGAGCTATTG
GTATCGCCGCCTACTGTTATAGCCCACCGCAGACGCAATGCACCTCCTCCGCCGCCGTGGTTTTGGCCTCTTTGCGGCTTGTTTCTTTGTGGTGGCTGCGCTCCAGTGTT
TCACGGCGTCGGCTCAGCTCCAACACCGTCTGCCGGATCTACATTGGCTCCCGGCCACCGCCACCTGGTATGGAAGCCCAGAGGGCGATGTGGACGTGAAGCCCCTAAAG
GCAGGGTTGGGGCTGTGAGTCCGGTGCTGTTCAAAAATGGCGAAGGGTGTGGCGCCTGTTACAAGGTGAAGTGCCTGGACCAGAGCATTTGCTCCCGACGAGCCGTGACT
ATAATCGTCACCGACGAGTGCCCTGGTGGGTATTGTTCCAATGGCAATACCCACTTCGATCTCAGCGGCGCCGCCTTCGGTCGCATGGCCATCGCCGGCGAAGGTGGCCC
GCTCAGGAACCGAGGCGAAATCCCAGTCATTTACCGACGGACTCCATGTAAGTACCCAGGCAAGAACATTGCCTTCCATGTCAACGAAGGCTCAACAGACTACTGGCTCT
CACTCTTGGTTGAATTCGAGGATGGCGATGGAGACGTCGGTGCAATGCAAATAAAAGAAATAGTTTTGGAGACGTGGGGAGTAACCGGGCAACAAGTGCGGGATTTCACA
CTCTATTCTGGGGCTCTTGGGACGGCTTTCTTGCTGTTAAAAGCTTACGAGATCACTTCGAATCACATTGATCTTAGTCTCTGCGCTCAAATTGTTAAGGCCTGTGATCA
AGCATCTTCCGAGTCCACGGATGTAACTTTCATTTGTGGGCGTGCCGGTTTCTGCGCTATTGGAGCTGTGGCAGCAAAGCGTGCTGGTGATGAGCAGCTGCTCATTTACT
ATCTAGGTCAATTCAAAGAGATTAAGCTTCCAAGAAATCTTCCGGATGAGTTGTTATGTGGAAGAGTTGGTTTCTTGTGGGCATGTCTATTTCTAAACAAACGCATCGAG
GGGACTCGTGGAGGAAATCATCCGGCGAGGAAGGGCGCAGGCGAAGAGAGGGGGCCGCCATTGATGTTTGAATGGTATGGCGAGAGGTACTGGGGTGCTGCACATGGATT
AGCAGGGATTGTGCATGTTCTGATGGAGATGGAGTTGAAGCCAAATGAGAATGAAGATGTGAAAGGCACTCTTAGGTACATGATTCAAAACTGTTTCCCCAGTGGAAACT
ACCCTTCAAGTGAAGAAGATAGGAACAGAGATACTCTTGTGCATTGGTGTCATGGCGCTCCTGGGATTGCCCTCACGCTTGTCAAAGCAGCACATATAGATTTCTTCCAT
GGCAGGCATTTCTGCAAGCTTGATTTTGGAGAAGAAGAGTTTCTGCAAGCGGCTGTGGATGCAGGAGAGGTAGTATGGAGGTGTGGGCTGCTCAAGCGAGTTGGGATCTG
TCATGGCATTAGTGGGAATTCTTATGTGTTTCTAGCTCACAGGCTGATTGCAGAGGGAGAGATGCATGGAGGCGACAGCCATCACTCTATGTTTGAAGGAGTTGGAGGGA
TGGCTTATCTTTTTCTTGACATGATTATGCCCTCCATGGCCAGGTTTCCGGGGAGCTACGCAAGTTTCCAGCTTTTGCACGGTAGAAGAGAGGGAAAGCTAGAGAGAGAG
CGTGTGATGTTGGCTCATTTTTCTTCAACGAAGGCAACCGACAGAGCAGACATGAAGGAGAAGATGAAGAACGTTCCACCCACAGTGATTGCTCGAGAAACAGACAAGAA
AGACATGGCCACCAGCCCACTGGAAACCCTGTTGCCCACTGCTCCAAGAGCGGCTGCTTGGGCTCGTAGCTTCAATGGGAAAATTTCAGAGGTCAAAACCCAGCAAACAG
GGCCAATTCCTACTGAAAAGAAAGCCACATTTCCACACACCCAAAAGATGGCTAAAGCAACCCCAACTTTCCCGTTCCCAATAAAAGTGAGCGTAAAACCCAAGCTGAGC
AAACACACTGTCATTCCAATTGTGCTCAGATACAGCAATGGCTTCCTACCAAGCTTGTCAATGAGGAGAATGGCGACCAATATGAAAGATGTCTTTGCAATACCAACCGC
CACAGTTGCTGAAAGAAGCTTCGAGTTACCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTAAAGGTCTGCACGCAATCTTTGTTGAAGTTCGTGAATTCTGGAAATGGGATGGTAGGAGTCGCCATGATTTTGTACGGAATTTGGTTGATGAGAGCTTGGCAGAG
GCAAATGGGTCATTTGCCCTTTGAAGGTCGCGATGACCCGGTTCCATGGTTCATGTGCAGTTTTCTTGGCCTAGGCATTCTTTTATGTGCGGTCACATGCTTGGGTCATA
TTGCTGCTGAAACTGCTAATGGTTGTTGCCTTCATATGTACATGGTGCTTGTCTTTGTGCTTTTTATGATGGAGGCTGGGGTGACTACTGATGTGTTTCTGAATCGTGAC
TGGGAGGAGGACTTTCCCGAGGATCCAAGTGGAAGTTTTGATCAGTTCAAACATTTCATCAGATTGAATTTCAATATCTGCAAATGGATAGGGATCTCAATGGTGTCTAT
TCAGGGTTTGTCTCTCTTGTTGGCAATGGTGCTAAAAGCCATTGGAACGCGTCGATTTTATGATAGTGATGATGACTATGCTCCTGAGAAGCTTCCACTTCTTAAGAATG
CTCTTCATTCACCAACTACGTTTGTTGTTGGTGACCCTGTCTTTGCATCTAAAAACGATGTGTGGAACAAGCGAAATGAGGGCAAGCTGACATTGTGGGCGGAGCTATTG
GTATCGCCGCCTACTGTTATAGCCCACCGCAGACGCAATGCACCTCCTCCGCCGCCGTGGTTTTGGCCTCTTTGCGGCTTGTTTCTTTGTGGTGGCTGCGCTCCAGTGTT
TCACGGCGTCGGCTCAGCTCCAACACCGTCTGCCGGATCTACATTGGCTCCCGGCCACCGCCACCTGGTATGGAAGCCCAGAGGGCGATGTGGACGTGAAGCCCCTAAAG
GCAGGGTTGGGGCTGTGAGTCCGGTGCTGTTCAAAAATGGCGAAGGGTGTGGCGCCTGTTACAAGGTGAAGTGCCTGGACCAGAGCATTTGCTCCCGACGAGCCGTGACT
ATAATCGTCACCGACGAGTGCCCTGGTGGGTATTGTTCCAATGGCAATACCCACTTCGATCTCAGCGGCGCCGCCTTCGGTCGCATGGCCATCGCCGGCGAAGGTGGCCC
GCTCAGGAACCGAGGCGAAATCCCAGTCATTTACCGACGGACTCCATGTAAGTACCCAGGCAAGAACATTGCCTTCCATGTCAACGAAGGCTCAACAGACTACTGGCTCT
CACTCTTGGTTGAATTCGAGGATGGCGATGGAGACGTCGGTGCAATGCAAATAAAAGAAATAGTTTTGGAGACGTGGGGAGTAACCGGGCAACAAGTGCGGGATTTCACA
CTCTATTCTGGGGCTCTTGGGACGGCTTTCTTGCTGTTAAAAGCTTACGAGATCACTTCGAATCACATTGATCTTAGTCTCTGCGCTCAAATTGTTAAGGCCTGTGATCA
AGCATCTTCCGAGTCCACGGATGTAACTTTCATTTGTGGGCGTGCCGGTTTCTGCGCTATTGGAGCTGTGGCAGCAAAGCGTGCTGGTGATGAGCAGCTGCTCATTTACT
ATCTAGGTCAATTCAAAGAGATTAAGCTTCCAAGAAATCTTCCGGATGAGTTGTTATGTGGAAGAGTTGGTTTCTTGTGGGCATGTCTATTTCTAAACAAACGCATCGAG
GGGACTCGTGGAGGAAATCATCCGGCGAGGAAGGGCGCAGGCGAAGAGAGGGGGCCGCCATTGATGTTTGAATGGTATGGCGAGAGGTACTGGGGTGCTGCACATGGATT
AGCAGGGATTGTGCATGTTCTGATGGAGATGGAGTTGAAGCCAAATGAGAATGAAGATGTGAAAGGCACTCTTAGGTACATGATTCAAAACTGTTTCCCCAGTGGAAACT
ACCCTTCAAGTGAAGAAGATAGGAACAGAGATACTCTTGTGCATTGGTGTCATGGCGCTCCTGGGATTGCCCTCACGCTTGTCAAAGCAGCACATATAGATTTCTTCCAT
GGCAGGCATTTCTGCAAGCTTGATTTTGGAGAAGAAGAGTTTCTGCAAGCGGCTGTGGATGCAGGAGAGGTAGTATGGAGGTGTGGGCTGCTCAAGCGAGTTGGGATCTG
TCATGGCATTAGTGGGAATTCTTATGTGTTTCTAGCTCACAGGCTGATTGCAGAGGGAGAGATGCATGGAGGCGACAGCCATCACTCTATGTTTGAAGGAGTTGGAGGGA
TGGCTTATCTTTTTCTTGACATGATTATGCCCTCCATGGCCAGGTTTCCGGGGAGCTACGCAAGTTTCCAGCTTTTGCACGGTAGAAGAGAGGGAAAGCTAGAGAGAGAG
CGTGTGATGTTGGCTCATTTTTCTTCAACGAAGGCAACCGACAGAGCAGACATGAAGGAGAAGATGAAGAACGTTCCACCCACAGTGATTGCTCGAGAAACAGACAAGAA
AGACATGGCCACCAGCCCACTGGAAACCCTGTTGCCCACTGCTCCAAGAGCGGCTGCTTGGGCTCGTAGCTTCAATGGGAAAATTTCAGAGGTCAAAACCCAGCAAACAG
GGCCAATTCCTACTGAAAAGAAAGCCACATTTCCACACACCCAAAAGATGGCTAAAGCAACCCCAACTTTCCCGTTCCCAATAAAAGTGAGCGTAAAACCCAAGCTGAGC
AAACACACTGTCATTCCAATTGTGCTCAGATACAGCAATGGCTTCCTACCAAGCTTGTCAATGAGGAGAATGGCGACCAATATGAAAGATGTCTTTGCAATACCAACCGC
CACAGTTGCTGAAAGAAGCTTCGAGTTACCCTGA
Protein sequenceShow/hide protein sequence
MLKVCTQSLLKFVNSGNGMVGVAMILYGIWLMRAWQRQMGHLPFEGRDDPVPWFMCSFLGLGILLCAVTCLGHIAAETANGCCLHMYMVLVFVLFMMEAGVTTDVFLNRD
WEEDFPEDPSGSFDQFKHFIRLNFNICKWIGISMVSIQGLSLLLAMVLKAIGTRRFYDSDDDYAPEKLPLLKNALHSPTTFVVGDPVFASKNDVWNKRNEGKLTLWAELL
VSPPTVIAHRRRNAPPPPPWFWPLCGLFLCGGCAPVFHGVGSAPTPSAGSTLAPGHRHLVWKPRGRCGREAPKGRVGAVSPVLFKNGEGCGACYKVKCLDQSICSRRAVT
IIVTDECPGGYCSNGNTHFDLSGAAFGRMAIAGEGGPLRNRGEIPVIYRRTPCKYPGKNIAFHVNEGSTDYWLSLLVEFEDGDGDVGAMQIKEIVLETWGVTGQQVRDFT
LYSGALGTAFLLLKAYEITSNHIDLSLCAQIVKACDQASSESTDVTFICGRAGFCAIGAVAAKRAGDEQLLIYYLGQFKEIKLPRNLPDELLCGRVGFLWACLFLNKRIE
GTRGGNHPARKGAGEERGPPLMFEWYGERYWGAAHGLAGIVHVLMEMELKPNENEDVKGTLRYMIQNCFPSGNYPSSEEDRNRDTLVHWCHGAPGIALTLVKAAHIDFFH
GRHFCKLDFGEEEFLQAAVDAGEVVWRCGLLKRVGICHGISGNSYVFLAHRLIAEGEMHGGDSHHSMFEGVGGMAYLFLDMIMPSMARFPGSYASFQLLHGRREGKLERE
RVMLAHFSSTKATDRADMKEKMKNVPPTVIARETDKKDMATSPLETLLPTAPRAAAWARSFNGKISEVKTQQTGPIPTEKKATFPHTQKMAKATPTFPFPIKVSVKPKLS
KHTVIPIVLRYSNGFLPSLSMRRMATNMKDVFAIPTATVAERSFELP