; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g38400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g38400
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of unknown function (DUF819)
Genome locationchr6:29734236..29739775
RNA-Seq ExpressionMoc06g38400
SyntenyMoc06g38400
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008537 - Protein of unknown function DUF819


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN49926.1 hypothetical protein Csa_000607 [Cucumis sativus]6.2e-19681.58Show/hide
Query:  MASS-IKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAP-----AKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHW
        MAS  I N AMFAPH PLLQLPS   HSLLPL HGR SAP     +KSS RR N +  LLS SSSS Y NS        TR+VKV +QLRHPIIA DD+W
Subjt:  MASS-IKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAP-----AKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHW

Query:  GTWTALFAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVA
        GTWTALFAIG LG+WSEKTK+GSTVSAALVSTLVGLAASN GIIPYEA+PYSIV++FLLPLSVPLLLFRA +RH+IRSTGTLLGVFLLGSV+T+IGTVVA
Subjt:  GTWTALFAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVA

Query:  FLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTAT
        FLMVPMRSLGPDNWK+AAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVI A+YFVALFALAS   PEP TSTDD S + D DHGTKLPVL TAT
Subjt:  FLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTAT

Query:  AIVTSLAICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGF
        A+VTS AICKFVTWITN+  IQGANLPGITAVVV LAT+ PKQF+YLAPA DTIALILMQVFF VVGASGSIW VINN PSIF+FALVQVTVHLA+IL F
Subjt:  AIVTSLAICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGF

Query:  GKLFSIELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        GKLF I+LKLLLLASNANIGGPTTACGMATAKGWR LVVP+ILAGIFGIAIATFLG+GFGLMILR +
Subjt:  GKLFSIELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM

XP_008437525.1 PREDICTED: uncharacterized membrane protein YjcL-like isoform X1 [Cucumis melo]1.6e-19681.58Show/hide
Query:  MASS-IKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPA-----KSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHW
        MASS I N AMFAPH PLLQLPS   HSLLPL HGR SAP      KSS RR N +  LLSPSSSS YRNS        +++VKV +QLRHPIIA DD+W
Subjt:  MASS-IKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPA-----KSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHW

Query:  GTWTALFAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVA
        GTWTALFAIG LG+WSEKTKIGSTVSAALVSTLVGLAASN GIIPYEA+PYSIV++FLLPLSVPLLLFRA +RH++R+TGTLLGVFLLGSV+T+IGTVVA
Subjt:  GTWTALFAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVA

Query:  FLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTAT
        FLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVI A+YFVALFALAS   PEP TSTDD S + D DHGTKLPVL TAT
Subjt:  FLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTAT

Query:  AIVTSLAICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGF
        A+VTS AICKFVTWITN+  IQGANLPGITAVVV LAT+ PKQF+YLAPA DTIALILMQVFF VVGASGS+W VINN PSIF+FALVQVTVHLA+IL F
Subjt:  AIVTSLAICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGF

Query:  GKLFSIELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        GKLF I+LKLLLLASNANIGGPTTACGMATAKGWR LVVP+ILAGIFGIAIATFLG+GFGLMILR +
Subjt:  GKLFSIELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM

XP_022157777.1 uncharacterized protein LOC111024399 isoform X1 [Momordica charantia]1.9e-245100Show/hide
Query:  MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL
        MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL
Subjt:  MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL

Query:  FAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM
        FAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM
Subjt:  FAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM

Query:  RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL
        RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL
Subjt:  RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL

Query:  AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI
        AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI
Subjt:  AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI

Query:  ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
Subjt:  ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM

XP_022157784.1 uncharacterized protein LOC111024399 isoform X2 [Momordica charantia]6.3e-241100Show/hide
Query:  MFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTALFAIGALGLW
        MFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTALFAIGALGLW
Subjt:  MFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTALFAIGALGLW

Query:  SEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPMRSLGPDNWK
        SEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPMRSLGPDNWK
Subjt:  SEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPMRSLGPDNWK

Query:  IAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSLAICKFVTWI
        IAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSLAICKFVTWI
Subjt:  IAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSLAICKFVTWI

Query:  TNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLAS
        TNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLAS
Subjt:  TNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLAS

Query:  NANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        NANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
Subjt:  NANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM

XP_022157792.1 uncharacterized protein LOC111024399 isoform X3 [Momordica charantia]5.7e-21089.59Show/hide
Query:  MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL
        MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL
Subjt:  MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL

Query:  FAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM
        FAIGALGLW+    I   V   + S       SN                   PL +       GLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM
Subjt:  FAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM

Query:  RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL
        RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL
Subjt:  RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL

Query:  AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI
        AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI
Subjt:  AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI

Query:  ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
Subjt:  ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM

TrEMBL top hitse value%identityAlignment
A0A1S3AU88 uncharacterized membrane protein YjcL-like isoform X17.9e-19781.58Show/hide
Query:  MASS-IKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPA-----KSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHW
        MASS I N AMFAPH PLLQLPS   HSLLPL HGR SAP      KSS RR N +  LLSPSSSS YRNS        +++VKV +QLRHPIIA DD+W
Subjt:  MASS-IKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPA-----KSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHW

Query:  GTWTALFAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVA
        GTWTALFAIG LG+WSEKTKIGSTVSAALVSTLVGLAASN GIIPYEA+PYSIV++FLLPLSVPLLLFRA +RH++R+TGTLLGVFLLGSV+T+IGTVVA
Subjt:  GTWTALFAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVA

Query:  FLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTAT
        FLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVI A+YFVALFALAS   PEP TSTDD S + D DHGTKLPVL TAT
Subjt:  FLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTAT

Query:  AIVTSLAICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGF
        A+VTS AICKFVTWITN+  IQGANLPGITAVVV LAT+ PKQF+YLAPA DTIALILMQVFF VVGASGS+W VINN PSIF+FALVQVTVHLA+IL F
Subjt:  AIVTSLAICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGF

Query:  GKLFSIELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        GKLF I+LKLLLLASNANIGGPTTACGMATAKGWR LVVP+ILAGIFGIAIATFLG+GFGLMILR +
Subjt:  GKLFSIELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM

A0A5A7TII9 Putative membrane protein YjcL-like isoform X17.9e-19781.58Show/hide
Query:  MASS-IKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPA-----KSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHW
        MASS I N AMFAPH PLLQLPS   HSLLPL HGR SAP      KSS RR N +  LLSPSSSS YRNS        +++VKV +QLRHPIIA DD+W
Subjt:  MASS-IKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPA-----KSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHW

Query:  GTWTALFAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVA
        GTWTALFAIG LG+WSEKTKIGSTVSAALVSTLVGLAASN GIIPYEA+PYSIV++FLLPLSVPLLLFRA +RH++R+TGTLLGVFLLGSV+T+IGTVVA
Subjt:  GTWTALFAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVA

Query:  FLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTAT
        FLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVI A+YFVALFALAS   PEP TSTDD S + D DHGTKLPVL TAT
Subjt:  FLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTAT

Query:  AIVTSLAICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGF
        A+VTS AICKFVTWITN+  IQGANLPGITAVVV LAT+ PKQF+YLAPA DTIALILMQVFF VVGASGS+W VINN PSIF+FALVQVTVHLA+IL F
Subjt:  AIVTSLAICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGF

Query:  GKLFSIELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        GKLF I+LKLLLLASNANIGGPTTACGMATAKGWR LVVP+ILAGIFGIAIATFLG+GFGLMILR +
Subjt:  GKLFSIELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM

A0A6J1DVE0 uncharacterized protein LOC111024399 isoform X19.1e-246100Show/hide
Query:  MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL
        MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL
Subjt:  MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL

Query:  FAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM
        FAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM
Subjt:  FAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM

Query:  RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL
        RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL
Subjt:  RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL

Query:  AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI
        AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI
Subjt:  AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI

Query:  ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
Subjt:  ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM

A0A6J1DVF7 uncharacterized protein LOC111024399 isoform X32.8e-21089.59Show/hide
Query:  MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL
        MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL
Subjt:  MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTAL

Query:  FAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM
        FAIGALGLW+    I   V   + S       SN                   PL +       GLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM
Subjt:  FAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPM

Query:  RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL
        RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL
Subjt:  RSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSL

Query:  AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI
        AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI
Subjt:  AICKFVTWITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSI

Query:  ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
Subjt:  ELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM

A0A6J1DXJ3 uncharacterized protein LOC111024399 isoform X23.0e-241100Show/hide
Query:  MFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTALFAIGALGLW
        MFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTALFAIGALGLW
Subjt:  MFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTALFAIGALGLW

Query:  SEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPMRSLGPDNWK
        SEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPMRSLGPDNWK
Subjt:  SEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPMRSLGPDNWK

Query:  IAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSLAICKFVTWI
        IAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSLAICKFVTWI
Subjt:  IAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSLAICKFVTWI

Query:  TNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLAS
        TNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLAS
Subjt:  TNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLAS

Query:  NANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        NANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
Subjt:  NANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM

SwissProt top hitse value%identityAlignment
O31634 Uncharacterized membrane protein YjcL7.7e-4031.39Show/hide
Query:  HPIIAPDDHWGTW--TALFAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLL
        H +I+ DD W  W   A++A  ++GL  ++ K  S VS A+++    +  +N G++P E+  Y  V  +++PL++PLLLF+  +R + + +  LL +FL+
Subjt:  HPIIAPDDHWGTW--TALFAIGALGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLL

Query:  GSVSTMIGTVVAFLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALAS--------SISPEPATSTDD
         SV T++G+++AF ++      P   KI   +  SYIGG VN+ A++         ++A V ADN + A+ F  L ++ +        ++  E     D 
Subjt:  GSVSTMIGTVVAFLMVPMRSLGPDNWKIAAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALAS--------SISPEPATSTDD

Query:  VSINTDSDHGTK-----LPVLHTATAIVTSLAICKFVT----------WITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTV
         S N+   +  +       +   A A    +A+   V+           +T   G Q   L  +T +++FL   FP+ F  L      +   L+ +FF V
Subjt:  VSINTDSDHGTK-----LPVLHTATAIVTSLAICKFVT----------WITNIYGIQGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTV

Query:  VGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFG
        +G    +  ++ NAP I LF  +    +LAV L  GKLF + L+ +LLA NA +GGPTTA  MA AKGWR LV P +L G  G  I  ++G   G
Subjt:  VGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLASNANIGGPTTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFG

Arabidopsis top hitse value%identityAlignment
AT5G24000.1 Protein of unknown function (DUF819)1.9e-13459.59Show/hide
Query:  PHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTALFAIGALGLWSEKTKIGSTVSAALV
        P+ L P     H +P++  L    ++   +  + S     +      +R R VKV +QLR P+I+PDDHW  W ALFA GA G+WSEKTKIGS VS AL 
Subjt:  PHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTALFAIGALGLWSEKTKIGSTVSAALV

Query:  STLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPMRSLGPDNWKIAAALMGSYIGGSVN
        STL+GLAASN  +IP+E   Y   ++FLLP ++PLLLFRA LR +IRSTG+LL  FL+GSV+T++GTVVAF++VPMRSLGPDNWKIAAALMGSYIGGS+N
Subjt:  STLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPMRSLGPDNWKIAAALMGSYIGGSVN

Query:  YVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPAT-STDDVSINTDSDHGTKLPVLHTATAIVTSLAICKFVTWITNIYGIQGANLPGI
        +VAISEAL +SPSV+AAGVA DNVICA++F+ LFALAS I PE A+ S+ D  +  D     K  V+ T+ A+  S  ICK    +T ++ IQG  LP +
Subjt:  YVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPAT-STDDVSINTDSDHGTKLPVLHTATAIVTSLAICKFVTWITNIYGIQGANLPGI

Query:  TAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLASNANIGGPTTACGMA
        TA+ + LAT FP  F+ LAP+ +TI+LILMQVFFT++GA+GS+WNVIN APSIFLFA +QV VHLAV L  GKLF I++KLLLLASNANIGGPTTAC MA
Subjt:  TAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLASNANIGGPTTACGMA

Query:  TAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        TAKGW  LVVP IL+G+FG++IATFLGIG G+ +L+R+
Subjt:  TAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM

AT5G52540.1 Protein of unknown function (DUF819)1.6e-13863.15Show/hide
Query:  RPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNST----RTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTALFAIGALGLWSEKTKIGSTV
        R  S +  R       AKS+  R++      SP+S S +RN T       + +  RSV V + L  P+I+P+D WGTWTALFA GALGLWSEKTK+G+ +
Subjt:  RPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNST----RTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTALFAIGALGLWSEKTKIGSTV

Query:  SAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPMRSLGPDNWKIAAALMGSYI
        S ALVSTLVGLAASN GII  +A  +++VL FLLPL+VPLLLFRA LR V++STG LL  FL+GSV+T +GT +A+ +VPM+SLGPD+WKIAAALMG +I
Subjt:  SAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPMRSLGPDNWKIAAALMGSYI

Query:  GGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPE---PATSTDDVSINTDSDHGTKLPVLHTATAIVTSLAICKFVTWITNIYGIQ
        GG+VNYVAIS ALGV+PSVLAAG+AADNVICA+YF  LFAL S I  E   P T+  D   N  S+   K+PVL  AT I  SLAICK    +T  +GI 
Subjt:  GGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPE---PATSTDDVSINTDSDHGTKLPVLHTATAIVTSLAICKFVTWITNIYGIQ

Query:  GANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLASNANIGGP
        G +LP ITAVVV LATVFP QF  LAP+G+ +ALILMQVFFTVVGASG+IW+VIN APSIFLFALVQ+  HLAVILG GKL +IEL+LLLLASNAN+GGP
Subjt:  GANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLASNANIGGP

Query:  TTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM
        TTA GMATAKGW  L+VP ILAGIFGIAIATF+GI FG+ +L+ M
Subjt:  TTACGMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCTCGATAAAGAACTCAGCCATGTTCGCGCCTCATTCTCCATTGCTGCAACTACCATCACTTCGCCCTCACTCGCTTCTCCCTCTTCGCCATGGCCGC
CACTCTGCTCCTGCTAAATCCTCATTACGGAGAGTAAACGCCAATGAAGAGCTTCTCTCGCCTTCGTCTTCCTCTGGCTACAGGAACTCGACTCGTACTCGGACT
CGGACTCGGACTCGGTCTGTGAAGGTTATGGCGCAGCTGAGGCATCCAATTATTGCGCCGGATGACCACTGGGGCACATGGACTGCTTTGTTTGCTATCGGCGCA
TTGGGGCTCTGGTCTGAGAAGACCAAGATTGGTAGCACAGTGAGTGCTGCTTTGGTTAGCACTTTGGTTGGCTTGGCTGCCAGTAACTCTGGCATAATCCCTTAT
GAAGCCTTACCTTATTCAATCGTCCTGCAATTTCTGCTTCCATTATCTGTTCCATTGCTTCTATTTCGAGCAGGCCTGCGTCACGTAATCCGATCAACTGGAACA
TTGCTTGGAGTTTTCCTGCTTGGATCAGTTTCAACTATGATAGGAACTGTTGTGGCATTTCTCATGGTGCCTATGAGATCACTTGGTCCTGACAATTGGAAAATA
GCTGCTGCTCTCATGGGTAGTTATATTGGTGGATCTGTTAATTATGTTGCAATTTCAGAGGCTCTTGGTGTTTCTCCATCAGTTTTAGCAGCAGGAGTGGCTGCT
GACAATGTCATCTGTGCTATATACTTTGTGGCCCTTTTTGCATTGGCTTCTAGTATATCTCCCGAACCTGCAACATCAACTGACGATGTTTCAATCAATACGGAC
TCTGATCATGGCACGAAGCTCCCTGTGTTGCACACTGCTACTGCCATTGTTACATCATTAGCAATATGCAAGTTTGTTACATGGATTACAAATATATATGGAATT
CAAGGTGCCAATCTTCCTGGGATAACTGCAGTAGTCGTCTTTTTAGCTACAGTTTTCCCAAAACAGTTCAGTTATTTAGCACCCGCTGGAGATACCATTGCCCTG
ATTCTTATGCAGGTATTTTTTACCGTTGTTGGTGCTAGTGGGAGTATATGGAATGTCATCAACAATGCACCAAGTATTTTCCTGTTTGCTCTAGTTCAAGTTACA
GTCCATCTTGCTGTAATTCTTGGGTTTGGAAAGCTATTCAGCATTGAACTAAAGCTCTTGCTTCTTGCTTCAAATGCCAATATTGGAGGCCCAACAACAGCATGT
GGAATGGCAACTGCCAAGGGTTGGAGACCTTTGGTGGTTCCTGCTATTCTTGCTGGCATTTTCGGTATCGCAATTGCTACTTTTCTGGGCATCGGTTTTGGATTG
ATGATCCTCAGGCGCATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCCTCGATAAAGAACTCAGCCATGTTCGCGCCTCATTCTCCATTGCTGCAACTACCATCACTTCGCCCTCACTCGCTTCTCCCTCTTCGCCATGGCCGC
CACTCTGCTCCTGCTAAATCCTCATTACGGAGAGTAAACGCCAATGAAGAGCTTCTCTCGCCTTCGTCTTCCTCTGGCTACAGGAACTCGACTCGTACTCGGACT
CGGACTCGGACTCGGTCTGTGAAGGTTATGGCGCAGCTGAGGCATCCAATTATTGCGCCGGATGACCACTGGGGCACATGGACTGCTTTGTTTGCTATCGGCGCA
TTGGGGCTCTGGTCTGAGAAGACCAAGATTGGTAGCACAGTGAGTGCTGCTTTGGTTAGCACTTTGGTTGGCTTGGCTGCCAGTAACTCTGGCATAATCCCTTAT
GAAGCCTTACCTTATTCAATCGTCCTGCAATTTCTGCTTCCATTATCTGTTCCATTGCTTCTATTTCGAGCAGGCCTGCGTCACGTAATCCGATCAACTGGAACA
TTGCTTGGAGTTTTCCTGCTTGGATCAGTTTCAACTATGATAGGAACTGTTGTGGCATTTCTCATGGTGCCTATGAGATCACTTGGTCCTGACAATTGGAAAATA
GCTGCTGCTCTCATGGGTAGTTATATTGGTGGATCTGTTAATTATGTTGCAATTTCAGAGGCTCTTGGTGTTTCTCCATCAGTTTTAGCAGCAGGAGTGGCTGCT
GACAATGTCATCTGTGCTATATACTTTGTGGCCCTTTTTGCATTGGCTTCTAGTATATCTCCCGAACCTGCAACATCAACTGACGATGTTTCAATCAATACGGAC
TCTGATCATGGCACGAAGCTCCCTGTGTTGCACACTGCTACTGCCATTGTTACATCATTAGCAATATGCAAGTTTGTTACATGGATTACAAATATATATGGAATT
CAAGGTGCCAATCTTCCTGGGATAACTGCAGTAGTCGTCTTTTTAGCTACAGTTTTCCCAAAACAGTTCAGTTATTTAGCACCCGCTGGAGATACCATTGCCCTG
ATTCTTATGCAGGTATTTTTTACCGTTGTTGGTGCTAGTGGGAGTATATGGAATGTCATCAACAATGCACCAAGTATTTTCCTGTTTGCTCTAGTTCAAGTTACA
GTCCATCTTGCTGTAATTCTTGGGTTTGGAAAGCTATTCAGCATTGAACTAAAGCTCTTGCTTCTTGCTTCAAATGCCAATATTGGAGGCCCAACAACAGCATGT
GGAATGGCAACTGCCAAGGGTTGGAGACCTTTGGTGGTTCCTGCTATTCTTGCTGGCATTTTCGGTATCGCAATTGCTACTTTTCTGGGCATCGGTTTTGGATTG
ATGATCCTCAGGCGCATGTAG
Protein sequenceShow/hide protein sequence
MASSIKNSAMFAPHSPLLQLPSLRPHSLLPLRHGRHSAPAKSSLRRVNANEELLSPSSSSGYRNSTRTRTRTRTRSVKVMAQLRHPIIAPDDHWGTWTALFAIGA
LGLWSEKTKIGSTVSAALVSTLVGLAASNSGIIPYEALPYSIVLQFLLPLSVPLLLFRAGLRHVIRSTGTLLGVFLLGSVSTMIGTVVAFLMVPMRSLGPDNWKI
AAALMGSYIGGSVNYVAISEALGVSPSVLAAGVAADNVICAIYFVALFALASSISPEPATSTDDVSINTDSDHGTKLPVLHTATAIVTSLAICKFVTWITNIYGI
QGANLPGITAVVVFLATVFPKQFSYLAPAGDTIALILMQVFFTVVGASGSIWNVINNAPSIFLFALVQVTVHLAVILGFGKLFSIELKLLLLASNANIGGPTTAC
GMATAKGWRPLVVPAILAGIFGIAIATFLGIGFGLMILRRM