; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC00g0239 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC00g0239
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF707)
Genome locationscaffold99:731711..736749
RNA-Seq ExpressionMC00g0239
SyntenyMC00g0239
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007877 - Protein of unknown function DUF707


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008464732.2 PREDICTED: uncharacterized protein LOC103502548, partial [Cucumis melo]3.01e-24279.8Show/hide
Query:  NCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKNL
        N ILS+SK+RL + TFF A+ LGAGVYFIASEFITKE  RWEVFY+ARNVKSSTCK++CRPPGSE+LPEGI+SKTSNFE QPLWG S++QNK P+A KNL
Subjt:  NCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKNL

Query:  LAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKE
        LA+AVGIKQ+HVVS+I+EKFP DDFDV+LFHYDG+VDEW++ +W    +H+SALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVE FDPKRY+SILKE
Subjt:  LAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKE

Query:  EGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLK
        EGLEISQPALDPVKSKVHQ LTARKT SKVHRRFYNFKG GRC ANST PPC GWVEMMAPVFSRA WRCTWYMIQNDLIHAWGLDRQLGYCAQGDRT K
Subjt:  EGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLK

Query:  VGVVDAEYIVHLGLPTLGTSHGNVF----KLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDR
        VGVVDAEYIVHLGLPTLG SH N      KL  +   TK  FP  F QL+SNA  LS+KDSSN D SEP+VDNRV+VR+QSS+EMQIFK+RW DAAK DR
Subjt:  VGVVDAEYIVHLGLPTLGTSHGNVF----KLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDR

Query:  CWIDPY
        CWIDPY
Subjt:  CWIDPY

XP_022157819.1 uncharacterized protein LOC111024438 isoform X1 [Momordica charantia]9.78e-28894.8Show/hide
Query:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN
         NCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN
Subjt:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN

Query:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
        LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
Subjt:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK

Query:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL
        EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL
Subjt:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL

Query:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWI
        KVGVVDAEYIVHLGLPTLGTSHGNV                    LNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWI
Subjt:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWI

Query:  DPYR
        DPYR
Subjt:  DPYR

XP_022157820.1 uncharacterized protein LOC111024438 isoform X2 [Momordica charantia]1.67e-26194.55Show/hide
Query:  EFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIV
        EFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIV
Subjt:  EFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIV

Query:  DEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYN
        DEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYN
Subjt:  DEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYN

Query:  FKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLKVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKP
        FKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLKVGVVDAEYIVHLGLPTLGTSHGNV            
Subjt:  FKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLKVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKP

Query:  TFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWIDPYR
                LNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWIDPYR
Subjt:  TFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWIDPYR

XP_022943062.1 uncharacterized protein LOC111447910 isoform X1 [Cucurbita moschata]2.49e-24280.25Show/hide
Query:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN
        QN IL ESK+RL LCTFF+A+ILGAGVYFIASEFITKE  RWEVFY+ARNV SS CK++CRPPGSE LPEGIVSKTSNFE QPLWG ST+ NK P+  KN
Subjt:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN

Query:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
        LL++AVGI Q+H+VSKIVEKFPRDDFDV+LFHYDG+VDEWKD +WS   +H+S+LNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
Subjt:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK

Query:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL
        EEGLEISQPALDPVKSKVHQALTARKT SKVHRRFYN KG  RCDANST PPC GWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRT 
Subjt:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL

Query:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQK-DSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCW
        KVGVVDAEYIVHLGLPTLG S+GNV                    LN++APA SQK + SNF+  E KVDNRV+VRIQSS+EMQIFKERWT+AAK+DRCW
Subjt:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQK-DSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCW

Query:  IDPYR
        IDPYR
Subjt:  IDPYR

XP_022975263.1 uncharacterized protein LOC111474390 isoform X1 [Cucurbita maxima]2.49e-24280.49Show/hide
Query:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN
        QN IL ESK+RL LCTFF+A+ILGAGVYFIASEFITKE  RWEVFY+ARNV SS CK++CR PGSE LPEGIVSKTSNFELQPLWG ST+ NK P+  KN
Subjt:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN

Query:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
        LL++AVGIKQ+HVVSKIVEKFPRDDFDVMLFHYDG+VDEW+D +WS   +H+S+LNQTKWWFAKRFLHPDIV+EYNYIFLWDEDLGVENFDPKRYISILK
Subjt:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK

Query:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL
        EEGLEISQPALDPVKSKVHQALTARKT SKVHRRFYN KG  RCDANST PPC GWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRT 
Subjt:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL

Query:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPA-LSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCW
        KVGVVDAEYIVHLGLPTLG S+ NV                    LN++APA L +K+ SNF+ SEPKVDNRV+VRIQSS+EMQIFKERWT+AAK+DRCW
Subjt:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPA-LSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCW

Query:  IDPYR
        IDPYR
Subjt:  IDPYR

TrEMBL top hitse value%identityAlignment
A0A1S3CM49 uncharacterized protein LOC1035025481.46e-24279.8Show/hide
Query:  NCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKNL
        N ILS+SK+RL + TFF A+ LGAGVYFIASEFITKE  RWEVFY+ARNVKSSTCK++CRPPGSE+LPEGI+SKTSNFE QPLWG S++QNK P+A KNL
Subjt:  NCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKNL

Query:  LAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKE
        LA+AVGIKQ+HVVS+I+EKFP DDFDV+LFHYDG+VDEW++ +W    +H+SALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVE FDPKRY+SILKE
Subjt:  LAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKE

Query:  EGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLK
        EGLEISQPALDPVKSKVHQ LTARKT SKVHRRFYNFKG GRC ANST PPC GWVEMMAPVFSRA WRCTWYMIQNDLIHAWGLDRQLGYCAQGDRT K
Subjt:  EGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLK

Query:  VGVVDAEYIVHLGLPTLGTSHGNVF----KLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDR
        VGVVDAEYIVHLGLPTLG SH N      KL  +   TK  FP  F QL+SNA  LS+KDSSN D SEP+VDNRV+VR+QSS+EMQIFK+RW DAAK DR
Subjt:  VGVVDAEYIVHLGLPTLGTSHGNVF----KLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDR

Query:  CWIDPY
        CWIDPY
Subjt:  CWIDPY

A0A6J1DU47 uncharacterized protein LOC111024438 isoform X28.08e-26294.55Show/hide
Query:  EFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIV
        EFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIV
Subjt:  EFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIV

Query:  DEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYN
        DEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYN
Subjt:  DEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYN

Query:  FKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLKVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKP
        FKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLKVGVVDAEYIVHLGLPTLGTSHGNV            
Subjt:  FKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLKVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKP

Query:  TFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWIDPYR
                LNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWIDPYR
Subjt:  TFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWIDPYR

A0A6J1DXM8 uncharacterized protein LOC111024438 isoform X14.73e-28894.8Show/hide
Query:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN
         NCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN
Subjt:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN

Query:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
        LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
Subjt:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK

Query:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL
        EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL
Subjt:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL

Query:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWI
        KVGVVDAEYIVHLGLPTLGTSHGNV                    LNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWI
Subjt:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWI

Query:  DPYR
        DPYR
Subjt:  DPYR

A0A6J1FRZ3 uncharacterized protein LOC111447910 isoform X11.20e-24280.25Show/hide
Query:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN
        QN IL ESK+RL LCTFF+A+ILGAGVYFIASEFITKE  RWEVFY+ARNV SS CK++CRPPGSE LPEGIVSKTSNFE QPLWG ST+ NK P+  KN
Subjt:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN

Query:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
        LL++AVGI Q+H+VSKIVEKFPRDDFDV+LFHYDG+VDEWKD +WS   +H+S+LNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
Subjt:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK

Query:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL
        EEGLEISQPALDPVKSKVHQALTARKT SKVHRRFYN KG  RCDANST PPC GWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRT 
Subjt:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL

Query:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQK-DSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCW
        KVGVVDAEYIVHLGLPTLG S+GNV                    LN++APA SQK + SNF+  E KVDNRV+VRIQSS+EMQIFKERWT+AAK+DRCW
Subjt:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQK-DSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCW

Query:  IDPYR
        IDPYR
Subjt:  IDPYR

A0A6J1IIQ2 uncharacterized protein LOC111474390 isoform X11.20e-24280.49Show/hide
Query:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN
        QN IL ESK+RL LCTFF+A+ILGAGVYFIASEFITKE  RWEVFY+ARNV SS CK++CR PGSE LPEGIVSKTSNFELQPLWG ST+ NK P+  KN
Subjt:  QNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN

Query:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
        LL++AVGIKQ+HVVSKIVEKFPRDDFDVMLFHYDG+VDEW+D +WS   +H+S+LNQTKWWFAKRFLHPDIV+EYNYIFLWDEDLGVENFDPKRYISILK
Subjt:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK

Query:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL
        EEGLEISQPALDPVKSKVHQALTARKT SKVHRRFYN KG  RCDANST PPC GWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRT 
Subjt:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL

Query:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPA-LSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCW
        KVGVVDAEYIVHLGLPTLG S+ NV                    LN++APA L +K+ SNF+ SEPKVDNRV+VRIQSS+EMQIFKERWT+AAK+DRCW
Subjt:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPA-LSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCW

Query:  IDPYR
        IDPYR
Subjt:  IDPYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11170.1 Protein of unknown function (DUF707)6.3e-9750.59Show/hide
Query:  EALPEGIVSKTSNFELQPLWGGSTMQNK-IPRALKNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAK
        + LP GI+   S+ EL+PLW   ++++K +    +NLLAI VG+KQK  V  +V+KF   +F ++LFHYDG +D+W DL WS   IHI A NQTKWWFAK
Subjt:  EALPEGIVSKTSNFELQPLWGGSTMQNK-IPRALKNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAK

Query:  RFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVF
        RFLHPD+V+ Y+YIFLWDEDLGVENF+P+RY+ I+K  GLEISQPALD   +++H  +T R    K HRR Y  +G  RC   S+ PPC G+VE MAPVF
Subjt:  RFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVF

Query:  SRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLKVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDAS
        S+AAW CTW +IQNDL+H WG+D +LGYCAQGDRT  VG+VD+EYI+H G+ TLG S            + K T                 +D       
Subjt:  SRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLKVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDAS

Query:  EPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWIDP
            D+R ++R QS+ E+Q FKERW+ A ++D  WIDP
Subjt:  EPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWIDP

AT1G61240.1 Protein of unknown function (DUF707)1.2e-9549.85Show/hide
Query:  EALPEGIVSKTSNFELQPLWGGSTMQNKIPRAL-KNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAK
        + LP GI+   S+ EL+PLW  S++++K      +NLLA+ VG+KQK  V  +V+KF   +F V+LFHYDG +D+W DL WS   IHI A NQTKWWFAK
Subjt:  EALPEGIVSKTSNFELQPLWGGSTMQNKIPRAL-KNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAK

Query:  RFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVF
        RFLHPDIV+ Y+Y+FLWDEDLGVENF+P++Y+ I+K  GLEISQPAL P  ++VH  +T R      HRR Y+ +G  +C   S  PPC G+VE MAPVF
Subjt:  RFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVF

Query:  SRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLKVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDAS
        SR+AW CTW +IQNDL+H WG+D +LGYCAQGDR+ KVG+VD+EYI H G+ TLG S                   YP  + ++ +    ++ S+ F   
Subjt:  SRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLKVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDAS

Query:  EPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWID
            D+R ++R QS+ E+Q FKERW  A  +D+ W++
Subjt:  EPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWID

AT4G12840.1 Protein of unknown function (DUF707)2.0e-11952.23Show/hide
Query:  LSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLR----WEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN
        +++ +  L L   F  +   A ++ I + FIT ++      W         K   CK + RPPGSE LP GIV+ TS+ E++PLWG    ++K P+   +
Subjt:  LSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLR----WEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN

Query:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
        LLA+AVGI+QK  V+KIV+KFP  +F VMLFHYDG VDEWK+  WS   IHIS +NQTKWWFAKRFLHPDIV+ Y+YIFLWDEDLGV++FD +RY+SI+K
Subjt:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK

Query:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL
        EE LEISQPALDP  S+VH  LT+R  +S+VHRR Y   G  RC+ NST PPC G+VEMMAPVFSRAAWRCTW+MIQNDL H WG+D QLGYCAQGDRT 
Subjt:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL

Query:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWI
         +G+VD+EYI+H+GLPTLG   G+    +D     K   P+                    D S      R +VR Q+ +E++ FK RW +A K D CWI
Subjt:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWI

Query:  DPYR
        D ++
Subjt:  DPYR

AT4G12840.2 Protein of unknown function (DUF707)2.0e-11952.23Show/hide
Query:  LSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLR----WEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN
        +++ +  L L   F  +   A ++ I + FIT ++      W         K   CK + RPPGSE LP GIV+ TS+ E++PLWG    ++K P+   +
Subjt:  LSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLR----WEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN

Query:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK
        LLA+AVGI+QK  V+KIV+KFP  +F VMLFHYDG VDEWK+  WS   IHIS +NQTKWWFAKRFLHPDIV+ Y+YIFLWDEDLGV++FD +RY+SI+K
Subjt:  LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILK

Query:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL
        EE LEISQPALDP  S+VH  LT+R  +S+VHRR Y   G  RC+ NST PPC G+VEMMAPVFSRAAWRCTW+MIQNDL H WG+D QLGYCAQGDRT 
Subjt:  EEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTL

Query:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWI
         +G+VD+EYI+H+GLPTLG   G+    +D     K   P+                    D S      R +VR Q+ +E++ FK RW +A K D CWI
Subjt:  KVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWI

Query:  DPYR
        D ++
Subjt:  DPYR

AT4G18530.1 Protein of unknown function (DUF707)2.4e-13355.75Show/hide
Query:  SKSRLHLCTFFIAIILGAGVYFIASEFITKEF----LRWEVFYTARN--------VKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIP
        S +R  LC+  I   L  G YFI + ++ K+F    L+WE+     N          +STCK+  +P G+EALP+GI+ KTSN E Q LW     + + P
Subjt:  SKSRLHLCTFFIAIILGAGVYFIASEFITKEF----LRWEVFYTARN--------VKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIP

Query:  RALKNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
            +LLA+AVGIKQK +V+K+++KFP  DF VMLFHYDG+VD+WK   W+ H IH+S +NQTKWWFAKRFLHPDIVAEY YIFLWDEDLGV +F+P+RY
Subjt:  RALKNLLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY

Query:  ISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
        +SI+KEEGLEISQPALD  KS+VH  +TAR+ +SKVHRR Y +KG GRCD +ST PPC GWVEMMAPVFSRAAWRC+WYMIQNDLIHAWGLD QLGYCAQ
Subjt:  ISILKEEGLEISQPALDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ

Query:  GDRTLKVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPK-VDNRVQVRIQSSIEMQIFKERWTDAAK
        GDR   VGVVDAEYI+H GLPTLG                            +++   ++ DS + ++ E + VDNR +VR++S +EM+ FKERW  A +
Subjt:  GDRTLKVGVVDAEYIVHLGLPTLGTSHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPK-VDNRVQVRIQSSIEMQIFKERWTDAAK

Query:  KDRCWIDPY
         D CW+DPY
Subjt:  KDRCWIDPY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGGTCTTCGTTTTTTTATGTTCTGTGGTTACAGAATTGCATACTATCGGAGTCTAAGAGTAGATTGCATCTCTGTACTTTCTTCATCGCCATAATCCTCGGTGCAGGAGT
TTATTTCATTGCAAGTGAATTTATTACGAAGGAATTTTTGAGATGGGAGGTATTTTATACAGCACGTAATGTTAAATCCAGCACATGCAAGGATCGATGCAGGCCTCCTG
GGAGTGAGGCTTTGCCAGAAGGAATCGTTAGTAAAACATCTAACTTCGAGTTGCAGCCTCTATGGGGCGGCTCGACCATGCAAAATAAAATTCCGAGGGCTTTGAAGAAC
TTGTTAGCTATTGCTGTTGGAATCAAACAAAAACATGTAGTGTCAAAAATTGTTGAGAAGTTCCCTCGAGATGATTTCGACGTGATGCTTTTTCATTATGATGGCATTGT
GGATGAATGGAAGGATTTAGCTTGGAGTCCTCATCCAATTCACATTTCGGCATTGAACCAAACAAAATGGTGGTTTGCCAAGCGTTTCTTGCACCCGGATATAGTTGCTG
AATATAATTATATATTTCTTTGGGATGAGGACCTGGGCGTCGAGAATTTTGACCCAAAACGATATATATCAATCCTCAAGGAGGAGGGGCTTGAGATATCACAACCAGCT
CTCGATCCGGTTAAGTCCAAGGTTCACCAGGCGCTTACTGCACGGAAAACCAGATCGAAAGTTCACAGAAGGTTTTACAACTTCAAAGGCATGGGACGGTGCGATGCTAA
TAGCACGGCTCCTCCATGTGCAGGATGGGTGGAAATGATGGCTCCTGTGTTCTCAAGGGCAGCATGGAGATGCACATGGTATATGATTCAGAATGACTTGATCCATGCTT
GGGGCTTAGATAGGCAGCTTGGCTATTGTGCACAAGGCGACAGAACACTAAAAGTCGGTGTCGTCGATGCAGAATACATAGTTCATTTAGGCCTACCTACGCTCGGCACT
TCTCACGGCAATGTGTTCAAATTATCGGATATATGTTCCGACACTAAACCGACCTTCCCATACCCATTCATGCAGCTGAATTCCAATGCTCCAGCTCTTTCCCAGAAGGA
CTCGTCAAACTTCGATGCATCGGAACCCAAAGTTGATAATAGAGTTCAAGTGAGGATACAGTCTTCCATAGAAATGCAGATCTTCAAGGAACGATGGACCGATGCTGCAA
AGAAGGATAGATGTTGGATCGACCCGTATCGATAG
mRNA sequenceShow/hide mRNA sequence
GGGTCTTCGTTTTTTTATGTTCTGTGGTTACAGAATTGCATACTATCGGAGTCTAAGAGTAGATTGCATCTCTGTACTTTCTTCATCGCCATAATCCTCGGTGCAGGAGT
TTATTTCATTGCAAGTGAATTTATTACGAAGGAATTTTTGAGATGGGAGGTATTTTATACAGCACGTAATGTTAAATCCAGCACATGCAAGGATCGATGCAGGCCTCCTG
GGAGTGAGGCTTTGCCAGAAGGAATCGTTAGTAAAACATCTAACTTCGAGTTGCAGCCTCTATGGGGCGGCTCGACCATGCAAAATAAAATTCCGAGGGCTTTGAAGAAC
TTGTTAGCTATTGCTGTTGGAATCAAACAAAAACATGTAGTGTCAAAAATTGTTGAGAAGTTCCCTCGAGATGATTTCGACGTGATGCTTTTTCATTATGATGGCATTGT
GGATGAATGGAAGGATTTAGCTTGGAGTCCTCATCCAATTCACATTTCGGCATTGAACCAAACAAAATGGTGGTTTGCCAAGCGTTTCTTGCACCCGGATATAGTTGCTG
AATATAATTATATATTTCTTTGGGATGAGGACCTGGGCGTCGAGAATTTTGACCCAAAACGATATATATCAATCCTCAAGGAGGAGGGGCTTGAGATATCACAACCAGCT
CTCGATCCGGTTAAGTCCAAGGTTCACCAGGCGCTTACTGCACGGAAAACCAGATCGAAAGTTCACAGAAGGTTTTACAACTTCAAAGGCATGGGACGGTGCGATGCTAA
TAGCACGGCTCCTCCATGTGCAGGATGGGTGGAAATGATGGCTCCTGTGTTCTCAAGGGCAGCATGGAGATGCACATGGTATATGATTCAGAATGACTTGATCCATGCTT
GGGGCTTAGATAGGCAGCTTGGCTATTGTGCACAAGGCGACAGAACACTAAAAGTCGGTGTCGTCGATGCAGAATACATAGTTCATTTAGGCCTACCTACGCTCGGCACT
TCTCACGGCAATGTGTTCAAATTATCGGATATATGTTCCGACACTAAACCGACCTTCCCATACCCATTCATGCAGCTGAATTCCAATGCTCCAGCTCTTTCCCAGAAGGA
CTCGTCAAACTTCGATGCATCGGAACCCAAAGTTGATAATAGAGTTCAAGTGAGGATACAGTCTTCCATAGAAATGCAGATCTTCAAGGAACGATGGACCGATGCTGCAA
AGAAGGATAGATGTTGGATCGACCCGTATCGATAGTTCCCGATTCGAGCAATGCGGTTATTTTGAAGGGAAGGTGTTCCATTCTCCACACAGATTTTGAAGGGAAGGTGT
TCCATTCTCCACACAGATTCTGGTTGTCTCAGAACAAATTATACCCAACAAAATTCAAATCACTGAGGAGCCTTTTCATTGGAAACAATCTCACTTTATTTTTTCCTTTG
TATACAAAATAAAAGTTTTTTTTTTCTCTTTCTTTATGGGGTTTGTAAATGCATTATAGTTTTCCTGGAAAATTTAGGACTTTCCTTTAGAAAATATGAGCCGTTGGAAA
GGTTTCATTGTAATATGAATAATGTAAAGTTGTTTTGTGAAATTGTTTTAGCTTCTGCCATTTGTTACCAGAACAGAACTCAGATTGGTATAGAAAATTGACAGAGTAAG
TATACTTCTTAAAAATAGGGTTAGATTACG
Protein sequenceShow/hide protein sequence
GSSFFYVLWLQNCILSESKSRLHLCTFFIAIILGAGVYFIASEFITKEFLRWEVFYTARNVKSSTCKDRCRPPGSEALPEGIVSKTSNFELQPLWGGSTMQNKIPRALKN
LLAIAVGIKQKHVVSKIVEKFPRDDFDVMLFHYDGIVDEWKDLAWSPHPIHISALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKEEGLEISQPA
LDPVKSKVHQALTARKTRSKVHRRFYNFKGMGRCDANSTAPPCAGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTLKVGVVDAEYIVHLGLPTLGT
SHGNVFKLSDICSDTKPTFPYPFMQLNSNAPALSQKDSSNFDASEPKVDNRVQVRIQSSIEMQIFKERWTDAAKKDRCWIDPYR