; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g00870 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g00870
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of unknown function (DUF3537)
Genome locationchr8:516897..519130
RNA-Seq ExpressionMoc08g00870
SyntenyMoc08g00870
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598276.1 hypothetical protein SDJN03_08054, partial [Cucurbita argyrosperma subsp. sororia]7.5e-20786.55Show/hide
Query:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKD
        MA+ RD VDI+DP A +PLL+S Q+Q+S+ TGRE+D EEAHLDSAL+L D LL FLGFHQSSVLSC LSW+GFVLVGIVLPVV+LQLTDCA C++YQIKD
Subjt:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYGIKRFL+VDRQ SSLA FRKDYV+KI GSIRLLVFWAL CFILKA REVIRILYAERVSWG+S+AILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS

Query:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS
        WTY+SLIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL FLVVTASQFMTLFQTT YSAM+TLIN GDFAVS
Subjt:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA
        AIVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGPGDVSQ RH NG+GNS++P  RLNSMT  YSESDLESLDI+TMPTTTQLASYM+SYHKR+A
Subjt:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA

Query:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        FVMYLQMNPGGITIFGWTV+RAL+NTIFFIELTLVTFVLGKT+VF+
Subjt:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

XP_022131489.1 uncharacterized protein LOC111004677 isoform X1 [Momordica charantia]8.3e-214100Show/hide
Query:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF
        MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF
Subjt:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
        IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF

XP_022131491.1 uncharacterized protein LOC111004677 isoform X2 [Momordica charantia]2.0e-239100Show/hide
Query:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF
        MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF
Subjt:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
        IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
Subjt:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

XP_022996746.1 uncharacterized protein LOC111491891 [Cucurbita maxima]1.8e-20585.87Show/hide
Query:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKD
        MA++RD VDI+DP A +PLL+S Q+Q+S+ TGRE+D EEAHLDSAL+L D LL FLGFHQSSVLSC LSW+GFVLVGIVLPVV+LQLTDCA C++YQIKD
Subjt:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYGIKRFL+VDRQ SSLA FRKDYV+KI GSIRLLVFWAL CFILKA REVIRILY ERV+WG+S+AILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS

Query:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS
        WTY+SLIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL FLVVTASQFMTLFQTT YSAM+TLIN GDFAVS
Subjt:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA
        AIVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGPGDVSQ RH NG+GNS++P  RLNSM   YSESDLESLDI+TMPTTTQLASYM+SYHKR+A
Subjt:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA

Query:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        FVMYLQMNPGGITIFGWTV+RAL+NTIFFIELTLVTFVLGKT+VF+
Subjt:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

XP_023547131.1 uncharacterized protein LOC111806032 [Cucurbita pepo subsp. pepo]2.8e-20686.32Show/hide
Query:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKD
        MA+ RD VDI+DP A +PLL+S Q+Q+S+ TGRE+D EEAHLDSAL+L D LL FLGFHQSSVLSC LSW+GFVLVGIVLPVV+LQLTDCA C++YQIKD
Subjt:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYGIKRFL+VDRQ SSLA FRKDYV+KI GSIRLLVFWAL CFILKA REVIRILYAERVSWG+S+AILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS

Query:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS
        WTY+SLIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL FLVVTASQFMTLFQTT YSA++TLIN GDFAVS
Subjt:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA
        AIVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGPGDVSQ RH NG+GNS++P  RLNSMT  YSESDLESLDI+TMPTTTQLASYM+SYHKR+A
Subjt:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA

Query:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        FVMYLQMNPGGITIFGWTV+RAL+NTIFFIELTLVTFVLGKT+VF+
Subjt:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

TrEMBL top hitse value%identityAlignment
A0A5A7V5N0 Uncharacterized protein4.7e-19981.8Show/hide
Query:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF
        MAE RDQVD+EDP A IPLLES  +Q+S+PT  EDDEEAHLDSA +L D LL  LGFHQSSV SC LSW+ FVLVG+VLPVVVLQL+DCA  EKYQIKDF
Subjt:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        ELD+VAS ACLAAVSLLCLSHNLRKYGI RFL+VDRQ +SLARFRK+YV+KI GSIRLL+FWAL CF+LK  REVIRILYAER+SWG+S+A +LAM ISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TY++LIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVL+LIE+HIFLRYHLSKISHRFRIFLLL FLVV+A+QFMTLFQTT Y+  +TL+NGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
        IVQVVGVILCLHGATKISHRA+G ASVASRWHALVTCGPG+VSQPR+ NGNGNSD+P  RL SMT  YSESDLESLDI+TMPTTTQLASYM+SYHKR+AF
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        VMYLQMNPGGITIFGWTV+RAL+NTIFF+ELTLVTFVLGKT+VF+
Subjt:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

A0A6J1BPM9 uncharacterized protein LOC111004677 isoform X29.5e-240100Show/hide
Query:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF
        MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF
Subjt:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
        IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
Subjt:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

A0A6J1BQD8 uncharacterized protein LOC111004677 isoform X14.0e-214100Show/hide
Query:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF
        MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF
Subjt:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
        IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF

A0A6J1HD22 uncharacterized protein LOC1114628591.2e-20586.1Show/hide
Query:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKD
        MA+ RD VDI+DP A +PLL+S Q+Q+S+ T RE+D EEAHLDSAL+L D LL FLGFHQSSVLSC LSW+GFVLVGIVLPVV+LQLTDCA C+++QIKD
Subjt:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYGIKRFL+VDRQ SSLA FRKDYV+KI GSIRLLVFWAL CFILKA REVIRILYAERVSWG+S+AILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS

Query:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS
        WTY+SLIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL FLVVTASQFMTLFQTT YSAM+TLIN GDFAVS
Subjt:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA
        AIVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGPGDVSQ RH NG+GNS++P  RLNSMT  YSESDLESLDI+TMPTTTQLASYM+SYHKR+A
Subjt:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA

Query:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        FVMYLQMNPGGITIFGWTV+RAL+NTIFFIELTLVTFVLGKT+VF+
Subjt:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

A0A6J1K5M7 uncharacterized protein LOC1114918918.9e-20685.87Show/hide
Query:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKD
        MA++RD VDI+DP A +PLL+S Q+Q+S+ TGRE+D EEAHLDSAL+L D LL FLGFHQSSVLSC LSW+GFVLVGIVLPVV+LQLTDCA C++YQIKD
Subjt:  MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYGIKRFL+VDRQ SSLA FRKDYV+KI GSIRLLVFWAL CFILKA REVIRILY ERV+WG+S+AILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS

Query:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS
        WTY+SLIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL FLVVTASQFMTLFQTT YSAM+TLIN GDFAVS
Subjt:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA
        AIVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGPGDVSQ RH NG+GNS++P  RLNSM   YSESDLESLDI+TMPTTTQLASYM+SYHKR+A
Subjt:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA

Query:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        FVMYLQMNPGGITIFGWTV+RAL+NTIFFIELTLVTFVLGKT+VF+
Subjt:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)8.9e-4930.54Show/hide
Query:  AHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQT
        +H    L      L ++    SS  +  LSWT F++  +V+P +   L  CA C+ Y  + ++  +  S + +A VS LCL+  + KYG++RFL  D+  
Subjt:  AHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQT

Query:  SSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIVFHLMCNLQLIHFDDYAKL
              R++Y  ++  S+ ++ ++ + CF   +  ++        RI +        ++A ++ +  SW Y + +     ++F L+C+LQ++   D+AKL
Subjt:  SSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIVFHLMCNLQLIHFDDYAKL

Query:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRW
         Q +S+V  ++ EH+ +R HL  ISHR+R F+L   ++VT SQF +L  TT     + +   G+ A+ ++  V  +++ L  A+KI+H+AQ    +A++W
Subjt:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRW

Query:  HALVTCGPGD---------VSQP----RHQNGNGNSDAPSSRLNSMTSNY--SESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGW
        H   T    D         V  P    R+ N N N     +   S +  Y   E DL++ DI+ +      A    S+ KRQA V Y + N  GIT++G+
Subjt:  HALVTCGPGD---------VSQP----RHQNGNGNSDAPSSRLNSMTSNY--SESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGW

Query:  TVDRALMNTIFFIELTLVTFVLGKTIVFS
        T+DR  ++TIF +EL+LV ++LGKTI  S
Subjt:  TVDRALMNTIFFIELTLVTFVLGKTIVFS

AT1G67570.1 Protein of unknown function (DUF3537)2.1e-13058.17Show/hide
Query:  EDPPARIP-LLESIQSQNSEPT---------GREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF
        + P    P LL S Q     P+         G  +     LD  L+ L+  L  LGF+QSS  S  LSW  F+ +G+VLPV VL+L  C  CE+YQ K F
Subjt:  EDPPARIP-LLESIQSQNSEPT---------GREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        EL+IV SQA LA VSLLC+SHNLRK+GI++FL VD+ +  + R +  Y+++IL S+RLL  W+L CF LK VRE+IR+ Y       +S+AILL+M +SW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TY+S I ++ + +FHL+CNLQ+IHF+DYAKLL+ ESE+ L I EH+ LR++LSKISHRFRIFLLLQFLVVTASQF TLFQTT YS  IT INGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLD-ILTMPTTTQLASY--MTSYHKR
        +VQVVG+ILCLH ATKISHRAQ  ASVASRWHA+++C   D +Q R      + +A ++   S   + S+SD+ES+D  + MP T Q  SY  M+SYHKR
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLD-ILTMPTTTQLASY--MTSYHKR

Query:  QAFVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVF
        QAFV+YLQMNPGGITIFGWTVDR L+NTIFFIEL+LVTFVLGKT+VF
Subjt:  QAFVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVF

AT3G20300.1 Protein of unknown function (DUF3537)8.4e-4730.33Show/hide
Query:  AHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQT
        +H    L      L ++   QSS  +  LSW+ FV+  +V+P     +  C+ C+ +  + ++  +  S +  AA+S LCLS  + KYG++RFL  D+  
Subjt:  AHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQT

Query:  SSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIVFHLMCNLQLIHFDDYAKL
              R  Y  ++  S+++L ++   CF+  +  ++        +I +   V    ++A L+ +  SW Y + +     ++F L+C+LQ++   D+A++
Subjt:  SSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIVFHLMCNLQLIHFDDYAKL

Query:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRW
         Q +S+V  ++ EH+ +R HL  ISHR+R F+LL  ++VT SQF +L  TT   A + +   G+ A+ ++  V  +++ L  A+KI+H+AQ    +A++W
Subjt:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRW

Query:  HALVTC---GPGDVSQPR---HQNGNGNSDAPSSRLNSMTSNY--SESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGWTVDRALM
        H   T       D   PR     +G+G          S + +Y   E D ++ +++        A    S+ KRQA V Y + N  GIT+FG+T+DR+ +
Subjt:  HALVTC---GPGDVSQPR---HQNGNGNSDAPSSRLNSMTSNY--SESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGWTVDRALM

Query:  NTIFFIELTLVTFVLGKTIVFS
        +TIF IE++LV ++LGKTI  S
Subjt:  NTIFFIELTLVTFVLGKTIVFS

AT4G03820.1 Protein of unknown function (DUF3537)7.1e-3829.66Show/hide
Query:  ESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDFELDIVASQACLAAVSLLCLS
        ES   Q      R D  +   +S    L     FL F QS+ +   LSW+ F L+ +++P++   +  CA C+    + ++  +  S +  A +S + LS
Subjt:  ESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDFELDIVASQACLAAVSLLCLS

Query:  HNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIV
           +KYGI+RFL  D+      + R  Y  KI  S++LL  + L    L+A+  +        +I Y    +    +A  L ++ SW Y + + I   I+
Subjt:  HNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIV

Query:  FHLMCNLQLIHFDDYAKLLQTE-SEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLH
        +  +C+LQ++  D++A+   +E  +   ++ EH+ +R  L  +SHRFR F+LL    VTA+QFM L  T   S    +   G+ A+ +   V G+ +CL 
Subjt:  FHLMCNLQLIHFDDYAKLLQTE-SEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLH

Query:  GATKISHRAQGTASVASRWHALVTCGPGDV----SQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNP
         AT+++H+AQ   S+A++W+   +    DV      P+      +S   S R N + S+  + + E  D         + +   S  KRQA V YL+ N 
Subjt:  GATKISHRAQGTASVASRWHALVTCGPGDV----SQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNP

Query:  GGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIV
         GIT++G+ VD+  +  IF IEL L+ ++L KTI+
Subjt:  GGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIV

AT4G22270.1 Protein of unknown function (DUF3537)1.7e-4432.35Show/hide
Query:  LGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILG
        L F QS+  +  LSW+ F L+ +++P++   L  C+ C+ +  + +++ +  S +  A +S + LS   RK+G++RFL +D+      + R +Y  +I  
Subjt:  LGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILG

Query:  SIRLLVFWALACFILKAVREVIRILYAERVSWGIS------------IAILLAMTI---SWTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTE-SEV
        S++ L+ + L    L+A           R+ W IS            ++ ++A T+   SW Y + + I   I++ + C+LQ +  DD+A+   +E ++V
Subjt:  SIRLLVFWALACFILKAVREVIRILYAERVSWGIS------------IAILLAMTI---SWTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTE-SEV

Query:  LLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCG
           + EH  +R +L  +SHRFR F+LL  ++VTA+QFM L  TT  S  + +   G+ A+ ++  V GV +CL  ATKI+H+AQ   S+A++W+   T  
Subjt:  LLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCG

Query:  PGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVL
          D        G+      S R N++ ++  E   E  D L       + +   SY KRQA V YL+ N  GIT++G+ VDR+ +NTIF IEL L+ ++L
Subjt:  PGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVL

Query:  GKTIV
         KTIV
Subjt:  GKTIV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAGCGCCGTGATCAAGTGGACATTGAGGACCCTCCCGCCCGTATTCCACTTCTCGAGTCAATTCAGAGTCAGAACTCAGAACCGACAGGGCGAGAAGAC
GACGAAGAAGCCCATTTGGACAGTGCCCTCCAACTGCTCGACGTATTGCTTGCTTTTCTGGGCTTCCACCAATCCTCTGTGCTGAGCTGTGCCCTGTCTTGGACT
GGTTTTGTTCTTGTCGGTATTGTGCTGCCGGTTGTGGTGCTCCAGCTCACCGATTGCGCCACCTGCGAGAAGTACCAGATTAAGGATTTTGAGCTCGACATAGTT
GCCTCGCAAGCTTGTCTTGCAGCTGTGTCTCTGCTCTGTCTCTCTCACAACCTCAGAAAATATGGTATAAAGAGGTTCCTCACTGTTGATAGGCAGACGAGTTCT
TTGGCTCGGTTCCGCAAAGACTATGTCGAGAAGATACTGGGTTCGATACGCTTACTTGTCTTCTGGGCGCTAGCATGTTTCATTCTGAAGGCTGTACGAGAGGTG
ATTCGAATATTATATGCGGAACGTGTGTCGTGGGGGATATCAATTGCTATCTTACTGGCTATGACCATATCCTGGACTTATGTGAGCTTGATCTCTATTTCAACC
GCCATTGTGTTCCATTTGATGTGCAATTTGCAACTCATCCACTTCGATGACTATGCAAAGCTACTGCAAACAGAGTCTGAAGTTTTGTTATTAATAGAGGAACAT
ATCTTCCTACGCTATCATTTGTCCAAGATAAGCCACAGATTCCGAATCTTTCTTCTTCTACAGTTCTTGGTTGTTACTGCAAGCCAGTTTATGACCCTGTTCCAG
ACAACGGGGTATAGTGCTATGATCACCCTCATTAACGGTGGAGATTTTGCAGTCTCGGCAATTGTTCAAGTGGTTGGAGTTATTCTTTGCTTGCACGGAGCTACA
AAAATTTCCCACAGAGCCCAGGGAACCGCATCAGTAGCTAGTAGATGGCATGCTTTAGTCACTTGTGGCCCGGGCGATGTATCTCAACCTCGACATCAAAATGGT
AACGGGAACTCAGATGCTCCCAGTAGTAGACTGAACTCAATGACTAGCAATTACTCTGAAAGTGATTTGGAGTCTTTGGATATTCTTACAATGCCTACAACTACG
CAGCTGGCTTCTTATATGACCTCCTATCATAAAAGACAAGCATTTGTTATGTATTTGCAGATGAATCCTGGAGGAATTACCATTTTTGGGTGGACAGTTGATAGA
GCTCTGATGAACACCATCTTCTTTATTGAACTCACACTGGTCACCTTTGTGCTAGGGAAAACGATAGTTTTCTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAGCGCCGTGATCAAGTGGACATTGAGGACCCTCCCGCCCGTATTCCACTTCTCGAGTCAATTCAGAGTCAGAACTCAGAACCGACAGGGCGAGAAGAC
GACGAAGAAGCCCATTTGGACAGTGCCCTCCAACTGCTCGACGTATTGCTTGCTTTTCTGGGCTTCCACCAATCCTCTGTGCTGAGCTGTGCCCTGTCTTGGACT
GGTTTTGTTCTTGTCGGTATTGTGCTGCCGGTTGTGGTGCTCCAGCTCACCGATTGCGCCACCTGCGAGAAGTACCAGATTAAGGATTTTGAGCTCGACATAGTT
GCCTCGCAAGCTTGTCTTGCAGCTGTGTCTCTGCTCTGTCTCTCTCACAACCTCAGAAAATATGGTATAAAGAGGTTCCTCACTGTTGATAGGCAGACGAGTTCT
TTGGCTCGGTTCCGCAAAGACTATGTCGAGAAGATACTGGGTTCGATACGCTTACTTGTCTTCTGGGCGCTAGCATGTTTCATTCTGAAGGCTGTACGAGAGGTG
ATTCGAATATTATATGCGGAACGTGTGTCGTGGGGGATATCAATTGCTATCTTACTGGCTATGACCATATCCTGGACTTATGTGAGCTTGATCTCTATTTCAACC
GCCATTGTGTTCCATTTGATGTGCAATTTGCAACTCATCCACTTCGATGACTATGCAAAGCTACTGCAAACAGAGTCTGAAGTTTTGTTATTAATAGAGGAACAT
ATCTTCCTACGCTATCATTTGTCCAAGATAAGCCACAGATTCCGAATCTTTCTTCTTCTACAGTTCTTGGTTGTTACTGCAAGCCAGTTTATGACCCTGTTCCAG
ACAACGGGGTATAGTGCTATGATCACCCTCATTAACGGTGGAGATTTTGCAGTCTCGGCAATTGTTCAAGTGGTTGGAGTTATTCTTTGCTTGCACGGAGCTACA
AAAATTTCCCACAGAGCCCAGGGAACCGCATCAGTAGCTAGTAGATGGCATGCTTTAGTCACTTGTGGCCCGGGCGATGTATCTCAACCTCGACATCAAAATGGT
AACGGGAACTCAGATGCTCCCAGTAGTAGACTGAACTCAATGACTAGCAATTACTCTGAAAGTGATTTGGAGTCTTTGGATATTCTTACAATGCCTACAACTACG
CAGCTGGCTTCTTATATGACCTCCTATCATAAAAGACAAGCATTTGTTATGTATTTGCAGATGAATCCTGGAGGAATTACCATTTTTGGGTGGACAGTTGATAGA
GCTCTGATGAACACCATCTTCTTTATTGAACTCACACTGGTCACCTTTGTGCTAGGGAAAACGATAGTTTTCTCCTGA
Protein sequenceShow/hide protein sequence
MAERRDQVDIEDPPARIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCATCEKYQIKDFELDIV
ASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISWTYVSLISIST
AIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGAT
KISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGWTVDR
ALMNTIFFIELTLVTFVLGKTIVFS