; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000964 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000964
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationscaffold36:501259..503490
RNA-Seq ExpressionMS000964
SyntenyMS000964
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598276.1 hypothetical protein SDJN03_08054, partial [Cucurbita argyrosperma subsp. sororia]2.3e-20887Show/hide
Query:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKD
        MA+ RD VDI+DP AH+PLL+S Q+Q+S+ TGRE+D EEAHLDSAL+L D LL FLGFHQSSVLSC LSW+GFVLVGIVLPVV+LQLTDCAAC++YQIKD
Subjt:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYGIKRFL+VDRQ SSLA FRKDYV+KI GSIRLLVFWAL CFILKA REVIRILYAERVSWG+S+AILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS

Query:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS
        WTY+SLIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL FLVVTASQFMTLFQTT YSAM+TLIN GDFAVS
Subjt:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA
        AIVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGPGDVSQ RH NG+GNS++P  RLNSMT  YSESDLESLDI+TMPTTTQLASYM+SYHKR+A
Subjt:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA

Query:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        FVMYLQMNPGGITIFGWTV+RAL+NTIFFIELTLVTFVLGKT+VF+
Subjt:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

XP_022131489.1 uncharacterized protein LOC111004677 isoform X1 [Momordica charantia]9.1e-21399.5Show/hide
Query:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF
        MAERRDQVDIEDPPA IPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCA CEKYQIKDF
Subjt:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
        IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF

XP_022131491.1 uncharacterized protein LOC111004677 isoform X2 [Momordica charantia]2.8e-23899.55Show/hide
Query:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF
        MAERRDQVDIEDPPA IPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCA CEKYQIKDF
Subjt:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
        IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
Subjt:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

XP_022996746.1 uncharacterized protein LOC111491891 [Cucurbita maxima]5.7e-20786.32Show/hide
Query:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKD
        MA++RD VDI+DP AH+PLL+S Q+Q+S+ TGRE+D EEAHLDSAL+L D LL FLGFHQSSVLSC LSW+GFVLVGIVLPVV+LQLTDCAAC++YQIKD
Subjt:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYGIKRFL+VDRQ SSLA FRKDYV+KI GSIRLLVFWAL CFILKA REVIRILY ERV+WG+S+AILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS

Query:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS
        WTY+SLIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL FLVVTASQFMTLFQTT YSAM+TLIN GDFAVS
Subjt:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA
        AIVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGPGDVSQ RH NG+GNS++P  RLNSM   YSESDLESLDI+TMPTTTQLASYM+SYHKR+A
Subjt:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA

Query:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        FVMYLQMNPGGITIFGWTV+RAL+NTIFFIELTLVTFVLGKT+VF+
Subjt:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

XP_023547131.1 uncharacterized protein LOC111806032 [Cucurbita pepo subsp. pepo]8.8e-20886.77Show/hide
Query:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKD
        MA+ RD VDI+DP AH+PLL+S Q+Q+S+ TGRE+D EEAHLDSAL+L D LL FLGFHQSSVLSC LSW+GFVLVGIVLPVV+LQLTDCAAC++YQIKD
Subjt:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYGIKRFL+VDRQ SSLA FRKDYV+KI GSIRLLVFWAL CFILKA REVIRILYAERVSWG+S+AILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS

Query:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS
        WTY+SLIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL FLVVTASQFMTLFQTT YSA++TLIN GDFAVS
Subjt:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA
        AIVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGPGDVSQ RH NG+GNS++P  RLNSMT  YSESDLESLDI+TMPTTTQLASYM+SYHKR+A
Subjt:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA

Query:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        FVMYLQMNPGGITIFGWTV+RAL+NTIFFIELTLVTFVLGKT+VF+
Subjt:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

TrEMBL top hitse value%identityAlignment
A0A5A7V5N0 Uncharacterized protein1.5e-20082.25Show/hide
Query:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF
        MAE RDQVD+EDP AHIPLLES  +Q+S+PT  EDDEEAHLDSA +L D LL  LGFHQSSV SC LSW+ FVLVG+VLPVVVLQL+DCAA EKYQIKDF
Subjt:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        ELD+VAS ACLAAVSLLCLSHNLRKYGI RFL+VDRQ +SLARFRK+YV+KI GSIRLL+FWAL CF+LK  REVIRILYAER+SWG+S+A +LAM ISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TY++LIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVL+LIE+HIFLRYHLSKISHRFRIFLLL FLVV+A+QFMTLFQTT Y+  +TL+NGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
        IVQVVGVILCLHGATKISHRA+G ASVASRWHALVTCGPG+VSQPR+ NGNGNSD+P  RL SMT  YSESDLESLDI+TMPTTTQLASYM+SYHKR+AF
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        VMYLQMNPGGITIFGWTV+RAL+NTIFF+ELTLVTFVLGKT+VF+
Subjt:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

A0A6J1BPM9 uncharacterized protein LOC111004677 isoform X21.4e-23899.55Show/hide
Query:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF
        MAERRDQVDIEDPPA IPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCA CEKYQIKDF
Subjt:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
        IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF

Query:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
Subjt:  VMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

A0A6J1BQD8 uncharacterized protein LOC111004677 isoform X14.4e-21399.5Show/hide
Query:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF
        MAERRDQVDIEDPPA IPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCA CEKYQIKDF
Subjt:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
        IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAF

A0A6J1HD22 uncharacterized protein LOC1114628593.6e-20786.55Show/hide
Query:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKD
        MA+ RD VDI+DP AH+PLL+S Q+Q+S+ T RE+D EEAHLDSAL+L D LL FLGFHQSSVLSC LSW+GFVLVGIVLPVV+LQLTDCAAC+++QIKD
Subjt:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYGIKRFL+VDRQ SSLA FRKDYV+KI GSIRLLVFWAL CFILKA REVIRILYAERVSWG+S+AILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS

Query:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS
        WTY+SLIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL FLVVTASQFMTLFQTT YSAM+TLIN GDFAVS
Subjt:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA
        AIVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGPGDVSQ RH NG+GNS++P  RLNSMT  YSESDLESLDI+TMPTTTQLASYM+SYHKR+A
Subjt:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA

Query:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        FVMYLQMNPGGITIFGWTV+RAL+NTIFFIELTLVTFVLGKT+VF+
Subjt:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

A0A6J1K5M7 uncharacterized protein LOC1114918912.8e-20786.32Show/hide
Query:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKD
        MA++RD VDI+DP AH+PLL+S Q+Q+S+ TGRE+D EEAHLDSAL+L D LL FLGFHQSSVLSC LSW+GFVLVGIVLPVV+LQLTDCAAC++YQIKD
Subjt:  MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDD-EEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKD

Query:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS
        FELDIVASQACLAAVSLLCLSHNLRKYGIKRFL+VDRQ SSLA FRKDYV+KI GSIRLLVFWAL CFILKA REVIRILY ERV+WG+S+AILLAMTIS
Subjt:  FELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTIS

Query:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS
        WTY+SLIS+S AIVFHLMCNLQ+IHFD+YAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLL FLVVTASQFMTLFQTT YSAM+TLIN GDFAVS
Subjt:  WTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVS

Query:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA
        AIVQVVGVILCLHGATKISHRAQG ASVASRWHALVTCGPGDVSQ RH NG+GNS++P  RLNSM   YSESDLESLDI+TMPTTTQLASYM+SYHKR+A
Subjt:  AIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQA

Query:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS
        FVMYLQMNPGGITIFGWTV+RAL+NTIFFIELTLVTFVLGKT+VF+
Subjt:  FVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)1.2e-4830.54Show/hide
Query:  AHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQT
        +H    L      L ++    SS  +  LSWT F++  +V+P +   L  CA C+ Y  + ++  +  S + +A VS LCL+  + KYG++RFL  D+  
Subjt:  AHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQT

Query:  SSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIVFHLMCNLQLIHFDDYAKL
              R++Y  ++  S+ ++ ++ + CF   +  ++        RI +        ++A ++ +  SW Y + +     ++F L+C+LQ++   D+AKL
Subjt:  SSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIVFHLMCNLQLIHFDDYAKL

Query:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRW
         Q +S+V  ++ EH+ +R HL  ISHR+R F+L   ++VT SQF +L  TT     + +   G+ A+ ++  V  +++ L  A+KI+H+AQ    +A++W
Subjt:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRW

Query:  HALVTCGPGD---------VSQP----RHQNGNGNSDAPSSRLNSMTSNY--SESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGW
        H   T    D         V  P    R+ N N N     +   S +  Y   E DL++ DI+ +      A    S+ KRQA V Y + N  GIT++G+
Subjt:  HALVTCGPGD---------VSQP----RHQNGNGNSDAPSSRLNSMTSNY--SESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGW

Query:  TVDRALMNTIFFIELTLVTFVLGKTIVFS
        T+DR  ++TIF +EL+LV ++LGKTI  S
Subjt:  TVDRALMNTIFFIELTLVTFVLGKTIVFS

AT1G67570.1 Protein of unknown function (DUF3537)1.2e-13058.17Show/hide
Query:  EDPPAHIP-LLESIQSQNSEPT---------GREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF
        + P    P LL S Q     P+         G  +     LD  L+ L+  L  LGF+QSS  S  LSW  F+ +G+VLPV VL+L  C  CE+YQ K F
Subjt:  EDPPAHIP-LLESIQSQNSEPT---------GREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDF

Query:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW
        EL+IV SQA LA VSLLC+SHNLRK+GI++FL VD+ +  + R +  Y+++IL S+RLL  W+L CF LK VRE+IR+ Y       +S+AILL+M +SW
Subjt:  ELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISW

Query:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA
        TY+S I ++ + +FHL+CNLQ+IHF+DYAKLL+ ESE+ L I EH+ LR++LSKISHRFRIFLLLQFLVVTASQF TLFQTT YS  IT INGGDFAVSA
Subjt:  TYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSA

Query:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLD-ILTMPTTTQLASY--MTSYHKR
        +VQVVG+ILCLH ATKISHRAQ  ASVASRWHA+++C   D +Q R      + +A ++   S   + S+SD+ES+D  + MP T Q  SY  M+SYHKR
Subjt:  IVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLD-ILTMPTTTQLASY--MTSYHKR

Query:  QAFVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVF
        QAFV+YLQMNPGGITIFGWTVDR L+NTIFFIEL+LVTFVLGKT+VF
Subjt:  QAFVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGKTIVF

AT3G20300.1 Protein of unknown function (DUF3537)1.1e-4630.33Show/hide
Query:  AHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQT
        +H    L      L ++   QSS  +  LSW+ FV+  +V+P     +  C+ C+ +  + ++  +  S +  AA+S LCLS  + KYG++RFL  D+  
Subjt:  AHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQT

Query:  SSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIVFHLMCNLQLIHFDDYAKL
              R  Y  ++  S+++L ++   CF+  +  ++        +I +   V    ++A L+ +  SW Y + +     ++F L+C+LQ++   D+A++
Subjt:  SSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIVFHLMCNLQLIHFDDYAKL

Query:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRW
         Q +S+V  ++ EH+ +R HL  ISHR+R F+LL  ++VT SQF +L  TT   A + +   G+ A+ ++  V  +++ L  A+KI+H+AQ    +A++W
Subjt:  LQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRW

Query:  HALVTC---GPGDVSQPR---HQNGNGNSDAPSSRLNSMTSNY--SESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGWTVDRALM
        H   T       D   PR     +G+G          S + +Y   E D ++ +++        A    S+ KRQA V Y + N  GIT+FG+T+DR+ +
Subjt:  HALVTC---GPGDVSQPR---HQNGNGNSDAPSSRLNSMTSNY--SESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGWTVDRALM

Query:  NTIFFIELTLVTFVLGKTIVFS
        +TIF IE++LV ++LGKTI  S
Subjt:  NTIFFIELTLVTFVLGKTIVFS

AT4G03820.2 Protein of unknown function (DUF3537)9.3e-3829.72Show/hide
Query:  ESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDFELDIVASQACLAAVSLLCLS
        ES   Q      R D  +   +S    L     FL F QS+ +   LSW+ F L+ +++P++   +  CA C+    + ++  +  S +  A +S + LS
Subjt:  ESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDFELDIVASQACLAAVSLLCLS

Query:  HNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIV
           +KYGI+RFL  D+      + R  Y  KI  S++LL  + L    L+A+  +        +I Y    +    +A  L ++ SW Y + + I   I+
Subjt:  HNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREV-------IRILYAERVSWGISIAILLAMTISWTYVSLISISTAIV

Query:  FHLMCNLQLIHFDDYAKLLQTE-SEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLH
        +  +C+LQ++  D++A+   +E  +   ++ EH+ +R  L  +SHRFR F+LL    VTA+QFM L  T   S    +   G+ A+ +   V G+ +CL 
Subjt:  FHLMCNLQLIHFDDYAKLLQTE-SEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLH

Query:  GATKISHRAQGTASVASRWHALVTCGPGDV----SQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNP
         AT+++H+AQ   S+A++W+   +    DV      P+      +S   S R N + S+  + + E  D         + +   S  KRQA V YL+ N 
Subjt:  GATKISHRAQGTASVASRWHALVTCGPGDV----SQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNP

Query:  GGITIFGWTVDRALMNTIFFIELTLVTFVLGKTI
         GIT++G+ VD+  +  IF IEL L+ ++L KTI
Subjt:  GGITIFGWTVDRALMNTIFFIELTLVTFVLGKTI

AT4G22270.1 Protein of unknown function (DUF3537)2.3e-4432.35Show/hide
Query:  LGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILG
        L F QS+  +  LSW+ F L+ +++P++   L  C+ C+ +  + +++ +  S +  A +S + LS   RK+G++RFL +D+      + R +Y  +I  
Subjt:  LGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDFELDIVASQACLAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILG

Query:  SIRLLVFWALACFILKAVREVIRILYAERVSWGIS------------IAILLAMTI---SWTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTE-SEV
        S++ L+ + L    L+A           R+ W IS            ++ ++A T+   SW Y + + I   I++ + C+LQ +  DD+A+   +E ++V
Subjt:  SIRLLVFWALACFILKAVREVIRILYAERVSWGIS------------IAILLAMTI---SWTYVSLISISTAIVFHLMCNLQLIHFDDYAKLLQTE-SEV

Query:  LLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCG
           + EH  +R +L  +SHRFR F+LL  ++VTA+QFM L  TT  S  + +   G+ A+ ++  V GV +CL  ATKI+H+AQ   S+A++W+   T  
Subjt:  LLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASRWHALVTCG

Query:  PGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVL
          D        G+      S R N++ ++  E   E  D L       + +   SY KRQA V YL+ N  GIT++G+ VDR+ +NTIF IEL L+ ++L
Subjt:  PGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVL

Query:  GKTIV
         KTIV
Subjt:  GKTIV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGCGCCGTGATCAAGTGGACATTGAGGACCCTCCCGCCCATATTCCACTTCTCGAGTCAATTCAGAGTCAGAACTCAGAACCGACAGGGCGAGAAGACGACGA
AGAAGCCCATTTGGACAGTGCCCTCCAACTGCTCGACGTATTGCTTGCTTTTCTGGGCTTCCACCAATCCTCTGTGCTGAGCTGTGCCCTGTCTTGGACTGGTTTTGTTC
TTGTCGGTATTGTGCTGCCGGTTGTGGTGCTCCAGCTCACCGATTGCGCCGCCTGCGAGAAGTACCAGATTAAGGATTTTGAGCTCGACATAGTTGCCTCACAAGCTTGT
CTTGCAGCTGTGTCTCTGCTCTGTCTCTCTCACAACCTCAGAAAATATGGTATAAAGAGGTTCCTCACTGTTGATAGGCAGACGAGTTCTTTGGCTCGGTTCCGCAAAGA
CTATGTCGAGAAGATACTGGGTTCGATACGCTTACTTGTCTTCTGGGCGCTAGCATGTTTCATTCTGAAGGCTGTACGAGAGGTGATTCGAATATTATATGCGGAACGTG
TGTCGTGGGGGATATCAATTGCTATCTTACTGGCTATGACCATATCCTGGACTTATGTGAGCTTGATCTCTATTTCAACCGCCATTGTGTTCCATTTGATGTGCAATTTG
CAACTCATCCACTTCGATGACTATGCAAAGCTACTGCAAACAGAGTCTGAAGTTTTGTTATTAATAGAGGAACATATCTTCCTACGCTATCATTTGTCCAAGATAAGCCA
CAGATTCCGAATCTTTCTTCTTCTACAGTTCTTGGTTGTTACTGCAAGCCAGTTTATGACTCTGTTCCAGACAACGGGGTATAGTGCTATGATCACCCTCATTAACGGTG
GAGATTTTGCAGTCTCGGCAATTGTTCAAGTGGTTGGAGTTATTCTTTGCTTGCACGGAGCTACAAAAATTTCCCACAGAGCCCAGGGAACCGCATCAGTAGCTAGTAGA
TGGCATGCTTTAGTCACTTGTGGCCCGGGCGATGTATCTCAACCTCGACATCAAAATGGTAACGGGAACTCAGATGCTCCCAGTAGTAGACTGAACTCAATGACTAGCAA
TTACTCTGAAAGTGATTTGGAGTCTTTGGATATTCTTACAATGCCTACAACTACGCAGCTGGCTTCTTATATGACCTCCTATCATAAAAGACAAGCATTTGTTATGTATT
TGCAGATGAATCCTGGAGGAATTACCATTTTTGGGTGGACAGTTGATAGAGCTCTGATGAACACCATCTTCTTTATTGAACTCACACTGGTCACCTTTGTGCTAGGGAAA
ACGATAGTTTTCTCC
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAGCGCCGTGATCAAGTGGACATTGAGGACCCTCCCGCCCATATTCCACTTCTCGAGTCAATTCAGAGTCAGAACTCAGAACCGACAGGGCGAGAAGACGACGA
AGAAGCCCATTTGGACAGTGCCCTCCAACTGCTCGACGTATTGCTTGCTTTTCTGGGCTTCCACCAATCCTCTGTGCTGAGCTGTGCCCTGTCTTGGACTGGTTTTGTTC
TTGTCGGTATTGTGCTGCCGGTTGTGGTGCTCCAGCTCACCGATTGCGCCGCCTGCGAGAAGTACCAGATTAAGGATTTTGAGCTCGACATAGTTGCCTCACAAGCTTGT
CTTGCAGCTGTGTCTCTGCTCTGTCTCTCTCACAACCTCAGAAAATATGGTATAAAGAGGTTCCTCACTGTTGATAGGCAGACGAGTTCTTTGGCTCGGTTCCGCAAAGA
CTATGTCGAGAAGATACTGGGTTCGATACGCTTACTTGTCTTCTGGGCGCTAGCATGTTTCATTCTGAAGGCTGTACGAGAGGTGATTCGAATATTATATGCGGAACGTG
TGTCGTGGGGGATATCAATTGCTATCTTACTGGCTATGACCATATCCTGGACTTATGTGAGCTTGATCTCTATTTCAACCGCCATTGTGTTCCATTTGATGTGCAATTTG
CAACTCATCCACTTCGATGACTATGCAAAGCTACTGCAAACAGAGTCTGAAGTTTTGTTATTAATAGAGGAACATATCTTCCTACGCTATCATTTGTCCAAGATAAGCCA
CAGATTCCGAATCTTTCTTCTTCTACAGTTCTTGGTTGTTACTGCAAGCCAGTTTATGACTCTGTTCCAGACAACGGGGTATAGTGCTATGATCACCCTCATTAACGGTG
GAGATTTTGCAGTCTCGGCAATTGTTCAAGTGGTTGGAGTTATTCTTTGCTTGCACGGAGCTACAAAAATTTCCCACAGAGCCCAGGGAACCGCATCAGTAGCTAGTAGA
TGGCATGCTTTAGTCACTTGTGGCCCGGGCGATGTATCTCAACCTCGACATCAAAATGGTAACGGGAACTCAGATGCTCCCAGTAGTAGACTGAACTCAATGACTAGCAA
TTACTCTGAAAGTGATTTGGAGTCTTTGGATATTCTTACAATGCCTACAACTACGCAGCTGGCTTCTTATATGACCTCCTATCATAAAAGACAAGCATTTGTTATGTATT
TGCAGATGAATCCTGGAGGAATTACCATTTTTGGGTGGACAGTTGATAGAGCTCTGATGAACACCATCTTCTTTATTGAACTCACACTGGTCACCTTTGTGCTAGGGAAA
ACGATAGTTTTCTCC
Protein sequenceShow/hide protein sequence
MAERRDQVDIEDPPAHIPLLESIQSQNSEPTGREDDEEAHLDSALQLLDVLLAFLGFHQSSVLSCALSWTGFVLVGIVLPVVVLQLTDCAACEKYQIKDFELDIVASQAC
LAAVSLLCLSHNLRKYGIKRFLTVDRQTSSLARFRKDYVEKILGSIRLLVFWALACFILKAVREVIRILYAERVSWGISIAILLAMTISWTYVSLISISTAIVFHLMCNL
QLIHFDDYAKLLQTESEVLLLIEEHIFLRYHLSKISHRFRIFLLLQFLVVTASQFMTLFQTTGYSAMITLINGGDFAVSAIVQVVGVILCLHGATKISHRAQGTASVASR
WHALVTCGPGDVSQPRHQNGNGNSDAPSSRLNSMTSNYSESDLESLDILTMPTTTQLASYMTSYHKRQAFVMYLQMNPGGITIFGWTVDRALMNTIFFIELTLVTFVLGK
TIVFS