; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g29780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g29780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPlant protein of unknown function (DUF639)
Genome locationchr4:22134576..22155139
RNA-Seq ExpressionMoc04g29780
SyntenyMoc04g29780
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006927 - Protein of unknown function DUF639


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579090.1 hypothetical protein SDJN03_23538, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0085.9Show/hide
Query:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
        MPKK K  M++TLMKNQPNTFRSIFQRKKS+N ++ SP+DSPKSIP LS  ANSVV RCSKILQ+STEE+  LFDSELPGINKEPETYSRSLLEFCSYQ 
Subjt:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA

Query:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA
        LYS+ KRPDYLS+KDFRRL YD+MLAWE PGS SEPL QFDDKKTVG EAFARIAPAC ALADIITVHNLFDSLTSSSG RLH+LVFDKYIRSLDK+IKA
Subjt:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA

Query:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST
        TKN+LHPSTGNLHLSEGEIVLE+DGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAV+YDL AD KQ+IKPELTGPLGARLFDKAVMYKST
Subjt:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST

Query:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL
        SV EPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRK+NLNE QKSEVLARAVFGIFRIRAIREAFHVFSSHYRT+LTFNL ESLP GDSILETLLSRL
Subjt:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL

Query:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
        +++N D  Q D SGSP AKQ R+ +P+FLLALSQLGFTLQKEI++E D  L+GD+W GETNPLEI VRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
Subjt:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK

Query:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ
        ELLFPF+EL   +QILASWED+ KST FLLLFCYAI+RNW RFILPC LV LA LMLCRR FGKSKPLEPF +TSPPNRNAVEQLLTLQEVITQVEA IQ
Subjt:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ

Query:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS
        DGNI LLKIRALLFAVLPQATD VAL+L+ AAL+FAFLPFKYI+ML LVEAYTREMPYRKETSNK+ RRAREWWI+IPAAPVQLVK DD+KKKKS
Subjt:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS

XP_022141676.1 uncharacterized protein LOC111011981 isoform X1 [Momordica charantia]0.0e+0095.99Show/hide
Query:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
        MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
Subjt:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA

Query:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLL-----------------------------QFDDKKTVGSEAFARIAPACIALADIITVHNLF
        LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLL                             QFDDKKTVGSEAFARIAPACIALADIITVHNLF
Subjt:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLL-----------------------------QFDDKKTVGSEAFARIAPACIALADIITVHNLF

Query:  DSLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLG
        DSLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLG
Subjt:  DSLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLG

Query:  ADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFS
        ADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFS
Subjt:  ADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFS

Query:  SHYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSI
        SHYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSI
Subjt:  SHYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSI

Query:  SDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPF
        SDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPF
Subjt:  SDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPF

Query:  MVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAR
        MVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAR
Subjt:  MVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAR

Query:  EWWIKIPAAPVQLVKPDDSKKKKS
        EWWIKIPAAPVQLVKPDDSKKKKS
Subjt:  EWWIKIPAAPVQLVKPDDSKKKKS

XP_022141677.1 uncharacterized protein LOC111011981 isoform X2 [Momordica charantia]0.0e+00100Show/hide
Query:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
        MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
Subjt:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA

Query:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA
        LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA
Subjt:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA

Query:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST
        TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST
Subjt:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST

Query:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL
        SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL
Subjt:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL

Query:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
        MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
Subjt:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK

Query:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ
        ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ
Subjt:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ

Query:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS
        DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS
Subjt:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS

XP_022938414.1 uncharacterized protein LOC111444665 isoform X2 [Cucurbita moschata]0.0e+0085.76Show/hide
Query:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
        MPKK K  M++TLMKNQPNTFRSIFQRKKS+N ++ SP+DSPKSIP LS  ANSVV RCSKILQ+STEE+  LFDSELPGINKEPETYSRSLLEFCSYQ 
Subjt:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA

Query:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA
        LYS+ KRPDYLS+KDFRRL YD+MLAWE PGS SEPL QFDDKKTVG EAFARIAPAC ALADIITVHNLFDSLTSSSG RLH+LVFDKYIRSLDK+IKA
Subjt:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA

Query:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST
        TKN+LHPSTGNLHLSEGEIVLE+DGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAV+YDL AD KQ+IKPELTGPLGARLFDKAVMYKST
Subjt:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST

Query:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL
        SV EPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRK+NLNE QKSEVLARAVFGIFRIRAIREAFHVFSSHYRT+LTFNL ESLP GDSILETLLSRL
Subjt:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL

Query:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
        +++N D  Q D SGSP AKQ R+ +P+FLLALSQLGFTLQKEI++E D  L+GD+W GETNPLEI VRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
Subjt:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK

Query:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ
        ELLFPF+EL   +QILASWED+ KST FLLLFCYAI+ NW RFILPC LV LA LMLCRR FGKSKPLEPF +TSPPNRNAVEQLLTLQEVITQVEA IQ
Subjt:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ

Query:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS
        DGNI LLKIRALLFAVLPQATD VAL+L+ AAL+FAFLPFKYI+ML LVEAYTREMPYRKETSNK+ RRAREWWI+IPAAPVQLVK DD+KKKKS
Subjt:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS

XP_022992982.1 uncharacterized protein LOC111489145 isoform X2 [Cucurbita maxima]0.0e+0085.9Show/hide
Query:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
        MPKK K  M++TLMKNQPNTFRSIFQRKKS+N ++ SP+DSPKSIP LS  ANSVV RCSKILQ+STEE+  LFDSELPGINKEPETYSRSLLEFCSYQ 
Subjt:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA

Query:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA
        LYS+ KRPDYLS+KDFRRL YD+MLAWE PGS SEPL QFDDKKTVG EAFARIAPAC ALADIITVHNLFDSLTSSSG RLH+LVFDKYIRSLDK+IKA
Subjt:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA

Query:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST
        TKN+LHPSTGNLHLSEGEIVLE+DGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAV+YDL AD KQ+IKPELTGPLGARLFDKAVMYKST
Subjt:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST

Query:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL
        SV EPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRK+NLNE QKSEVLARAVFGIFRIRAIREAFHVFSSHYRT+LTFNL ESLP GDSILETLLSRL
Subjt:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL

Query:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
        +++N D  Q D SGSP AKQ R+ +P+FLLALSQLGFTLQKEI++E D  L+GD+W GETNPLEI VRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
Subjt:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK

Query:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ
        ELL PF+EL   +QILASWED+ KST FLLLFCYAI+RNW RFILPCILV LA LMLCRR FGKSKPLEPF +TSPPNRNAVEQLLTLQEVITQVEA IQ
Subjt:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ

Query:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS
        DGNI LLKIRALLFAV PQATD VAL+LI AAL+FAFLPFKYI+ML LVEAYTREMPYRKETSNK+ RRAREWWI+IPAAPVQLVK DD+KKKKS
Subjt:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS

TrEMBL top hitse value%identityAlignment
A0A5D3CTC1 DUF639 domain-containing protein0.0e+0086.98Show/hide
Query:  KKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQALY
        KKVKV M+++L+KNQPNTFRSIFQRKKS+N ++ SPSDSPKSIP LS  ANSVV RCSKILQ+ TEE+  LFDSELPGINKEPETYSRSLLEF SYQ LY
Subjt:  KKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQALY

Query:  SIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKATK
        S+ +RPDYLSDKDFRRL YDMMLAWE PGS SEPL QFDDKKTVG EAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKATK
Subjt:  SIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKATK

Query:  NSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKSTSV
        N+LHPSTGNLHLSEGEI LE+DGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAV+YDL AD KQ+IKPELTGPLGARLFDKAVMYKSTSV
Subjt:  NSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKSTSV

Query:  IEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRLML
        I+PV+LEFPEFKGSSRRDYWLDICLE+LRAHKFIRKHNL+E+QKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNL ESLP GDSILETLL RL+L
Subjt:  IEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRLML

Query:  LNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMKEL
        +N    Q D SGSP AKQ R+  P FLLALSQLGFTL KEI +E D  LIGDVWVGE NPLEI VRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMKEL
Subjt:  LNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMKEL

Query:  LFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQDG
        LFPF+ELA R+QILASWEDS+KST FLLLFC+AI+RNWIRFILPCILVLL+ +MLCRR FGKSKPLEPF +TSPPNRNAVEQLLTLQEVITQVEA IQDG
Subjt:  LFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQDG

Query:  NILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKK
        NI LLKIRALLFAVLPQATD VAL+L+ AAL+FAFLPFKYIIMLVLVEAYTREMPYRKETSNK+ RRAREWWI+IPAAPVQLVKPDD KKK
Subjt:  NILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKK

A0A6J1CJY3 uncharacterized protein LOC111011981 isoform X10.0e+0095.99Show/hide
Query:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
        MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
Subjt:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA

Query:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLL-----------------------------QFDDKKTVGSEAFARIAPACIALADIITVHNLF
        LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLL                             QFDDKKTVGSEAFARIAPACIALADIITVHNLF
Subjt:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLL-----------------------------QFDDKKTVGSEAFARIAPACIALADIITVHNLF

Query:  DSLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLG
        DSLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLG
Subjt:  DSLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLG

Query:  ADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFS
        ADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFS
Subjt:  ADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFS

Query:  SHYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSI
        SHYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSI
Subjt:  SHYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSI

Query:  SDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPF
        SDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPF
Subjt:  SDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPF

Query:  MVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAR
        MVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAR
Subjt:  MVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAR

Query:  EWWIKIPAAPVQLVKPDDSKKKKS
        EWWIKIPAAPVQLVKPDDSKKKKS
Subjt:  EWWIKIPAAPVQLVKPDDSKKKKS

A0A6J1CKI2 uncharacterized protein LOC111011981 isoform X20.0e+00100Show/hide
Query:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
        MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
Subjt:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA

Query:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA
        LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA
Subjt:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA

Query:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST
        TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST
Subjt:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST

Query:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL
        SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL
Subjt:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL

Query:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
        MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
Subjt:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK

Query:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ
        ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ
Subjt:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ

Query:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS
        DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS
Subjt:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS

A0A6J1FE11 uncharacterized protein LOC111444665 isoform X20.0e+0085.76Show/hide
Query:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
        MPKK K  M++TLMKNQPNTFRSIFQRKKS+N ++ SP+DSPKSIP LS  ANSVV RCSKILQ+STEE+  LFDSELPGINKEPETYSRSLLEFCSYQ 
Subjt:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA

Query:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA
        LYS+ KRPDYLS+KDFRRL YD+MLAWE PGS SEPL QFDDKKTVG EAFARIAPAC ALADIITVHNLFDSLTSSSG RLH+LVFDKYIRSLDK+IKA
Subjt:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA

Query:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST
        TKN+LHPSTGNLHLSEGEIVLE+DGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAV+YDL AD KQ+IKPELTGPLGARLFDKAVMYKST
Subjt:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST

Query:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL
        SV EPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRK+NLNE QKSEVLARAVFGIFRIRAIREAFHVFSSHYRT+LTFNL ESLP GDSILETLLSRL
Subjt:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL

Query:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
        +++N D  Q D SGSP AKQ R+ +P+FLLALSQLGFTLQKEI++E D  L+GD+W GETNPLEI VRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
Subjt:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK

Query:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ
        ELLFPF+EL   +QILASWED+ KST FLLLFCYAI+ NW RFILPC LV LA LMLCRR FGKSKPLEPF +TSPPNRNAVEQLLTLQEVITQVEA IQ
Subjt:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ

Query:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS
        DGNI LLKIRALLFAVLPQATD VAL+L+ AAL+FAFLPFKYI+ML LVEAYTREMPYRKETSNK+ RRAREWWI+IPAAPVQLVK DD+KKKKS
Subjt:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS

A0A6J1JRG9 uncharacterized protein LOC111489145 isoform X20.0e+0085.9Show/hide
Query:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA
        MPKK K  M++TLMKNQPNTFRSIFQRKKS+N ++ SP+DSPKSIP LS  ANSVV RCSKILQ+STEE+  LFDSELPGINKEPETYSRSLLEFCSYQ 
Subjt:  MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQA

Query:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA
        LYS+ KRPDYLS+KDFRRL YD+MLAWE PGS SEPL QFDDKKTVG EAFARIAPAC ALADIITVHNLFDSLTSSSG RLH+LVFDKYIRSLDK+IKA
Subjt:  LYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKA

Query:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST
        TKN+LHPSTGNLHLSEGEIVLE+DGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAV+YDL AD KQ+IKPELTGPLGARLFDKAVMYKST
Subjt:  TKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKST

Query:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL
        SV EPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRK+NLNE QKSEVLARAVFGIFRIRAIREAFHVFSSHYRT+LTFNL ESLP GDSILETLLSRL
Subjt:  SVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRL

Query:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
        +++N D  Q D SGSP AKQ R+ +P+FLLALSQLGFTLQKEI++E D  L+GD+W GETNPLEI VRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK
Subjt:  MLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMK

Query:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ
        ELL PF+EL   +QILASWED+ KST FLLLFCYAI+RNW RFILPCILV LA LMLCRR FGKSKPLEPF +TSPPNRNAVEQLLTLQEVITQVEA IQ
Subjt:  ELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQ

Query:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS
        DGNI LLKIRALLFAV PQATD VAL+LI AAL+FAFLPFKYI+ML LVEAYTREMPYRKETSNK+ RRAREWWI+IPAAPVQLVK DD+KKKKS
Subjt:  DGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48840.1 Plant protein of unknown function (DUF639)1.4e-15446.08Show/hide
Query:  EPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQALYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGS
        E SPS     IP LS +AN V+ RCSKIL ++  E+   F  E     K+P  + R+ LE+C ++AL   +    +LSDK FRRLT+DMM+AWE P + S
Subjt:  EPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQALYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGS

Query:  EPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTS-SSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVL
        + LL  D+  TVG EAF+RIAPA   +AD+I   NLF  LTS S+  RL F V+DKY+  L++ IK  K+    S  +   S+GE +LE+DGTV TQPVL
Subjt:  EPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTS-SSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVL

Query:  QHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAH
        +HIGIS WPGRL LT H+LYFE++ V  +D   +Y L  D+KQ IKPELTGP G RLFDKAV YKS S+ EPV +EFPE KG +RRDYWL I LE+L  H
Subjt:  QHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAH

Query:  KFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETL--LSRLMLLNTDCFQHDGS-GSPSAKQHRRPYPVFLL
        ++I+K  +N V K E +++AV GI R++AI+E        Y  LL FNL + LP GD ILETL  +S   +L+      +G+  S SA            
Subjt:  KFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETL--LSRLMLLNTDCFQHDGS-GSPSAKQHRRPYPVFLL

Query:  ALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLL
         +SQLG              ++G+V VG+ NPLE AV+QS  +  +   AQ TV+ VKV+GIDTN+AVMKELL P  E+ N L  L  WED  KS  F L
Subjt:  ALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLL

Query:  LFCYAIVRNWIRFILPCILVLLAALMLCRRYF-GKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLI
        L  + I R WI ++     + +A  M+  RYF  + K +    V +PP  N +EQLL +Q  I+Q+E  IQD NI+LLK RALL ++ PQA++  A+ ++
Subjt:  LFCYAIVRNWIRFILPCILVLLAALMLCRRYF-GKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLI

Query:  LAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKK
        +AA + A +P+  +I++V +E +TR  P R+ ++ +  RR +EWW  IPAAPV L +  D  KK
Subjt:  LAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKK

AT1G71240.2 Plant protein of unknown function (DUF639)1.8e-5927.6Show/hide
Query:  SDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQAL-YSIIKRPDYLSDKDFRRLTYDMMLAWEFP-------
        SD+PK   +  ++ +  + + S++  ++ +++  +F++    ++    T +R L+E+C ++ L     +    L +  F+RL +  MLAW  P       
Subjt:  SDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQAL-YSIIKRPDYLSDKDFRRLTYDMMLAWEFP-------

Query:  --GSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHR-LHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTV
           +  +P  Q    + +G EAF RIAPA   LAD  TVHNLF +L +++  + +   ++  YI+ L K+ +  K+  H +T    LS   ++       
Subjt:  --GSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHR-LHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTV

Query:  PTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICL
           PVL+     AWPG+LTLT  ALYFE + +      ++ DL  D K  ++    GPLG  LFD AV   S   +    LEF +  G  RRD W  I  
Subjt:  PTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICL

Query:  EILRAHKFIRKHNLNEVQKS------------EVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETL-------------------
        E++  H F+R+    E  KS            + +A A   I R++A++   ++     + L+ F+  + +  GD + +TL                   
Subjt:  EILRAHKFIRKHNLNEVQKS------------EVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETL-------------------

Query:  LSRLMLLNTDCFQH--DGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDT
        ++R    + + F +  D  GS   K+  R  P +    S       K  S    + L   + V +   +E A           E  QAT+D   ++GI +
Subjt:  LSRLMLLNTDCFQH--DGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDT

Query:  NLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALML----CRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQE
        N+ + KEL+ P    A   + L  WE+ Y + +FL      I RN ++++LP  L+ LA  ML     RR     +      +   P+ N +++++ +++
Subjt:  NLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALML----CRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQE

Query:  VITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPV
         +  +E+ +Q  N++LLK+R ++ +  PQ T  VAL ++  A +   +PFKY++  VL + +TRE+ +RKE   K+    RE W  +PAAPV
Subjt:  VITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPV

AT2G21720.1 Plant protein of unknown function (DUF639)6.0e-8429.96Show/hide
Query:  VMMDTLMKNQPNTFR---SIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSE-LPGINKEPETYSRSLLEFCSYQALYS
        V++ T    +PN  R   S  ++ K R   +     + + + +LSS+AN VV RCS+ L+ + ++++  F+ +  PG      TYS+  +EFC+ +    
Subjt:  VMMDTLMKNQPNTFR---SIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSE-LPGINKEPETYSRSLLEFCSYQALYS

Query:  IIKR-PDYLSDKDFRRLTYDMMLAWEFP------------GSGSE-----------------------PLLQFDDKKTVGSEAFARIAPACIALADIITV
        + +   + + D  F RLT+DMMLAW+ P            G  SE                       PLL  D + +VG +AF  +        DII  
Subjt:  IIKR-PDYLSDKDFRRLTYDMMLAWEFP------------GSGSE-----------------------PLLQFDDKKTVGSEAFARIAPACIALADIITV

Query:  HNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVK
           F++LT+ +GH+LHF  +D +++ + K +K  +    P    + L++ EI+L ++GT+ +Q V++HI  ++WPGRLTLT++ALYFE+ G+  Y+ A+K
Subjt:  HNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVK

Query:  YDLGADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNL-NEVQKSEVLARAVFGIFRIRAIREA
         DL  D ++  KP  TGPLGA LFDKA++Y+S    E + +EFPE   S+RRD+WL +  EI   HKF+RK N+ + +Q  E+ +R + GI R+ A RE 
Subjt:  YDLGADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNL-NEVQKSEVLARAVFGIFRIRAIREA

Query:  FHVFSSHYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGE-TNPLEI
          +     +  L F+L E +P+GD +LE L    + + T   ++  S S   +          + + QLG  +++E    C   ++      E    LE 
Subjt:  FHVFSSHYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGE-TNPLEI

Query:  AVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWI-RFILPCILVLLAALMLCRRYFGK
        AV QS  +    E A+AT  +++ EGI  ++AV+ ELL P  ++    Q +  WE   ++   L +    + + W+ + I  C++ ++A +   R     
Subjt:  AVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWI-RFILPCILVLLAALMLCRRYFGK

Query:  SKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSN
        +K  +   V++  ++   E +++ Q  + ++   +Q  N+ +LK+R+L  +   +    V  ++++ A  FA +PFK  I+  +V  +          SN
Subjt:  SKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSN

Query:  -KYFRRAREWWIKIPAAPVQL
         +  RR +EWW  IP  PV++
Subjt:  -KYFRRAREWWIKIPAAPVQL

AT3G18350.1 Plant protein of unknown function (DUF639)1.4e-14944.01Show/hide
Query:  SPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQALYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEP
        SPS     IP LS +AN VV RCSKIL +S  E+   F  E     K+P  + R+ LE+C ++AL   +    +L+DK FRRLT+DMM+ WE P   S+ 
Subjt:  SPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQALYSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEP

Query:  LLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHI
        LL  ++  TV  EAF+RIAPA   +AD+I   NLF  LTSS+G RL F V+DKY+  L++ IK  +     S  +   S+ E +LEIDGTV TQPVL+H+
Subjt:  LLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHI

Query:  GISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFI
        GIS WPGRL LT H+LYFE+L V  YD   +Y L  D+KQ IKPELTGP G RLFDKAV Y+S S+ EPV +EFPE KG +RRDYWL I  E+L  H++I
Subjt:  GISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFI

Query:  RKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLG
         K+ +  + + E L++AV G+ R++A++E     +  Y  LL FNL + LP GD ILETL      ++T    H  + S             +  L  + 
Subjt:  RKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLG

Query:  FTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAI
                 E    ++G+V VG+ NPLE AV++S     +   AQ T++ VK+ GIDTNLAVMKEL+ P +E  N +  +  W+D  KS+ F LL  + I
Subjt:  FTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAI

Query:  VRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFA
         R W+ ++     +  A  M+  R F + K +    VT+PP  N +EQLL +Q  I+++E  IQD NI+LLK RALLF++ PQA+   A+ +++AA + A
Subjt:  VRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFA

Query:  FLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS
        F+P +Y++ +V VE +TR  P R+ ++ +  RR REWW  IPAAPV L+   ++KKKK+
Subjt:  FLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS

AT5G23390.1 Plant protein of unknown function (DUF639)3.4e-22055.82Show/hide
Query:  KVVMMDTLMKNQPNTFRSIFQRKKS---RNGD-EPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQAL
        KV M++  MK   ++ +S+FQRKKS   R+GD  PSP  SPK IP LS LANSVV RCSKIL + TE++ H FD ELP   K+  TY+R+ LEFCS+QAL
Subjt:  KVVMMDTLMKNQPNTFRSIFQRKKS---RNGD-EPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQAL

Query:  YSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEP-----------------------------LLQFDDKKTVGSEAFARIAPACIALADIITVHNLFD
        + ++K+PDYLSD++FR+L +DMMLAWE P   SE                               +Q D+KK+VG EAFARIAP C A+AD ITVHNLFD
Subjt:  YSIIKRPDYLSDKDFRRLTYDMMLAWEFPGSGSEP-----------------------------LLQFDDKKTVGSEAFARIAPACIALADIITVHNLFD

Query:  SLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGA
        +LTSSSGHRLH++V+DKY+R+LDK+ KA K++L PS  NL L++GEIVL++DG  P  PVL+H+GISAWPG+LTLT+ ALYF+S+G G  +K ++YDL  
Subjt:  SLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIVLEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGA

Query:  DMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSS
        D KQ IKPELTGPLGAR+FDKA+MYKS +V EPV+ EF EFKG++RRDYWL ICLEILR   FIR++N   +Q+SE+LARA+ GIFR RAIREAF VFSS
Subjt:  DMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEILRAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSS

Query:  HYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSIS
         Y+TLL FNL ESLP GD +LE L SR+  + T+     GS     K      PV L  L   G  L+   +   ++ ++GD  VGET+PLEIA++QSI 
Subjt:  HYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQKEISFECDIFLIGDVWVGETNPLEIAVRQSIS

Query:  DSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFM
        D+ RAEAAQATV+QVKVEGIDTN+AVMKELL PF++L   +  LA W+D YKST F++L  Y I+  WI FILP IL+L+A +M+ R+ F K K  +   
Subjt:  DSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILVLLAALMLCRRYFGKSKPLEPFM

Query:  VTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRARE
        V +PP++NAVEQLL LQ+ I+Q E+ IQ  N+ LLKIRA+  A+LPQATDT A+ L++ A+I A +P KY+I +  VE +TRE+ +RK +S++  RR RE
Subjt:  VTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRKETSNKYFRRARE

Query:  WWIKIPAAPVQLVKPDDSKKKK
        WW ++PAAPVQL++ +DSKKKK
Subjt:  WWIKIPAAPVQLVKPDDSKKKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGAAGAAAGTGAAGGTCGTAATGATGGACACCTTGATGAAGAATCAACCCAACACGTTCAGATCCATCTTCCAGCGCAAGAAATCCAGGAATGGCGACGAACCATC
CCCTTCCGATTCTCCAAAATCCATCCCTAACTTGTCTTCCCTCGCCAATTCCGTCGTCGTTCGCTGCTCCAAGATTCTTCAACTGTCCACTGAAGAAATACTACATCTTT
TTGATTCAGAGCTACCTGGAATTAATAAGGAACCTGAAACTTATTCTAGAAGTTTGTTGGAATTTTGCTCATACCAAGCACTTTATTCAATTATCAAACGTCCAGATTAT
TTAAGTGACAAGGACTTTCGTCGCTTGACATATGACATGATGCTTGCTTGGGAGTTCCCTGGTAGTGGAAGCGAACCACTCCTGCAATTTGATGACAAGAAAACTGTTGG
GTCAGAAGCTTTTGCACGGATAGCTCCTGCTTGCATTGCTCTTGCTGATATCATAACTGTCCACAATCTTTTTGATTCACTGACGAGCTCTTCAGGTCATCGACTTCATT
TTCTTGTCTTCGACAAATATATTAGAAGTCTTGACAAGGTTATTAAGGCAACCAAAAATTCGTTGCACCCATCAACTGGAAATCTTCATCTTTCCGAAGGAGAAATTGTC
CTTGAAATTGATGGTACAGTACCCACTCAACCAGTCCTTCAGCATATTGGCATTTCTGCATGGCCTGGGAGGTTGACACTGACCAGTCATGCCTTGTACTTTGAGTCATT
GGGTGTTGGTTTATACGACAAAGCTGTTAAATATGACCTGGGAGCCGACATGAAGCAGAAAATAAAACCTGAACTTACAGGACCGCTGGGTGCCCGTCTCTTTGATAAAG
CTGTTATGTACAAGTCAACATCTGTGATAGAACCTGTTTATTTAGAGTTTCCTGAATTCAAAGGCAGTTCTCGCCGGGATTATTGGCTGGATATCTGTCTTGAGATCCTG
CGAGCACATAAGTTTATAAGGAAGCACAATCTAAATGAAGTTCAGAAATCAGAAGTACTTGCGAGGGCAGTTTTTGGCATTTTCAGAATTCGTGCAATTAGAGAAGCTTT
CCATGTTTTTTCATCACATTACAGAACCTTACTCACTTTTAATCTGACAGAAAGTCTTCCTAGGGGAGACTCAATTCTGGAAACGCTCTTGAGTCGATTGATGTTGCTAA
ATACAGATTGTTTCCAGCATGATGGCTCTGGGAGTCCATCTGCAAAGCAGCATCGACGACCATATCCAGTTTTTCTTCTAGCACTAAGTCAACTTGGATTCACCTTACAG
AAAGAGATAAGTTTTGAATGCGACATATTCCTAATTGGAGATGTTTGGGTTGGTGAGACAAATCCCTTGGAAATTGCAGTGAGACAATCAATATCAGATTCAGGCAGGGC
TGAAGCTGCTCAAGCGACTGTTGACCAAGTGAAAGTGGAGGGTATTGATACAAATCTTGCAGTGATGAAGGAACTGTTGTTTCCATTTGTGGAATTGGCTAATCGTCTTC
AGATTTTGGCCTCGTGGGAAGACTCTTACAAATCAACGACTTTTCTACTGCTGTTCTGCTATGCTATTGTAAGGAATTGGATAAGGTTTATTTTGCCATGCATTCTAGTG
CTTCTTGCGGCTCTCATGCTTTGCCGTAGATACTTTGGCAAAAGCAAGCCACTAGAACCGTTCATGGTCACATCCCCTCCTAATCGAAATGCTGTAGAACAGTTGCTGAC
CTTACAAGAAGTCATCACTCAAGTTGAAGCATGTATTCAAGATGGGAATATTCTTCTCCTAAAAATAAGAGCTCTCTTATTTGCAGTACTTCCACAGGCAACGGACACGG
TCGCTCTAGTGCTCATTCTTGCCGCTTTAATCTTTGCATTTCTGCCATTTAAATACATAATCATGCTGGTACTCGTAGAGGCGTATACGAGGGAAATGCCATACAGGAAG
GAAACGAGTAACAAATATTTTAGGAGGGCAAGAGAATGGTGGATCAAAATACCAGCAGCTCCTGTTCAACTTGTCAAACCTGATGATAGTAAGAAAAAGAAATCATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGAAGAAAGTGAAGGTCGTAATGATGGACACCTTGATGAAGAATCAACCCAACACGTTCAGATCCATCTTCCAGCGCAAGAAATCCAGGAATGGCGACGAACCATC
CCCTTCCGATTCTCCAAAATCCATCCCTAACTTGTCTTCCCTCGCCAATTCCGTCGTCGTTCGCTGCTCCAAGATTCTTCAACTGTCCACTGAAGAAATACTACATCTTT
TTGATTCAGAGCTACCTGGAATTAATAAGGAACCTGAAACTTATTCTAGAAGTTTGTTGGAATTTTGCTCATACCAAGCACTTTATTCAATTATCAAACGTCCAGATTAT
TTAAGTGACAAGGACTTTCGTCGCTTGACATATGACATGATGCTTGCTTGGGAGTTCCCTGGTAGTGGAAGCGAACCACTCCTGCAATTTGATGACAAGAAAACTGTTGG
GTCAGAAGCTTTTGCACGGATAGCTCCTGCTTGCATTGCTCTTGCTGATATCATAACTGTCCACAATCTTTTTGATTCACTGACGAGCTCTTCAGGTCATCGACTTCATT
TTCTTGTCTTCGACAAATATATTAGAAGTCTTGACAAGGTTATTAAGGCAACCAAAAATTCGTTGCACCCATCAACTGGAAATCTTCATCTTTCCGAAGGAGAAATTGTC
CTTGAAATTGATGGTACAGTACCCACTCAACCAGTCCTTCAGCATATTGGCATTTCTGCATGGCCTGGGAGGTTGACACTGACCAGTCATGCCTTGTACTTTGAGTCATT
GGGTGTTGGTTTATACGACAAAGCTGTTAAATATGACCTGGGAGCCGACATGAAGCAGAAAATAAAACCTGAACTTACAGGACCGCTGGGTGCCCGTCTCTTTGATAAAG
CTGTTATGTACAAGTCAACATCTGTGATAGAACCTGTTTATTTAGAGTTTCCTGAATTCAAAGGCAGTTCTCGCCGGGATTATTGGCTGGATATCTGTCTTGAGATCCTG
CGAGCACATAAGTTTATAAGGAAGCACAATCTAAATGAAGTTCAGAAATCAGAAGTACTTGCGAGGGCAGTTTTTGGCATTTTCAGAATTCGTGCAATTAGAGAAGCTTT
CCATGTTTTTTCATCACATTACAGAACCTTACTCACTTTTAATCTGACAGAAAGTCTTCCTAGGGGAGACTCAATTCTGGAAACGCTCTTGAGTCGATTGATGTTGCTAA
ATACAGATTGTTTCCAGCATGATGGCTCTGGGAGTCCATCTGCAAAGCAGCATCGACGACCATATCCAGTTTTTCTTCTAGCACTAAGTCAACTTGGATTCACCTTACAG
AAAGAGATAAGTTTTGAATGCGACATATTCCTAATTGGAGATGTTTGGGTTGGTGAGACAAATCCCTTGGAAATTGCAGTGAGACAATCAATATCAGATTCAGGCAGGGC
TGAAGCTGCTCAAGCGACTGTTGACCAAGTGAAAGTGGAGGGTATTGATACAAATCTTGCAGTGATGAAGGAACTGTTGTTTCCATTTGTGGAATTGGCTAATCGTCTTC
AGATTTTGGCCTCGTGGGAAGACTCTTACAAATCAACGACTTTTCTACTGCTGTTCTGCTATGCTATTGTAAGGAATTGGATAAGGTTTATTTTGCCATGCATTCTAGTG
CTTCTTGCGGCTCTCATGCTTTGCCGTAGATACTTTGGCAAAAGCAAGCCACTAGAACCGTTCATGGTCACATCCCCTCCTAATCGAAATGCTGTAGAACAGTTGCTGAC
CTTACAAGAAGTCATCACTCAAGTTGAAGCATGTATTCAAGATGGGAATATTCTTCTCCTAAAAATAAGAGCTCTCTTATTTGCAGTACTTCCACAGGCAACGGACACGG
TCGCTCTAGTGCTCATTCTTGCCGCTTTAATCTTTGCATTTCTGCCATTTAAATACATAATCATGCTGGTACTCGTAGAGGCGTATACGAGGGAAATGCCATACAGGAAG
GAAACGAGTAACAAATATTTTAGGAGGGCAAGAGAATGGTGGATCAAAATACCAGCAGCTCCTGTTCAACTTGTCAAACCTGATGATAGTAAGAAAAAGAAATCATAA
Protein sequenceShow/hide protein sequence
MPKKVKVVMMDTLMKNQPNTFRSIFQRKKSRNGDEPSPSDSPKSIPNLSSLANSVVVRCSKILQLSTEEILHLFDSELPGINKEPETYSRSLLEFCSYQALYSIIKRPDY
LSDKDFRRLTYDMMLAWEFPGSGSEPLLQFDDKKTVGSEAFARIAPACIALADIITVHNLFDSLTSSSGHRLHFLVFDKYIRSLDKVIKATKNSLHPSTGNLHLSEGEIV
LEIDGTVPTQPVLQHIGISAWPGRLTLTSHALYFESLGVGLYDKAVKYDLGADMKQKIKPELTGPLGARLFDKAVMYKSTSVIEPVYLEFPEFKGSSRRDYWLDICLEIL
RAHKFIRKHNLNEVQKSEVLARAVFGIFRIRAIREAFHVFSSHYRTLLTFNLTESLPRGDSILETLLSRLMLLNTDCFQHDGSGSPSAKQHRRPYPVFLLALSQLGFTLQ
KEISFECDIFLIGDVWVGETNPLEIAVRQSISDSGRAEAAQATVDQVKVEGIDTNLAVMKELLFPFVELANRLQILASWEDSYKSTTFLLLFCYAIVRNWIRFILPCILV
LLAALMLCRRYFGKSKPLEPFMVTSPPNRNAVEQLLTLQEVITQVEACIQDGNILLLKIRALLFAVLPQATDTVALVLILAALIFAFLPFKYIIMLVLVEAYTREMPYRK
ETSNKYFRRAREWWIKIPAAPVQLVKPDDSKKKKS