; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g30950 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g30950
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionbiotin--protein ligase 2-like
Genome locationchr4:23262800..23270028
RNA-Seq ExpressionMoc04g30950
SyntenyMoc04g30950
Gene Ontology termsGO:0006464 - cellular protein modification process (biological process)
GO:0004077 - biotin-[acetyl-CoA-carboxylase] ligase activity (molecular function)
InterPro domainsIPR003142 - Biotin protein ligase, C-terminal
IPR004143 - Biotinyl protein ligase (BPL) and lipoyl protein ligase (LPL), catalytic domain
IPR004408 - Biotin--acetyl-CoA-carboxylase ligase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602073.1 Serine/threonine-protein kinase MPS1, partial [Cucurbita argyrosperma subsp. sororia]5.8e-17281.91Show/hide
Query:  WRDSRYFFSIVNSKVVASARSLSVPAP---------------------AMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL
        WR SR F SI+NS    S  + +  AP                     AMETSSTC+LVLSGKT AENETAKLLKRN+TLKLPDD  +SV LHSE DK L
Subjt:  WRDSRYFFSIVNSKVVASARSLSVPAP---------------------AMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL

Query:  EENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEA
        +E+GFQI  YMNSLSTD FGRFLIWC RIPSTQDVIS NFSDLPLG+VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYV+SLAITEA
Subjt:  EENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEA

Query:  IKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQ
        +KDICDKKGLPYIDLKIKWPNDLYVNGLK+GGILCTSTYRSKKFNV+AGIGLN+DNDKPTTCLN AL NLSSTP KFRREDIL+ FFNKFERL+DIFINQ
Subjt:  IKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQ

Query:  GFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL
        GFQALEELY QTWLHSGQRVIVQEKKEDQVVENVVTIQGLT +GYLLAIGDDNQMCELHPDGNSLDFFKGL+KSKL
Subjt:  GFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL

XP_022141438.1 biotin--protein ligase 2-like isoform X1 [Momordica charantia]6.4e-203100Show/hide
Query:  RDSRYFFSIVNSKVVASARSLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRF
        RDSRYFFSIVNSKVVASARSLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRF
Subjt:  RDSRYFFSIVNSKVVASARSLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRF

Query:  LIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPND
        LIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPND
Subjt:  LIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPND

Query:  LYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIV
        LYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIV
Subjt:  LYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIV

Query:  QEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKLE
        QEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKLE
Subjt:  QEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKLE

XP_022141440.1 biotin--protein ligase 1, chloroplastic-like isoform X2 [Momordica charantia]2.0e-188100Show/hide
Query:  METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVC
        METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVC
Subjt:  METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVC

Query:  VADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAG
        VADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAG
Subjt:  VADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAG

Query:  IGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAI
        IGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAI
Subjt:  IGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAI

Query:  GDDNQMCELHPDGNSLDFFKGLVKSKLE
        GDDNQMCELHPDGNSLDFFKGLVKSKLE
Subjt:  GDDNQMCELHPDGNSLDFFKGLVKSKLE

XP_022957207.1 biotin--protein ligase 2-like [Cucurbita moschata]1.7e-17181.7Show/hide
Query:  AWRDSRYFFSIVNSKVVASARSLSVPAP---------------------AMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKH
        +WR SR F SI+NS    S  + +  AP                     AMETSSTC+LVLSGKT AENETAKLLKRN+TLKLPDD  +SV LHSE DK 
Subjt:  AWRDSRYFFSIVNSKVVASARSLSVPAP---------------------AMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKH

Query:  LEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITE
        L+E+GFQI  YMNSLSTD FGRFLIWC RIPSTQDVIS NFSDLPLG+VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYV+SLAITE
Subjt:  LEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITE

Query:  AIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFIN
        A+KDICDK+GLPYIDLKIKWPNDLYVNGLK+GGILCTSTYRSKKFNV+AGIGLNVDNDKPTTCLN AL NLSSTP KFRREDILA FFNKFE L+DIFIN
Subjt:  AIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFIN

Query:  QGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL
        QGFQALEELY QTWLHSGQRVIVQEKKEDQVVENVVTIQGLT +GYLLAIGDDNQMCELHPDGNSLDFFKGL+KSKL
Subjt:  QGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL

XP_023517017.1 biotin--protein ligase 2-like [Cucurbita pepo subsp. pepo]5.8e-17281.96Show/hide
Query:  AWRDSRYFFSIVNSKVVASARSLSVPAP---------------------AMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKH
        +WR SR F SI+NS    S  + +  AP                     AMETSSTC+LVLSGKT AENETAKLLKRN+TLKLPDD  +S+ LHSE DK 
Subjt:  AWRDSRYFFSIVNSKVVASARSLSVPAP---------------------AMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKH

Query:  LEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITE
        L+E+GFQI  YMNSLSTD FGRFLIWC RIPSTQDVIS NFSDLPLG+VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITE
Subjt:  LEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITE

Query:  AIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFIN
        A+KDICDKKGLPYIDLKIKWPNDLYVNGLK+GGILCTSTYRSKKFNV+AGIGLNVDNDKPTTCLN AL NLSSTP KFRREDILA FFNKFE L+DIFIN
Subjt:  AIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFIN

Query:  QGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL
        QGFQALEELY QTWLHSGQRVIVQEKKEDQVVENVVTIQGLT +GYLLAIGDDNQMCELHPDGNSLDFFKGL+KSKL
Subjt:  QGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL

TrEMBL top hitse value%identityAlignment
A0A1S3BH31 biotin--protein ligase 2-like isoform X42.0e-17080Show/hide
Query:  RDSRYFFSIVN-------SKVVASARSLSV--------------PAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLE
        RDSR+ FS++N       S++VAS  + ++               A  M+ S TC+LVLSGKTAAENETAKLLKRN+TLKLPDDT +SV LHSE DK LE
Subjt:  RDSRYFFSIVN-------SKVVASARSLSV--------------PAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLE

Query:  ENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAI
        ENGF+IDLY+N+LSTD FGRFLIW PR+PSTQDVISHNFS+LPLG+VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGR VPLLQYV+SLAITEAI
Subjt:  ENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAI

Query:  KDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQG
        KDICDK+GLPYIDLKIKWPNDLYVN LK+GG+LCTSTYR KKFNV+AGIGLNV+NDKP+TCLN AL +LSSTP KFR+EDILA FFNKFERL+D+FINQG
Subjt:  KDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQG

Query:  FQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL
        F+ALEELYYQTWLHSGQRV+VQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGL+KSKL
Subjt:  FQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL

A0A6J1CI37 biotin--protein ligase 1, chloroplastic-like isoform X29.6e-189100Show/hide
Query:  METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVC
        METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVC
Subjt:  METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVC

Query:  VADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAG
        VADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAG
Subjt:  VADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAG

Query:  IGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAI
        IGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAI
Subjt:  IGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAI

Query:  GDDNQMCELHPDGNSLDFFKGLVKSKLE
        GDDNQMCELHPDGNSLDFFKGLVKSKLE
Subjt:  GDDNQMCELHPDGNSLDFFKGLVKSKLE

A0A6J1CKH6 biotin--protein ligase 2-like isoform X13.1e-203100Show/hide
Query:  RDSRYFFSIVNSKVVASARSLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRF
        RDSRYFFSIVNSKVVASARSLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRF
Subjt:  RDSRYFFSIVNSKVVASARSLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRF

Query:  LIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPND
        LIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPND
Subjt:  LIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPND

Query:  LYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIV
        LYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIV
Subjt:  LYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIV

Query:  QEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKLE
        QEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKLE
Subjt:  QEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKLE

A0A6J1GYI6 biotin--protein ligase 2-like8.2e-17281.7Show/hide
Query:  AWRDSRYFFSIVNSKVVASARSLSVPAP---------------------AMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKH
        +WR SR F SI+NS    S  + +  AP                     AMETSSTC+LVLSGKT AENETAKLLKRN+TLKLPDD  +SV LHSE DK 
Subjt:  AWRDSRYFFSIVNSKVVASARSLSVPAP---------------------AMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKH

Query:  LEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITE
        L+E+GFQI  YMNSLSTD FGRFLIWC RIPSTQDVIS NFSDLPLG+VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYV+SLAITE
Subjt:  LEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITE

Query:  AIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFIN
        A+KDICDK+GLPYIDLKIKWPNDLYVNGLK+GGILCTSTYRSKKFNV+AGIGLNVDNDKPTTCLN AL NLSSTP KFRREDILA FFNKFE L+DIFIN
Subjt:  AIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFIN

Query:  QGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL
        QGFQALEELY QTWLHSGQRVIVQEKKEDQVVENVVTIQGLT +GYLLAIGDDNQMCELHPDGNSLDFFKGL+KSKL
Subjt:  QGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL

A0A6J1JK17 biotin--protein ligase 2-like1.8e-17181.7Show/hide
Query:  AWRDSRYFFSIVNSKVVASARSLSVPAP---------------------AMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKH
        +WR SR F SI++S    S  + +  AP                     AME SSTCALVLSGKT AENETAKLLKRN+TLKLPDD  +SV LHSE DK 
Subjt:  AWRDSRYFFSIVNSKVVASARSLSVPAP---------------------AMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKH

Query:  LEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITE
        L+E+GFQI  YMNSLSTD FGRFLIWC RIPSTQDVIS NFSDLPLG+VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYV+SLAITE
Subjt:  LEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITE

Query:  AIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFIN
        A+KDICDKKGLPYIDLKIKWPNDLYVNGLK+GGILCTSTYRSKKFNV+AGIGLNVDNDKPTTCLN AL NLSSTP KFRREDILA FFNKFE L+DIFIN
Subjt:  AIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFIN

Query:  QGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL
        QGFQALEELY QTWLHSGQRVIVQEKKEDQVVENVVTIQGLT +GYLLAIGDDNQMCELHPDGNSLDFFKGL+KSKL
Subjt:  QGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL

SwissProt top hitse value%identityAlignment
F4I4W2 Biotin--protein ligase 23.5e-13567.48Show/hide
Query:  METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGS
        M+  ++C+LVL GK++ E +TA  LK NN LKLPD++ +S+FL SE    +  +++ F + L+MNS+ST  FGRFLIW P + ST DV+SHNFS++P+GS
Subjt:  METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGS

Query:  VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVS
        VCV+D+Q KGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYVVSLA+TEA+KD+CDKKGL Y D+KIKWPNDLY+NGLK+GGILCTSTYRS+KF VS
Subjt:  VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVS

Query:  AGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLL
         G+GLNVDN++PTTCLNA LK++    +  +RE+IL +FF KFE   D+F+ QGF++LEELYY+TWLHSGQRVI +EK EDQVV+NVVTIQGLTSSGYLL
Subjt:  AGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLL

Query:  AIGDDNQMCELHPDGNSLDFFKGLVKSKL
        AIGDDN M ELHPDGNS DFFKGLV+ KL
Subjt:  AIGDDNQMCELHPDGNSLDFFKGLVKSKL

O14353 Biotin--protein ligase8.8e-3834.11Show/hide
Query:  FQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLP---LGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRI----VPLLQYVVSLAI
        F ++LY   ++   FG  +I  P I STQ ++  N+  L     G   + + Q  GRGR +N+W SP G L FSF I ++        + L QY+++LA+
Subjt:  FQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLP---LGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRI----VPLLQYVVSLAI

Query:  TEAIKDICDKKGLPYIDLKIKWPNDLYV------------NGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPD-----KFRRE
           I++     G   I   IKWPND+YV              +KL GI+ TS YR    ++  G G+NV N  PT  LN  +   +   D     KF  E
Subjt:  TEAIKDICDKKGLPYIDLKIKWPNDLYV------------NGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPD-----KFRRE

Query:  DILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSS-GYLLA--IGDDNQ----MCELHPDGNSLDFFKGLVK
         +LAS  N+F+R H + + +GF  +   YYQ WLHS Q V +    +         IQG+TS  G+LLA  + ++N+    +  L PDGNS D  + L+ 
Subjt:  DILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSS-GYLLA--IGDDNQ----MCELHPDGNSLDFFKGLVK

Query:  SK
         K
Subjt:  SK

P50747 Biotin--protein ligase3.9e-3830.85Show/hide
Query:  FQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLP--LGSVCVADVQFKGRGRSKNLWESPPGC----LMFSFTIQMEDGRIVPLLQYVVSLAIT
        F +++Y  +L T   G+ +++    P+T  ++       P  +G + +A  Q +G+GR  N+W SP GC    L+ S  ++ + G+ +P +Q+++S+A+ 
Subjt:  FQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLP--LGSVCVADVQFKGRGRSKNLWESPPGC----LMFSFTIQMEDGRIVPLLQYVVSLAIT

Query:  EAIKDICDKKGLPYIDLKIKWPNDLYVNGL-KLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAAL----KNLSSTPDKFRREDILASFFNKFERL
        EA++ I + +    I+L++KWPND+Y + L K+GG+L  ST   + F +  G G NV N  PT C+N  +    K   +     R + ++A      E+L
Subjt:  EAIKDICDKKGLPYIDLKIKWPNDLYVNGL-KLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAAL----KNLSSTPDKFRREDILASFFNKFERL

Query:  HDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSK
           F ++G  ++  LYY+ W+HSGQ+V +   +  +     V+I GL  SG+L    +  ++  +HPDGNS D  + L+  K
Subjt:  HDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSK

Q920N2 Biotin--protein ligase3.9e-3831.56Show/hide
Query:  FQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLP--LGSVCVADVQFKGRGRSKNLWESPPGC----LMFSFTIQMEDGRIVPLLQYVVSLAIT
        F ++ Y  +L T   G+ +++     +T  ++     ++P  +G + +A  Q +G+GR  N W SP GC    L+    ++ + G+ +P +Q+++SLA+ 
Subjt:  FQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLP--LGSVCVADVQFKGRGRSKNLWESPPGC----LMFSFTIQMEDGRIVPLLQYVVSLAIT

Query:  EAIKDICDKKGLPYIDLKIKWPNDLYVNGL-KLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAAL----KNLSSTPDKFRREDILASFFNKFERL
        EA++ I    G   I+L++KWPND+Y + L K+GG+L  ST   + F +  G G NV N  PT C+N  +    K   +     R + ++A      E+L
Subjt:  EAIKDICDKKGLPYIDLKIKWPNDLYVNGL-KLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAAL----KNLSSTPDKFRREDILASFFNKFERL

Query:  HDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSK
         D F +QG   +  LYY+ W+H GQ+V +   +  Q      +I GL  SG+L    +D  +  +HPDGNS D  + L+  K
Subjt:  HDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSK

Q9SL92 Biotin--protein ligase 1, chloroplastic1.1e-14167.04Show/hide
Query:  FSIVNSKVVASAR-----SLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGR
        F ++N  V+ S +     S S  A AME+ ++C+LVL GK++ E E AK LK  N+LKLPD+T +S+ L SEA   +  ++N F + L+MNS+ T  FGR
Subjt:  FSIVNSKVVASAR-----SLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGR

Query:  FLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPN
        FLIW PR+ ST DV+SHNFS+LP+GSVCV D+QFKGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYVVSLA+TEA+KD+CDKKGLPYID+KIKWPN
Subjt:  FLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPN

Query:  DLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVI
        DLYVNGLK+GGILCTSTYRSKKFNVS G+GLNVDN +PTTCLNA LK ++   +  +RE+IL +FF+KFE+  D+F++QGF++LEELYY+TWLHS QRVI
Subjt:  DLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVI

Query:  VQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL
        V++K EDQVV+NVVTIQGLTSSGYLLA+GDDNQM ELHPDGNS DFFKGLV+ K+
Subjt:  VQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL

Arabidopsis top hitse value%identityAlignment
AT1G37150.2 holocarboxylase synthetase 22.5e-13667.48Show/hide
Query:  METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGS
        M+  ++C+LVL GK++ E +TA  LK NN LKLPD++ +S+FL SE    +  +++ F + L+MNS+ST  FGRFLIW P + ST DV+SHNFS++P+GS
Subjt:  METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGS

Query:  VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVS
        VCV+D+Q KGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYVVSLA+TEA+KD+CDKKGL Y D+KIKWPNDLY+NGLK+GGILCTSTYRS+KF VS
Subjt:  VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVS

Query:  AGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLL
         G+GLNVDN++PTTCLNA LK++    +  +RE+IL +FF KFE   D+F+ QGF++LEELYY+TWLHSGQRVI +EK EDQVV+NVVTIQGLTSSGYLL
Subjt:  AGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLL

Query:  AIGDDNQMCELHPDGNSLDFFKGLVKSKL
        AIGDDN M ELHPDGNS DFFKGLV+ KL
Subjt:  AIGDDNQMCELHPDGNSLDFFKGLVKSKL

AT1G37150.3 holocarboxylase synthetase 26.3e-11664.95Show/hide
Query:  METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGS
        M+  ++C+LVL GK++ E +TA  LK NN LKLPD++ +S+FL SE    +  +++ F + L+MNS+ST  FGRFLIW P + ST DV+SHNFS++P+GS
Subjt:  METSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGS

Query:  VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVS
        VCV+D+Q KGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYVVSLA+TEA+KD+CDKKGL Y D+KIKWPNDLY+NGLK+GGILCTSTYRS+KF VS
Subjt:  VCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVS

Query:  AGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQ
         G+GLNVDN++PTTCLNA LK++    +  +RE+IL +FF KFE   D+F+ QGF++LEELYY+TWLHSGQRVI +EK EDQVV+NVVTIQ
Subjt:  AGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQ

AT1G37150.4 holocarboxylase synthetase 23.4e-11473.6Show/hide
Query:  PRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVN
        P + ST DV+SHNFS++P+GSVCV+D+Q KGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYVVSLA+TEA+KD+CDKKGL Y D+KIKWPNDLY+N
Subjt:  PRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVN

Query:  GLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKK
        GLK+GGILCTSTYRS+KF VS G+GLNVDN++PTTCLNA LK++    +  +RE+IL +FF KFE   D+F+ QGF++LEELYY+TWLHSGQRVI +EK 
Subjt:  GLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKK

Query:  EDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL
        EDQVV+NVVTIQGLTSSGYLLAIGDDN M ELHPDGNS DFFKGLV+ KL
Subjt:  EDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL

AT2G25710.1 holocarboxylase synthase 17.9e-14367.04Show/hide
Query:  FSIVNSKVVASAR-----SLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGR
        F ++N  V+ S +     S S  A AME+ ++C+LVL GK++ E E AK LK  N+LKLPD+T +S+ L SEA   +  ++N F + L+MNS+ T  FGR
Subjt:  FSIVNSKVVASAR-----SLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGR

Query:  FLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPN
        FLIW PR+ ST DV+SHNFS+LP+GSVCV D+QFKGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYVVSLA+TEA+KD+CDKKGLPYID+KIKWPN
Subjt:  FLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPN

Query:  DLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVI
        DLYVNGLK+GGILCTSTYRSKKFNVS G+GLNVDN +PTTCLNA LK ++   +  +RE+IL +FF+KFE+  D+F++QGF++LEELYY+TWLHS QRVI
Subjt:  DLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVI

Query:  VQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL
        V++K EDQVV+NVVTIQGLTSSGYLLA+GDDNQM ELHPDGNS DFFKGLV+ K+
Subjt:  VQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL

AT2G25710.2 holocarboxylase synthase 17.9e-14367.04Show/hide
Query:  FSIVNSKVVASAR-----SLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGR
        F ++N  V+ S +     S S  A AME+ ++C+LVL GK++ E E AK LK  N+LKLPD+T +S+ L SEA   +  ++N F + L+MNS+ T  FGR
Subjt:  FSIVNSKVVASAR-----SLSVPAPAMETSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHL--EENGFQIDLYMNSLSTDAFGR

Query:  FLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPN
        FLIW PR+ ST DV+SHNFS+LP+GSVCV D+QFKGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYVVSLA+TEA+KD+CDKKGLPYID+KIKWPN
Subjt:  FLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPN

Query:  DLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVI
        DLYVNGLK+GGILCTSTYRSKKFNVS G+GLNVDN +PTTCLNA LK ++   +  +RE+IL +FF+KFE+  D+F++QGF++LEELYY+TWLHS QRVI
Subjt:  DLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTTCLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVI

Query:  VQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL
        V++K EDQVV+NVVTIQGLTSSGYLLA+GDDNQM ELHPDGNS DFFKGLV+ K+
Subjt:  VQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLVKSKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATGGTGCATCAAAAACCTGAAGTTGAGTACAAAAAAGGAAATGGGCTAAAGCTAAGCCAATACAATTACAATAGAAGAGATGACACCAGTGGACAGAAC
TTGAAAGCAAATAAAGACCAAGTTTTTGTTGGTCTTGTATTGTGGAGGCACGCCGCTTCTGGAGTTCTCGTCGCTTTCAACCCACACGGTGAAAGAAGATTTGCC
CATGTACCAGAGGCTTGGCGAGACTCGCGCTACTTCTTCTCCATCGTAAACTCCAAAGTTGTTGCTTCAGCGCGCTCTCTTTCAGTGCCCGCCCCAGCAATGGAG
ACCAGTTCAACCTGTGCGTTAGTTTTGAGTGGAAAAACAGCGGCTGAGAACGAAACTGCCAAATTGCTGAAGAGGAATAACACTCTGAAGCTCCCTGACGATACT
GGACTTTCGGTTTTTCTACACTCAGAAGCGGATAAGCATTTGGAGGAAAATGGTTTTCAGATTGATTTGTATATGAATTCTCTTTCTACTGATGCTTTTGGTAGA
TTCCTCATTTGGTGTCCGCGGATTCCTTCAACTCAAGACGTCATTTCTCACAACTTCAGCGACCTTCCATTGGGTTCTGTCTGTGTGGCTGATGTTCAGTTCAAG
GGAAGAGGTCGATCGAAGAATTTGTGGGAATCTCCCCCTGGTTGCCTTATGTTTTCCTTTACCATTCAAATGGAAGATGGGCGTATTGTTCCTCTACTACAGTAT
GTTGTATCTCTTGCTATTACCGAGGCCATAAAAGATATTTGCGACAAAAAGGGACTACCCTATATTGATTTGAAAATAAAGTGGCCAAATGATCTTTATGTGAAT
GGCCTGAAACTTGGAGGCATTCTGTGCACTTCAACATATAGATCAAAGAAGTTCAACGTTAGTGCTGGTATAGGCTTGAATGTGGACAATGATAAACCAACGACA
TGCTTGAATGCAGCTCTTAAAAATTTGTCCAGTACACCTGACAAGTTCAGGAGGGAGGATATCTTAGCATCCTTTTTTAACAAATTTGAAAGATTGCACGATATT
TTCATAAATCAAGGGTTTCAGGCTCTTGAGGAACTTTACTATCAGACATGGTTGCACAGTGGGCAAAGAGTTATTGTACAAGAAAAGAAAGAAGACCAAGTAGTG
GAAAATGTAGTCACTATTCAGGGTTTGACATCTTCAGGATATTTGCTAGCTATTGGAGATGACAACCAAATGTGCGAGCTCCATCCCGATGGAAATAGTTTGGAC
TTTTTCAAAGGACTGGTCAAGAGCAAACTGGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATGGTGCATCAAAAACCTGAAGTTGAGTACAAAAAAGGAAATGGGCTAAAGCTAAGCCAATACAATTACAATAGAAGAGATGACACCAGTGGACAGAAC
TTGAAAGCAAATAAAGACCAAGTTTTTGTTGGTCTTGTATTGTGGAGGCACGCCGCTTCTGGAGTTCTCGTCGCTTTCAACCCACACGGTGAAAGAAGATTTGCC
CATGTACCAGAGGCTTGGCGAGACTCGCGCTACTTCTTCTCCATCGTAAACTCCAAAGTTGTTGCTTCAGCGCGCTCTCTTTCAGTGCCCGCCCCAGCAATGGAG
ACCAGTTCAACCTGTGCGTTAGTTTTGAGTGGAAAAACAGCGGCTGAGAACGAAACTGCCAAATTGCTGAAGAGGAATAACACTCTGAAGCTCCCTGACGATACT
GGACTTTCGGTTTTTCTACACTCAGAAGCGGATAAGCATTTGGAGGAAAATGGTTTTCAGATTGATTTGTATATGAATTCTCTTTCTACTGATGCTTTTGGTAGA
TTCCTCATTTGGTGTCCGCGGATTCCTTCAACTCAAGACGTCATTTCTCACAACTTCAGCGACCTTCCATTGGGTTCTGTCTGTGTGGCTGATGTTCAGTTCAAG
GGAAGAGGTCGATCGAAGAATTTGTGGGAATCTCCCCCTGGTTGCCTTATGTTTTCCTTTACCATTCAAATGGAAGATGGGCGTATTGTTCCTCTACTACAGTAT
GTTGTATCTCTTGCTATTACCGAGGCCATAAAAGATATTTGCGACAAAAAGGGACTACCCTATATTGATTTGAAAATAAAGTGGCCAAATGATCTTTATGTGAAT
GGCCTGAAACTTGGAGGCATTCTGTGCACTTCAACATATAGATCAAAGAAGTTCAACGTTAGTGCTGGTATAGGCTTGAATGTGGACAATGATAAACCAACGACA
TGCTTGAATGCAGCTCTTAAAAATTTGTCCAGTACACCTGACAAGTTCAGGAGGGAGGATATCTTAGCATCCTTTTTTAACAAATTTGAAAGATTGCACGATATT
TTCATAAATCAAGGGTTTCAGGCTCTTGAGGAACTTTACTATCAGACATGGTTGCACAGTGGGCAAAGAGTTATTGTACAAGAAAAGAAAGAAGACCAAGTAGTG
GAAAATGTAGTCACTATTCAGGGTTTGACATCTTCAGGATATTTGCTAGCTATTGGAGATGACAACCAAATGTGCGAGCTCCATCCCGATGGAAATAGTTTGGAC
TTTTTCAAAGGACTGGTCAAGAGCAAACTGGAGTGA
Protein sequenceShow/hide protein sequence
MKMVHQKPEVEYKKGNGLKLSQYNYNRRDDTSGQNLKANKDQVFVGLVLWRHAASGVLVAFNPHGERRFAHVPEAWRDSRYFFSIVNSKVVASARSLSVPAPAME
TSSTCALVLSGKTAAENETAKLLKRNNTLKLPDDTGLSVFLHSEADKHLEENGFQIDLYMNSLSTDAFGRFLIWCPRIPSTQDVISHNFSDLPLGSVCVADVQFK
GRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVVSLAITEAIKDICDKKGLPYIDLKIKWPNDLYVNGLKLGGILCTSTYRSKKFNVSAGIGLNVDNDKPTT
CLNAALKNLSSTPDKFRREDILASFFNKFERLHDIFINQGFQALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLD
FFKGLVKSKLE