CuGenDBv2

CcUC11G218330 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC11G218330
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: biotin--protein ligase 2-like
Genome location: CicolChr11:19377663..19394496
RNA-Seq Expression: CcUC11G218330
Synteny: CcUC11G218330
Gene Ontology terms: GO:0006464 - cellular protein modification process (biological process)
GO:0004077 - biotin-[acetyl-CoA-carboxylase] ligase activity (molecular function)
InterPro domains: IPR003142 - Biotin protein ligase, C-terminal
IPR004143 - Biotinyl protein ligase (BPL) and lipoyl protein ligase (LPL), catalytic domain
IPR004408 - Biotin--acetyl-CoA-carboxylase ligase
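
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts as shown below. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re

def parse_location(text):
    """Split 'chrom:start..end' into (chrom, start, end) with integer coordinates."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("CicolChr11:19377663..19394496")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```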


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004150602.2 biotin--protein ligase 2 [Cucumis sativus] (e value: 1.2e-186; % identity: 93.39)
Query:  ETPINERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRI
        E PINER R+RLFSLS ASGMD SP C+LVLSGKTA ENETAKLLKRNDTLKLPDDTEISVLLHSE+DKPLEEN FRIDLYLNALSTDTFGRFLIWSPR+
Subjt:  ETPINERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRI

Query:  PSTQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLK
        PSTQDVISHNFSNLPLGAVC ADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLA+TEAIKDI +++GLPYIDLKIKWPNDLYVNDLK
Subjt:  PSTQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLK

Query:  VGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQ
        VGG+LCTSTYR KKFNVTAGIGLNVDND+PSTCLNEAL+NLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRV+VQEKKEDQ
Subjt:  VGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQ

Query:  VVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV
        VVENVVTIQGLTSSGYLLAIGDD QMCELHPDGNSLDFFKGLIKSKLV
Subjt:  VVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV

XP_008447538.1 PREDICTED: biotin--protein ligase 2-like isoform X3 [Cucumis melo] (e value: 1.9e-184; % identity: 91.78)
Query:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST
        INERER+RLFSLSSASGMD SP C+LVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSE+DKPLEEN FRIDLYLNALSTDTFGRFLIWSPR+PST
Subjt:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST

Query:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG
        QDVISHNFSNLPLGAVC ADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGR VPLLQYVISLA+TEAIKDI +++GLPYIDLKIKWPNDLYVNDLKVGG
Subjt:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG

Query:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE
        +LCTSTYR KKFNVTAGIGLNV+ND+PSTCLNEAL++LSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRV+VQEKKEDQVVE
Subjt:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE

Query:  NVVT--------IQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV
        NVVT        IQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV
Subjt:  NVVT--------IQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV

XP_008447539.1 PREDICTED: biotin--protein ligase 2-like isoform X4 [Cucumis melo] (e value: 1.2e-186; % identity: 93.91)
Query:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST
        INERER+RLFSLSSASGMD SP C+LVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSE+DKPLEEN FRIDLYLNALSTDTFGRFLIWSPR+PST
Subjt:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST

Query:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG
        QDVISHNFSNLPLGAVC ADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGR VPLLQYVISLA+TEAIKDI +++GLPYIDLKIKWPNDLYVNDLKVGG
Subjt:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG

Query:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE
        +LCTSTYR KKFNVTAGIGLNV+ND+PSTCLNEAL++LSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRV+VQEKKEDQVVE
Subjt:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE

Query:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV
        NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV
Subjt:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV

XP_016900373.1 PREDICTED: biotin--protein ligase 2-like isoform X2 [Cucumis melo] (e value: 1.4e-179; % identity: 91.76)
Query:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST
        INERER+RLFSLSSASGMD SP C+LVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSE+DKPLEEN FRIDLYLNALSTDTFGRFLIWSPR+PST
Subjt:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST

Query:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG
        QDVISHNFSNLPLGAVC ADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGR VPLLQYVISLA+TEAIKDI +++GLPYIDLKIKWPNDLYVNDLKVGG
Subjt:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG

Query:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE
        +LCTSTYR KKFNVTAGIGLNV+ND+PSTCLNEAL++LSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRV+VQEKKEDQVVE
Subjt:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE

Query:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLI
        NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNS      L+
Subjt:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLI

XP_038896816.1 biotin--protein ligase 2-like isoform X1 [Benincasa hispida] (e value: 1.2e-181; % identity: 91.4)
Query:  IETPINERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPR
        +ET IN+RER+RLFSL SASGMD +P CALVLSGKT AENETAKLLKR+DTLKLPDDT+ISVLLHSERDKPLEEN F+IDLYLNALSTDTFGRFLIWSPR
Subjt:  IETPINERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPR

Query:  IPSTQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDL
        IPSTQDVISHNFSNLPLGAVC ADVQFKGRGRSKN+WESP GCLMFSFTIQMEDG IVPLLQYVISLA+TEAIKDI +++GLPYIDLKIKWPNDLYVND 
Subjt:  IPSTQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDL

Query:  KVGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKED
        KVGGILCTSTYRSKKFNVTAGIGLNVDND+PST LN AL+NLSSTPYKFRKEDILAFFFNKFERLYDVFINQGF+ALEELYYQTWLHSGQRVIV+EKKED
Subjt:  KVGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKED

Query:  QVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV
        QVVENVVTIQGLTSSGYLLA+GDDNQMCELHPDGNSLDFFKGLIKSKLV
Subjt:  QVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3BH31 biotin--protein ligase 2-like isoform X4 (e value: 5.8e-187; % identity: 93.91)
Query:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST
        INERER+RLFSLSSASGMD SP C+LVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSE+DKPLEEN FRIDLYLNALSTDTFGRFLIWSPR+PST
Subjt:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST

Query:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG
        QDVISHNFSNLPLGAVC ADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGR VPLLQYVISLA+TEAIKDI +++GLPYIDLKIKWPNDLYVNDLKVGG
Subjt:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG

Query:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE
        +LCTSTYR KKFNVTAGIGLNV+ND+PSTCLNEAL++LSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRV+VQEKKEDQVVE
Subjt:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE

Query:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV
        NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV
Subjt:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV

A0A1S3BIK3 biotin--protein ligase 2-like isoform X3 (e value: 9.2e-185; % identity: 91.78)
Query:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST
        INERER+RLFSLSSASGMD SP C+LVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSE+DKPLEEN FRIDLYLNALSTDTFGRFLIWSPR+PST
Subjt:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST

Query:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG
        QDVISHNFSNLPLGAVC ADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGR VPLLQYVISLA+TEAIKDI +++GLPYIDLKIKWPNDLYVNDLKVGG
Subjt:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG

Query:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE
        +LCTSTYR KKFNVTAGIGLNV+ND+PSTCLNEAL++LSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRV+VQEKKEDQVVE
Subjt:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE

Query:  NVVT--------IQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV
        NVVT        IQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV
Subjt:  NVVT--------IQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV

A0A1S4DWL1 biotin--protein ligase 2-like isoform X6 (e value: 5.8e-179; % identity: 93.66)
Query:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST
        INERER+RLFSLSSASGMD SP C+LVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSE+DKPLEEN FRIDLYLNALSTDTFGRFLIWSPR+PST
Subjt:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST

Query:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG
        QDVISHNFSNLPLGAVC ADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGR VPLLQYVISLA+TEAIKDI +++GLPYIDLKIKWPNDLYVNDLKVGG
Subjt:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG

Query:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE
        +LCTSTYR KKFNVTAGIGLNV+ND+PSTCLNEAL++LSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRV+VQEKKEDQVVE
Subjt:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE

Query:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGN
        NVVTIQGLTSSGYLLAIGDDNQMCELHPDGN
Subjt:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGN

A0A1S4DWM1 biotin--protein ligase 2-like isoform X2 (e value: 6.8e-180; % identity: 91.76)
Query:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST
        INERER+RLFSLSSASGMD SP C+LVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSE+DKPLEEN FRIDLYLNALSTDTFGRFLIWSPR+PST
Subjt:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST

Query:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG
        QDVISHNFSNLPLGAVC ADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGR VPLLQYVISLA+TEAIKDI +++GLPYIDLKIKWPNDLYVNDLKVGG
Subjt:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG

Query:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE
        +LCTSTYR KKFNVTAGIGLNV+ND+PSTCLNEAL++LSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRV+VQEKKEDQVVE
Subjt:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE

Query:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLI
        NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNS      L+
Subjt:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLI

A0A5A7U4Q8 Biotin--protein ligase 2-like isoform X2 (e value: 2.0e-179; % identity: 93.67)
Query:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST
        INERER+RLFSLSSASGMD SP C+LVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSE+DKPLEEN FRIDLYLNALSTDTFGRFLIWSPR+PST
Subjt:  INERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSPRIPST

Query:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG
        QDVISHNFSNLPLGAVC ADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGR VPLLQYVISLA+TEAIKDI +++GLPYIDLKIKWPNDLYVNDLKVGG
Subjt:  QDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGG

Query:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE
        +LCTSTYR KKFNVTAGIGLNV+ND+PSTCLNEAL++LSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRV+VQEKKEDQVVE
Subjt:  ILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVE

Query:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNS
        NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNS
Subjt:  NVVTIQGLTSSGYLLAIGDDNQMCELHPDGNS

SwissProt top hits (columns: e value, % identity, alignment)
F4I4W2 Biotin--protein ligase 2 (e value: 1.7e-127; % identity: 63.22)
Query:  MDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLPLGA
        MDI   C+LVL GK++ E +TA  LK N+ LKLPD++++S+ L SE    +  +++ F + L++N++ST  FGRFLIWSP + ST DV+SHNFS +P+G+
Subjt:  MDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLPLGA

Query:  VCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGGILCTSTYRSKKFNVT
        VC +D+Q KGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYV+SLA+TEA+KD+ ++KGL Y D+KIKWPNDLY+N LK+GGILCTSTYRS+KF V+
Subjt:  VCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGGILCTSTYRSKKFNVT

Query:  AGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLL
         G+GLNVDN++P+TCLN  L ++       ++E+IL  FF KFE  +D+F+ QGF++LEELYY+TWLHSGQRVI +EK EDQVV+NVVTIQGLTSSGYLL
Subjt:  AGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLL

Query:  AIGDDNQMCELHPDGNSLDFFKGLIKSKL
        AIGDDN M ELHPDGNS DFFKGL++ KL
Subjt:  AIGDDNQMCELHPDGNSLDFFKGLIKSKL

O14353 Biotin--protein ligase (e value: 1.3e-34; % identity: 32.41)
Query:  DTEISVLLHSERDKPLEENE--FRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLP---LGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQ
        D EI  + + + +K  ++ +  F ++LY   ++   FG  +I +P I STQ ++  N+  L     G     + Q  GRGR +N+W SP G L FSF I 
Subjt:  DTEISVLLHSERDKPLEENE--FRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLP---LGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQ

Query:  MEDGRI----VPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYV------------NDLKVGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCL
        ++        + L QY+++LA+   I++ +   G   I   IKWPND+YV              +K+ GI+ TS YR    ++  G G+NV N  P+  L
Subjt:  MEDGRI----VPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYV------------NDLKVGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCL

Query:  N---EALSNLSSTP--YKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSS-GYLLA--IGDDNQ-
        N   +  +  S  P   KF  E +LA   N+F+R + + + +GF  +   YYQ WLHS Q V +    +         IQG+TS  G+LLA  + ++N+ 
Subjt:  N---EALSNLSSTP--YKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSS-GYLLA--IGDDNQ-

Query:  ---MCELHPDGNSLDFFKGLIKSK
           +  L PDGNS D  + LI  K
Subjt:  ---MCELHPDGNSLDFFKGLIKSK

P50747 Biotin--protein ligase (e value: 7.3e-38; % identity: 30.85)
Query:  FRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLP--LGAVCAADVQFKGRGRSKNLWESPPGC----LMFSFTIQMEDGRIVPLLQYVISLAMT
        F +++Y   L T   G+ ++++   P+T  ++       P  +G +  A  Q +G+GR  N+W SP GC    L+ S  ++ + G+ +P +Q+++S+A+ 
Subjt:  FRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLP--LGAVCAADVQFKGRGRSKNLWESPPGC----LMFSFTIQMEDGRIVPLLQYVISLAMT

Query:  EAIKDISNEKGLPYIDLKIKWPNDLYVNDL-KVGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLS----STPYKFRKEDILAFFFNKFERL
        EA++ I   +    I+L++KWPND+Y +DL K+GG+L  ST   + F +  G G NV N  P+ C+N+ ++  +    +     R + ++A      E+L
Subjt:  EAIKDISNEKGLPYIDLKIKWPNDLYVNDL-KVGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLS----STPYKFRKEDILAFFFNKFERL

Query:  YDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSK
           F ++G  ++  LYY+ W+HSGQ+V +   +  +     V+I GL  SG+L    +  ++  +HPDGNS D  + LI  K
Subjt:  YDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSK

Q920N2 Biotin--protein ligase (e value: 4.7e-37; % identity: 31.56)
Query:  FRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLP--LGAVCAADVQFKGRGRSKNLWESPPGC----LMFSFTIQMEDGRIVPLLQYVISLAMT
        F ++ Y   L T   G+ ++++    +T  ++      +P  +G +  A  Q +G+GR  N W SP GC    L+    ++ + G+ +P +Q+++SLA+ 
Subjt:  FRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLP--LGAVCAADVQFKGRGRSKNLWESPPGC----LMFSFTIQMEDGRIVPLLQYVISLAMT

Query:  EAIKDISNEKGLPYIDLKIKWPNDLYVNDL-KVGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTP----YKFRKEDILAFFFNKFERL
        EA++ I    G   I+L++KWPND+Y +DL K+GG+L  ST   + F +  G G NV N  P+ C+N+ +   +          R + ++A      E+L
Subjt:  EAIKDISNEKGLPYIDLKIKWPNDLYVNDL-KVGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTP----YKFRKEDILAFFFNKFERL

Query:  YDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSK
         D F +QG   +  LYY+ W+H GQ+V +   +  Q      +I GL  SG+L    +D  +  +HPDGNS D  + LI  K
Subjt:  YDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSK

Q9SL92 Biotin--protein ligase 1, chloroplastic (e value: 3.5e-133; % identity: 64.16)
Query:  RERKRLFSLS---SASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIP
        R  K L  LS   SAS M+    C+LVL GK++ E E AK LK  ++LKLPD+T++S++L SE    +  ++N F + L++N++ T  FGRFLIWSPR+ 
Subjt:  RERKRLFSLS---SASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIP

Query:  STQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKV
        ST DV+SHNFS LP+G+VC  D+QFKGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYV+SLA+TEA+KD+ ++KGLPYID+KIKWPNDLYVN LKV
Subjt:  STQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKV

Query:  GGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQV
        GGILCTSTYRSKKFNV+ G+GLNVDN +P+TCLN  L  ++      ++E+IL  FF+KFE+ +D+F++QGF++LEELYY+TWLHS QRVIV++K EDQV
Subjt:  GGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQV

Query:  VENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKL
        V+NVVTIQGLTSSGYLLA+GDDNQM ELHPDGNS DFFKGL++ K+
Subjt:  VENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKL

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G37150.2 holocarboxylase synthetase 2 (e value: 1.2e-128; % identity: 63.22)
Query:  MDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLPLGA
        MDI   C+LVL GK++ E +TA  LK N+ LKLPD++++S+ L SE    +  +++ F + L++N++ST  FGRFLIWSP + ST DV+SHNFS +P+G+
Subjt:  MDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLPLGA

Query:  VCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGGILCTSTYRSKKFNVT
        VC +D+Q KGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYV+SLA+TEA+KD+ ++KGL Y D+KIKWPNDLY+N LK+GGILCTSTYRS+KF V+
Subjt:  VCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGGILCTSTYRSKKFNVT

Query:  AGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLL
         G+GLNVDN++P+TCLN  L ++       ++E+IL  FF KFE  +D+F+ QGF++LEELYY+TWLHSGQRVI +EK EDQVV+NVVTIQGLTSSGYLL
Subjt:  AGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLL

Query:  AIGDDNQMCELHPDGNSLDFFKGLIKSKL
        AIGDDN M ELHPDGNS DFFKGL++ KL
Subjt:  AIGDDNQMCELHPDGNSLDFFKGLIKSKL

AT1G37150.3 holocarboxylase synthetase 2 (e value: 2.4e-108; % identity: 60.48)
Query:  MDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLPLGA
        MDI   C+LVL GK++ E +TA  LK N+ LKLPD++++S+ L SE    +  +++ F + L++N++ST  FGRFLIWSP + ST DV+SHNFS +P+G+
Subjt:  MDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIPSTQDVISHNFSNLPLGA

Query:  VCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGGILCTSTYRSKKFNVT
        VC +D+Q KGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYV+SLA+TEA+KD+ ++KGL Y D+KIKWPNDLY+N LK+GGILCTSTYRS+KF V+
Subjt:  VCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGGILCTSTYRSKKFNVT

Query:  AGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQ
         G+GLNVDN++P+TCLN  L ++       ++E+IL  FF KFE  +D+F+ QGF++LEELYY+TWLHSGQRVI +EK EDQVV+NVVTIQ
Subjt:  AGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQ

AT1G37150.4 holocarboxylase synthetase 2 (e value: 1.5e-107; % identity: 68.53)
Query:  SPRIPSTQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYV
        SP + ST DV+SHNFS +P+G+VC +D+Q KGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYV+SLA+TEA+KD+ ++KGL Y D+KIKWPNDLY+
Subjt:  SPRIPSTQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYV

Query:  NDLKVGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEK
        N LK+GGILCTSTYRS+KF V+ G+GLNVDN++P+TCLN  L ++       ++E+IL  FF KFE  +D+F+ QGF++LEELYY+TWLHSGQRVI +EK
Subjt:  NDLKVGGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEK

Query:  KEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKL
         EDQVV+NVVTIQGLTSSGYLLAIGDDN M ELHPDGNS DFFKGL++ KL
Subjt:  KEDQVVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKL

AT2G25710.1 holocarboxylase synthase 1 (e value: 2.5e-134; % identity: 64.16)
Query:  RERKRLFSLS---SASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIP
        R  K L  LS   SAS M+    C+LVL GK++ E E AK LK  ++LKLPD+T++S++L SE    +  ++N F + L++N++ T  FGRFLIWSPR+ 
Subjt:  RERKRLFSLS---SASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIP

Query:  STQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKV
        ST DV+SHNFS LP+G+VC  D+QFKGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYV+SLA+TEA+KD+ ++KGLPYID+KIKWPNDLYVN LKV
Subjt:  STQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKV

Query:  GGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQV
        GGILCTSTYRSKKFNV+ G+GLNVDN +P+TCLN  L  ++      ++E+IL  FF+KFE+ +D+F++QGF++LEELYY+TWLHS QRVIV++K EDQV
Subjt:  GGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQV

Query:  VENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKL
        V+NVVTIQGLTSSGYLLA+GDDNQM ELHPDGNS DFFKGL++ K+
Subjt:  VENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKL

AT2G25710.2 holocarboxylase synthase 1 (e value: 2.5e-134; % identity: 64.16)
Query:  RERKRLFSLS---SASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIP
        R  K L  LS   SAS M+    C+LVL GK++ E E AK LK  ++LKLPD+T++S++L SE    +  ++N F + L++N++ T  FGRFLIWSPR+ 
Subjt:  RERKRLFSLS---SASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPL--EENEFRIDLYLNALSTDTFGRFLIWSPRIP

Query:  STQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKV
        ST DV+SHNFS LP+G+VC  D+QFKGRGR+KN+WESP GCLM+SFT++MEDGR+VPL+QYV+SLA+TEA+KD+ ++KGLPYID+KIKWPNDLYVN LKV
Subjt:  STQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKV

Query:  GGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQV
        GGILCTSTYRSKKFNV+ G+GLNVDN +P+TCLN  L  ++      ++E+IL  FF+KFE+ +D+F++QGF++LEELYY+TWLHS QRVIV++K EDQV
Subjt:  GGILCTSTYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQV

Query:  VENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKL
        V+NVVTIQGLTSSGYLLA+GDDNQM ELHPDGNS DFFKGL++ K+
Subjt:  VENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKL
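
In the alignment blocks above, the unlabeled middle row carries a letter only where the residues are identical, a `+` for a conservative substitution, and a space otherwise. As a rough sketch, identical columns in one block can be counted from the query row and the middle row as follows. The function name `block_identity` is hypothetical, and the 8-character row prefix is an assumption taken from the `Query:  ` label used on this page. The % identity values listed per hit are presumably computed over the whole alignment, so a single block will not necessarily reproduce them.

```python
def block_identity(query_row, midline_row):
    """Return (identical_columns, aligned_columns) for one alignment block."""
    offset = len("Query:  ")  # assumed fixed-width row label
    query = query_row[offset:]
    # Pad the midline in case trailing spaces were stripped during extraction.
    midline = midline_row[offset:].ljust(len(query))
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return identical, len(query)

# Final block of the first GenBank hit (XP_004150602.2), copied from above.
query_row = "Query:  VVENVVTIQGLTSSGYLLAIGDDNQMCELHPDGNSLDFFKGLIKSKLV"
midline_row = "        VVENVVTIQGLTSSGYLLAIGDD QMCELHPDGNSLDFFKGLIKSKLV"
identical, aligned = block_identity(query_row, midline_row)
print(identical, aligned, round(100.0 * identical / aligned, 2))
```

Summing the two counts over every block of a hit before dividing gives a whole-alignment figure.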


Sequences
CDS sequence
ATGTCGGTTGCTTACAGCTCTCGACAATCTCGCTCTCGATGGCTCTCTCTCGTCGACTCTCTGGATCTCGATAGCTCTGGCGATCTCGCTCTCGATCTTGCTCTCGAGCT
TATTGTTCTCGGTTCTCCACTCTCGATCTCAATCTTTATCTATATTCGTTTGGACGTCCGACGGTACCTGTACACAGCTGAACCAGCGAGGGGGGCGCGACCCCGACACG
AACGGCGGCTCCGACCAACGTTCTCTGACGACAGCCACGAACAGCTCCCGGCGACATCTGCAGCGACCTTCATCTCCTTCGGCACCGAAGCAACCTGCAACGTTGTTGCG
GCCACACAGCTTTGGCGAACCAGCGGCAACAACATAGAAACTCCCATTAACGAGAGGGAGAGAAAGAGATTGTTCTCTCTATCATCTGCTTCAGGAATGGACATTAGTCC
AATCTGTGCATTGGTTTTGAGTGGGAAAACAGCAGCTGAGAACGAAACTGCCAAATTGCTGAAGAGGAATGACACTTTGAAGCTCCCTGATGATACTGAAATTTCAGTTT
TATTGCACTCAGAAAGGGATAAACCTTTGGAGGAAAATGAGTTTCGAATTGATTTGTATCTGAATGCTCTATCGACTGATACTTTTGGTAGATTTCTGATTTGGTCTCCG
CGAATTCCTTCTACCCAGGATGTTATTTCTCACAACTTTAGCAACCTTCCACTGGGTGCTGTTTGTGCGGCTGATGTTCAGTTCAAGGGAAGAGGTCGATCGAAGAATTT
GTGGGAATCTCCCCCTGGTTGCCTTATGTTTTCCTTTACCATTCAAATGGAAGATGGGCGTATTGTTCCTCTATTACAGTATGTAATATCTCTCGCTATGACTGAGGCCA
TAAAAGATATTTCCAACGAAAAGGGGTTACCCTATATTGATTTGAAAATAAAGTGGCCAAATGATCTTTATGTGAATGACCTGAAAGTTGGAGGCATTCTATGTACTTCA
ACATATAGATCGAAGAAGTTCAACGTTACTGCTGGTATAGGTTTGAATGTAGACAACGACGAACCATCAACATGCTTGAATGAAGCTCTTTCAAATTTATCCAGTACACC
TTACAAGTTCAGAAAAGAGGATATTTTAGCATTCTTCTTTAACAAATTTGAAAGGTTGTATGATGTTTTCATAAATCAAGGGTTTCGGGCTCTTGAGGAACTCTACTATC
AGACATGGCTGCACAGTGGACAAAGAGTTATTGTACAGGAAAAGAAAGAAGATCAAGTTGTGGAAAATGTAGTCACTATTCAGGGGTTGACATCTTCAGGATATTTGCTA
GCTATTGGAGATGACAACCAAATGTGCGAACTTCATCCCGATGGAAATAGTTTGGACTTTTTCAAAGGACTAATCAAGAGCAAACTGGTGTAA
mRNA sequence
ATGTCGGTTGCTTACAGCTCTCGACAATCTCGCTCTCGATGGCTCTCTCTCGTCGACTCTCTGGATCTCGATAGCTCTGGCGATCTCGCTCTCGATCTTGCTCTCGAGCT
TATTGTTCTCGGTTCTCCACTCTCGATCTCAATCTTTATCTATATTCGTTTGGACGTCCGACGGTACCTGTACACAGCTGAACCAGCGAGGGGGGCGCGACCCCGACACG
AACGGCGGCTCCGACCAACGTTCTCTGACGACAGCCACGAACAGCTCCCGGCGACATCTGCAGCGACCTTCATCTCCTTCGGCACCGAAGCAACCTGCAACGTTGTTGCG
GCCACACAGCTTTGGCGAACCAGCGGCAACAACATAGAAACTCCCATTAACGAGAGGGAGAGAAAGAGATTGTTCTCTCTATCATCTGCTTCAGGAATGGACATTAGTCC
AATCTGTGCATTGGTTTTGAGTGGGAAAACAGCAGCTGAGAACGAAACTGCCAAATTGCTGAAGAGGAATGACACTTTGAAGCTCCCTGATGATACTGAAATTTCAGTTT
TATTGCACTCAGAAAGGGATAAACCTTTGGAGGAAAATGAGTTTCGAATTGATTTGTATCTGAATGCTCTATCGACTGATACTTTTGGTAGATTTCTGATTTGGTCTCCG
CGAATTCCTTCTACCCAGGATGTTATTTCTCACAACTTTAGCAACCTTCCACTGGGTGCTGTTTGTGCGGCTGATGTTCAGTTCAAGGGAAGAGGTCGATCGAAGAATTT
GTGGGAATCTCCCCCTGGTTGCCTTATGTTTTCCTTTACCATTCAAATGGAAGATGGGCGTATTGTTCCTCTATTACAGTATGTAATATCTCTCGCTATGACTGAGGCCA
TAAAAGATATTTCCAACGAAAAGGGGTTACCCTATATTGATTTGAAAATAAAGTGGCCAAATGATCTTTATGTGAATGACCTGAAAGTTGGAGGCATTCTATGTACTTCA
ACATATAGATCGAAGAAGTTCAACGTTACTGCTGGTATAGGTTTGAATGTAGACAACGACGAACCATCAACATGCTTGAATGAAGCTCTTTCAAATTTATCCAGTACACC
TTACAAGTTCAGAAAAGAGGATATTTTAGCATTCTTCTTTAACAAATTTGAAAGGTTGTATGATGTTTTCATAAATCAAGGGTTTCGGGCTCTTGAGGAACTCTACTATC
AGACATGGCTGCACAGTGGACAAAGAGTTATTGTACAGGAAAAGAAAGAAGATCAAGTTGTGGAAAATGTAGTCACTATTCAGGGGTTGACATCTTCAGGATATTTGCTA
GCTATTGGAGATGACAACCAAATGTGCGAACTTCATCCCGATGGAAATAGTTTGGACTTTTTCAAAGGACTAATCAAGAGCAAACTGGTGTAAGACTACTGCTTTTTCAC
TTTTCAACAAGTGCTGACTATGAGCTGTATGGAATGGACTCATGTGCAATAAGAAGGGGACATTGGCTTTCTTTTATTTTTAGGTAATAATTCTCTCTGTTTCGAAAGTG
GAGGGAAGAGAGAGAGATATCTTACTTTTTATGATTAAACAATGAGAACTCAACTGAAGTTGTAAATATTGTTATCTTTAAACTTACTAAGGAGGGTCCATGGCCCAGAA
CGTGCTATATTTTTGTATCATCCGAATGAACGTGAAGGAATTCCCAGTGGGACAATTTCCAATTGATTAAACTGTCCCCACAATTGGTATAATCAATCGGTCAGCAGTTG
TTTTTTGTTAGAAACTTGCGAAACTCTCTGCCACTTTATTGCCAGCTGTTTGAGCTTGAAACCGAAACTCCGTTATGACAGAAGGTGTTGTGTTGTAAATCGTAAAGATA
CGTTGTGATGGAGAAATGGTTTCAACTTATTTGAACTTTACAACTGTGAAATTGAGTGAAAAGAAACAGAAGGGCAAAAGAATCTTGTGGGGAGGCGCCATGTCAGACCT
AGAAACTGGTATGTCTTCTTCTGTCTATTTCCTCATATACGGCTATTGAGCGTTTTGTTTAGGGCTCACTATCAGAAGAGAGAAAATGGGTTGACTTACCACATGCAGAG
CAGAGGCTGGATTGGATAAACAAAAAAGGCCCCATTATAAGTTAGTGCCTAATAGTTTCATGTCTTTAACTTCGTCAATATTACCTATGAATTTATCATGTTCGTGATGT
GCTAACTATATTATTAAAGCTACATGGCAGTCTGATTCTGAACTAGTTAAGGTGTTTGGTAGTGTAAAAACTTTGGGATGGCCCATGTCTTAAAAGGGCAGGTTTGTCTG
GTTTTTGGGAGGCTGGGGGCCATGGAAGAGGAAGACCAGAGTGGACTGATAGTTATATGTGGAGTTGGGTAAACAAAAGCGGAGTCATAATGAGCTGCAGCTGTTGCAGC
CTTTTGTCTGTCTTCGGCTTACCTCACTCTCAATAATGCGGAAGATAAAAGATGTAGTACTATTAGTACATACTATTCTAGTTCATTATTTGAC
Protein sequence
MSVAYSSRQSRSRWLSLVDSLDLDSSGDLALDLALELIVLGSPLSISIFIYIRLDVRRYLYTAEPARGARPRHERRLRPTFSDDSHEQLPATSAATFISFGTEATCNVVA
ATQLWRTSGNNIETPINERERKRLFSLSSASGMDISPICALVLSGKTAAENETAKLLKRNDTLKLPDDTEISVLLHSERDKPLEENEFRIDLYLNALSTDTFGRFLIWSP
RIPSTQDVISHNFSNLPLGAVCAADVQFKGRGRSKNLWESPPGCLMFSFTIQMEDGRIVPLLQYVISLAMTEAIKDISNEKGLPYIDLKIKWPNDLYVNDLKVGGILCTS
TYRSKKFNVTAGIGLNVDNDEPSTCLNEALSNLSSTPYKFRKEDILAFFFNKFERLYDVFINQGFRALEELYYQTWLHSGQRVIVQEKKEDQVVENVVTIQGLTSSGYLL
AIGDDNQMCELHPDGNSLDFFKGLIKSKLV
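
The protein sequence above should correspond to a translation of the CDS sequence. A minimal sketch of that check is given below, assuming the standard genetic code (NCBI translation table 1). The helper name `translate` is hypothetical, and only the first ten and last six codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTCGGTTGCTTACAGCTCTCGACAATCT"  # first 10 codons of the CDS
cds_end = "AAGAGCAAACTGGTGTAA"                # last 6 codons, including the stop
print(translate(cds_start), translate(cds_end))
```

Passing the full CDS text to `translate` and comparing the result with the full protein sequence extends the same check to the whole record.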