; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0014323 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0014323
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionEnzymatic polyprotein
Genome locationchr03:640442..644598
RNA-Seq ExpressionIVF0014323
SyntenyIVF0014323
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037024.1 polyprotein [Cucumis melo var. makuwa]0.094.96Show/hide
Query:  MGSLNTIKYDSPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRRMRFTGNLRSWWHNHLTD
        MGSLNTIKYDSPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDE                         FTGNLRSWWHNHLTD
Subjt:  MGSLNTIKYDSPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRRMRFTGNLRSWWHNHLTD

Query:  ANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCRDVVWKHKYVEGLPKY
        ANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCRDVVWKHKYVEGLPKY
Subjt:  ANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCRDVVWKHKYVEGLPKY

Query:  VKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSRSKKVFKRSKTSESHGYSNRKRK
        VKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSRSKKVFKRSKTSESHGYSNRKRK
Subjt:  VKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSRSKKVFKRSKTSESHGYSNRKRK

Query:  YHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSENETDSEEAIS
        YHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSENETDSEEAIS
Subjt:  YHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSENETDSEEAIS

Query:  YFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVV
        YFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVV
Subjt:  YFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVV

Query:  AIQEQIISKENIETQN
        AIQEQIISKENIETQ+
Subjt:  AIQEQIISKENIETQN

KAA0050625.1 Enzymatic polyprotein [Cucumis melo var. makuwa]1.01e-25187.74Show/hide
Query:  FTGNLRSWWHNHLTDANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCR
        FTGNLRSWWHNHLTDANREAIKDAV EK EQGPNGDVV  QPNS+NTLVY  IKHF+GRTTLYSDQ +E LLGMKCPKMSDFKWYKD FMSRLYNL TCR
Subjt:  FTGNLRSWWHNHLTDANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCR

Query:  DVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSRSKKVFKR
        DVVWKHKYVEGLPKYV+EKFYSTMVTNSG TDIDWE IS  DINSTIQKVCLEI QQQKHATKIAKDSDYR+++ SFCKQY IDNTPSRKKS SKKVF+R
Subjt:  DVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSRSKKVFKR

Query:  SKTSESHGYSNRKRKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEE
        SKTSESHGYSNRKRKYHGK +NKRK FKENTTCFKCNKKGHYANRCPVV+KIN MEIDED+KQSL KV+KAEELSSEE+EFS EKEEDLLNVLLEESSEE
Subjt:  SKTSESHGYSNRKRKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEE

Query:  PSSSENETDSEEAISYFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEI
        PSSS+NETD EE IS  GCI+VLTSTQK LLDIIEEVDDEAIRK ILLRLREDLETPDQKFKDPMNFSFQ VMNLL KEHAAPVKV DL HEI+VLK E+
Subjt:  PSSSENETDSEEAISYFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEI

Query:  NENKQQISYLKNAVVAIQEQIISK
        NENKQQISYL+NAVVAIQEQIISK
Subjt:  NENKQQISYLKNAVVAIQEQIISK

KAA0056776.1 Enzymatic polyprotein [Cucumis melo var. makuwa]3.64e-25641.61Show/hide
Query:  LNTIKYDS------PKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRR-----------MRF
        +N IK DS       KILP+  + IDMKNHY +PSP DLGWDDL  + R++DG ++ TWNID  SE Q+M  FQE+ LAAT+Y  +           + F
Subjt:  LNTIKYDS------PKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRR-----------MRF

Query:  TGNLRSWWHNHLTDANREAIKDAV--LEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTC
         GNLRSWWHN LT+ +R+ I  A   + KTE   +  +   +P+ VN L+Y   KHFIG T ++ +   E LLG+KC KMS +KWYKDTFM+RLY L TC
Subjt:  TGNLRSWWHNHLTDANREAIKDAV--LEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTC

Query:  RDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSR------
           +WK K+VEGLP Y+ +KFY TM  NS    IDW  ++ GDI+ST+Q +C+ +  + KH TK+ KDSDYR+++G+FCKQY +   P  +K +      
Subjt:  RDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSR------

Query:  SKKVFKRSKTSESHGYSNRKRKYHGKSRNKRKFF-KENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNV
        SKK F++SK  +      R+R+++ K ++K+ +  K +T CFKCN+KGHYANRCP+  KIN M IDE+ KQSL   I++++ ++ + E S E  ED +N+
Subjt:  SKKVFKRSKTSESHGYSNRKRKYHGKSRNKRKFF-KENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNV

Query:  LLEESS---EEPSSSENETDSEEAISYFG-----C---INVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLE--TPDQKFKDPMNFSFQAVMNLLPKE
        L EE S   EE  S  + +D E AI   G     C   INV+T  Q+ L D+IE++ DE  ++  LL+L++ LE   P +  ++P+ +S+Q ++N +  E
Subjt:  LLEESS---EEPSSSENETDSEEAISYFG-----C---INVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLE--TPDQKFKDPMNFSFQAVMNLLPKE

Query:  HAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKE------------------------------------------------------
           P++V DL+HE++ LK+E+ ENKQ++ YL+ A  A Q    SKE                                                      
Subjt:  HAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKE------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------NIETQ------------------------------------------------------------------------NNFDETKTPTKARPI
                NIE +                                                                        + F E + PTKARPI
Subjt:  --------NIETQ------------------------------------------------------------------------NNFDETKTPTKARPI

Query:  QMSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINS
        QM+KDLV  C  EI++L  KGLISPSKSPWSC+AFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRI  AK+FSKFD+KS FWQIQI+ 
Subjt:  QMSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINS

Query:  KDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK--------
         D+YKTAFNVPFGQ++WNVM FGLKNAPSEF+KIMNDIFN++Q+FTIVYIDDV++FS T+D+HFKHL +F+N+IK+NGLVVS PK+KLFQTK        
Subjt:  KDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK--------

Query:  -----------------------------KYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDAS
                                     +++GCVNYIGDFI++ R+ICLP YDRLKKNPKPWT+EH+RAVQSIKSLAK IPCLSLVD +AK+II+TDAS
Subjt:  -----------------------------KYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDAS

Query:  EKEL--IQKQALATALRIIYRESTSQNISSQN
        +     I KQ L   + ++   S   N + +N
Subjt:  EKEL--IQKQALATALRIIYRESTSQNISSQN

TYJ97599.1 Enzymatic polyprotein [Cucumis melo var. makuwa]3.13e-25841.91Show/hide
Query:  LNTIKYDS------PKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRR-----------MRF
        +N IK DS       KILP+  + IDMKNHY +PSP DLGWDDL  + R++DG ++ TWNID  SE Q+M  FQE+ LAAT+Y  +           + F
Subjt:  LNTIKYDS------PKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRR-----------MRF

Query:  TGNLRSWWHNHLTDANREAIKDAV--LEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTC
         GNLRSWWHN LT+ +R+ I  A   + KTE   +  +   +P+ VN L+Y   KHFIG T ++ +   E LLG+KC KMS +KWYKDTFM+RLY L TC
Subjt:  TGNLRSWWHNHLTDANREAIKDAV--LEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTC

Query:  RDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSR------
           +WK K+VEGLP Y+ +KFY TM  NS    IDW  ++ GDI+ST+Q +C+ +  + KH TK+ KDSDYR+++G+FCKQY +   P  +K +      
Subjt:  RDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSR------

Query:  SKKVFKRSKTSESHGYSNRKRKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVL
        SKK F++SKT +      RKR Y+     K    K +T CFKCN+KGHYANRCP+  KIN M IDE+ KQSL   I++++ ++ + E S E  ED +N+L
Subjt:  SKKVFKRSKTSESHGYSNRKRKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVL

Query:  LEESS---EEPSSSENETDSEEAISYFG-----C---INVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLE--TPDQKFKDPMNFSFQAVMNLLPKEH
         EE S   EE  S  + +D E AI   G     C   INV+T  Q+ L D+IE++ DE  ++  LL+L++ LE   P +  ++P+ +S+Q ++N +  E 
Subjt:  LEESS---EEPSSSENETDSEEAISYFG-----C---INVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLE--TPDQKFKDPMNFSFQAVMNLLPKEH

Query:  AAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKE-------------------------------------------------------
          P++V DL+HE++ LK+E+ ENKQ++ YL+ A  A Q    SKE                                                       
Subjt:  AAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKE-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------NIETQ------------------------------------------------------------------------NNFDETKTPTKARPIQ
               NIE +                                                                        + F E + PTKARPIQ
Subjt:  -------NIETQ------------------------------------------------------------------------NNFDETKTPTKARPIQ

Query:  MSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSK
        M+KDLV  C  EI++L  KGLISPSKSPWSC+AFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRI  AK+FSKFD+KS FWQIQI+  
Subjt:  MSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSK

Query:  DKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK---------
        D+YKTAFNVPFGQ++WNVM FGLKNAPSEF+KIMNDIFN++Q+FTIVYIDDV++FS T+D+HFKHL +F+N+IK+NGLVVS PK+KLFQTK         
Subjt:  DKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK---------

Query:  ----------------------------KYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE
                                    +++GCVNYIGDFI+D R+ICLP YDRLKKNPKPWT+EH+RAVQSIKSLAK IPCLSLVD +AK+II+TDAS+
Subjt:  ----------------------------KYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE

Query:  KEL--IQKQALATALRIIYRESTSQNISSQN
             I KQ L   + ++   S   N + +N
Subjt:  KEL--IQKQALATALRIIYRESTSQNISSQN

TYK23160.1 hypothetical protein E5676_scaffold142G001850 [Cucumis melo var. makuwa]1.41e-26649.8Show/hide
Query:  SLNTIKYDSPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRRMRFTGNLRSWWHNHLTDAN
        SLNTI YDSPKILPI VY+ DMKNHYKR SP DLG D LRPDFRSFDGNNIETWNID CSEGQ+MIIFQE+                   WWHNHLTDAN
Subjt:  SLNTIKYDSPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRRMRFTGNLRSWWHNHLTDAN

Query:  REAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCRDVVWKHKYVEGLPKYVK
        RE IKDAVLEKTEQ  NGDVVT QPNSVNT+VY AIKHF+GRTTLYSDQ ME LLGMKCPKMSDFKWYKDTFMSRLYN   CRDVVWKHKYVEGL KYV+
Subjt:  REAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCRDVVWKHKYVEGLPKYVK

Query:  EKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTP-SRKKSRSKKVFKRSKTSESHGYSNRKRKY
        EKFYSTMVTNS  TDIDW  IS GDINSTIQKVCLEI QQQKHATKIAKD DYRR++GSFCKQY IDNTP SRKKSRSKK+  RSKTSESHGYSNRK KY
Subjt:  EKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTP-SRKKSRSKKVFKRSKTSESHGYSNRKRKY

Query:  HGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSENETDSEEAISY
        HG++                                                   EELSSEEEE SFEKEEDLLN LLEESSEEPSSSENETD+EEAI  
Subjt:  HGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSENETDSEEAISY

Query:  FGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVA
          CINVLTSTQKGLLDIIEEVDDEAIRKKILLRL EDLETP+QKFKDPMNFSFQ V NLL K                                      
Subjt:  FGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVA

Query:  IQEQIISKENIETQNNFDETKTPTKARPIQMSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNR
                                                                                                            
Subjt:  IQEQIISKENIETQNNFDETKTPTKARPIQMSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNR

Query:  QDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFI
          LL R+                                                                                             
Subjt:  QDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFI

Query:  NIIKTNGLVVSLPKVKLFQTKKYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASEKELIQKQ
                                                      RLKKNPKPWT EHSRAVQSIKSL KGIPCLSL+D KAKMI+ETDA         
Subjt:  NIIKTNGLVVSLPKVKLFQTKKYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASEKELIQKQ

Query:  ALATALRIIYRESTSQNISSQNPQVPQKMLPNSDRPRPPRRPPPPLPSNQPSPTNSQPASSSRASSSKGKWPISQASALTPMCADNYAMDLSFQTVSRRR
                                                              NSQ ASSSR SSS GK PISQ SAL PMCA+NYAMDLSFQ VSRRR
Subjt:  ALATALRIIYRESTSQNISSQNPQVPQKMLPNSDRPRPPRRPPPPLPSNQPSPTNSQPASSSRASSSKGKWPISQASALTPMCADNYAMDLSFQTVSRRR

Query:  QGSSQRNLTNGSTPYPSTTLLRPISAATSNRLASRPQSYAQTIKPAVFMPRPPVTGYQTKTTLEDVVIEPEFDGPSVPEISSQVYPHGFNFFPEDIKK
        QGSSQRN+T GST  PST LLRPIS  TSNR +SRPQSYA+T+KPAVFMPRPPVTGYQTKTTLEDVVIEPEFDGP + EI SQVYPHGFNFF + ++K
Subjt:  QGSSQRNLTNGSTPYPSTTLLRPISAATSNRLASRPQSYAQTIKPAVFMPRPPVTGYQTKTTLEDVVIEPEFDGPSVPEISSQVYPHGFNFFPEDIKK

TrEMBL top hitse value%identityAlignment
A0A5A7UR29 Enzymatic polyprotein8.3e-22341.42Show/hide
Query:  LNTIKYDS------PKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRR-----------MRF
        +N IK DS       KILP+  + IDMKNHY +PSP DLGWDDL  + R++DG ++ TWNID  SE Q+M  FQE+ LAAT+Y  +           + F
Subjt:  LNTIKYDS------PKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRR-----------MRF

Query:  TGNLRSWWHNHLTDANREAIKDA--VLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTC
         GNLRSWWHN LT+ +R+ I  A   + KTE   +  +   +P+ VN L+Y   KHFIG T ++ +   E LLG+KC KMS +KWYKDTFM+RLY L TC
Subjt:  TGNLRSWWHNHLTDANREAIKDA--VLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTC

Query:  RDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSR------
           +WK K+VEGLP Y+ +KFY TM  NS    IDW  ++ GDI+ST+Q +C+ +  + KH TK+ KDSDYR+++G+FCKQY +   P  +K +      
Subjt:  RDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSR------

Query:  SKKVFKRSKTSESHGYSNRKRKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVL
        SKK F++SK  +      R+R Y+     K    K +T CFKCN+KGHYANRCP+  KIN M IDE+ KQSL   I++++ ++ + E S   EED +N+L
Subjt:  SKKVFKRSKTSESHGYSNRKRKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVL

Query:  LEE---SSEEPSSSENETDSEEAI--------SYFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLE--TPDQKFKDPMNFSFQAVMNLLPKEH
         EE   S EE  S  + +D E AI           G INV+T  Q+ L D+IE++ DE  ++  LL+L++ LE   P +  ++P+ +S+Q ++N +  E 
Subjt:  LEE---SSEEPSSSENETDSEEAI--------SYFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLE--TPDQKFKDPMNFSFQAVMNLLPKEH

Query:  AAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKE-------------------------------------------------------
          P++V DL+HE++ LK+E+ ENKQ++ YL+ A  A Q    SKE                                                       
Subjt:  AAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKE-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------NIET------------------------------------------------------------------------QNNFDETKTPTKARPIQ
               NIE                                                                         ++ F E + PTKARPIQ
Subjt:  -------NIET------------------------------------------------------------------------QNNFDETKTPTKARPIQ

Query:  MSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSK
        M+KDLV  C  EI++L  KGLISPSKSPWSC+AFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRI  AK+FSKFD+KS FWQIQI+  
Subjt:  MSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSK

Query:  DKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK---------
        D+YKTAFNVPFGQ++WNVM FGLKNAPSEF+KIMNDIFN++Q+FTIVYIDDV++FS T+D+HFKHL +F+N+IK+NGLVVS PK+KLFQTK         
Subjt:  DKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK---------

Query:  ----------------------------KYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE
                                    +++GCVNYIGDFI++ R+ICLP YDRLKKNPKPWT+EH+RAVQSIKSLAK IPCLSLVD +AK+II+TDAS+
Subjt:  ----------------------------KYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE

Query:  KEL--IQKQALATALRIIYRESTSQNISSQNPQVPQK
             I KQ L   + ++   S   N + +N    +K
Subjt:  KEL--IQKQALATALRIIYRESTSQNISSQNPQVPQK

A0A5A7URX9 Enzymatic polyprotein1.9e-20641.59Show/hide
Query:  LNTIKYD------SPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRR-----------MRF
        +N IK D      + KILP+  + +DMKNHY +PSP DLGWDDL  + R++DG ++ TWN D   E Q+M  FQE+ LAAT+Y  +           + F
Subjt:  LNTIKYD------SPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRR-----------MRF

Query:  TGNLRSWWHNHLTDANREAIKDA--VLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTC
         GNLRSWWHN LT+ +R+ I  A   + KTE   +  +   +P+ VN L+Y   KHFIG T ++ +   E LLG+KC KMS +KWYKDTFM+RLY L TC
Subjt:  TGNLRSWWHNHLTDANREAIKDA--VLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTC

Query:  RDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTP-----SRKKSRS
           +WK K+VEGLP Y+ +KFY TM  NS    IDW  ++ GDI+ST+Q +C+ +  + KH TK+ KDSDYR+++G+FCKQY +   P      +KK  S
Subjt:  RDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTP-----SRKKSRS

Query:  KKVFKRSKTSESHGYSNRKRKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLL
        KK F+RSK  +      RK  Y+     KR   K NT CFKCN+KGHYANRCP+  KIN + IDE  KQS+   I++++ +S + E S   EED +N+L 
Subjt:  KKVFKRSKTSESHGYSNRKRKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLL

Query:  EE---SSEEPSSSENETDSEEAI--------SYFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLE--TPDQKFKDPMNFSFQAVMNLLPKEHA
        EE   S EE  S  + +D E AI          FG INV+T  Q+ L D+IE++ DE  ++  LL+L++ LE   P +  ++ + + +Q + N +  E  
Subjt:  EE---SSEEPSSSENETDSEEAI--------SYFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLE--TPDQKFKDPMNFSFQAVMNLLPKEHA

Query:  APVKVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKENIETQNN-------------------------------------------------
         P++V DL+HE+++LK+E+ ENKQ++ YL+NA  A QE  + KEN ET  N                                                 
Subjt:  APVKVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKENIETQNN-------------------------------------------------

Query:  ------------------FDETK---------------------------------------------------------------TPTKARPIQMSKDL
                          F++TK                                                                  K + I+  +D 
Subjt:  ------------------FDETK---------------------------------------------------------------TPTKARPIQMSKDL

Query:  VDTCFREI-----------------------SDL------QKKGLIS--------PSKSPWSCAAFYVNNQ-AEKERGVPRLVINYKPLNKVLKWIRYPI
        + TC   I                       S+L      +KK +++         ++ P       +N   AEKERGVPRLVINYKPLNKVLKWIRYPI
Subjt:  VDTCFREI-----------------------SDL------QKKGLIS--------PSKSPWSCAAFYVNNQ-AEKERGVPRLVINYKPLNKVLKWIRYPI

Query:  PNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLN
         NRQDLLKRI  AK+FSKFD+KS FWQIQI+  D YKTAFNVPF Q++WNVM FGLKNAP EF+KIMNDIFN++Q+FTIVYIDDV++FS T+D+HFKHL 
Subjt:  PNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLN

Query:  MFINIIKTNGLVVSLPKVKLFQTK-------------------------------------KYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHS
        +F+N+IK+NGLVVS PK+KLFQTK                                     +++GCVNYIGDFI+D R+ICLP YDRLKKNPKPWT+EH+
Subjt:  MFINIIKTNGLVVSLPKVKLFQTK-------------------------------------KYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHS

Query:  RAVQSIKSLAKGIPCLSLVDYKAKMIIETDASEKEL--IQKQALATALRIIYRESTSQNISSQNPQVPQK
        RAV+SIKSLAK IPCLSLVD +AK+II+TDAS+     I KQ L   + I+   S   N + +N    +K
Subjt:  RAVQSIKSLAKGIPCLSLVDYKAKMIIETDASEKEL--IQKQALATALRIIYRESTSQNISSQNPQVPQK

A0A5D3BEY3 Enzymatic polyprotein2.0e-22441.69Show/hide
Query:  LNTIKYDS------PKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRR-----------MRF
        +N IK DS       KILP+  + IDMKNHY +PSP DLGWDDL  + R++DG ++ TWNID  SE Q+M  FQE+ LAAT+Y  +           + F
Subjt:  LNTIKYDS------PKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRR-----------MRF

Query:  TGNLRSWWHNHLTDANREAIKDA--VLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTC
         GNLRSWWHN LT+ +R+ I  A   + KTE   +  +   +P+ VN L+Y   KHFIG T ++ +   E LLG+KC KMS +KWYKDTFM+RLY L TC
Subjt:  TGNLRSWWHNHLTDANREAIKDA--VLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTC

Query:  RDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSR------
           +WK K+VEGLP Y+ +KFY TM  NS    IDW  ++ GDI+ST+Q +C+ +  + KH TK+ KDSDYR+++G+FCKQY +   P  +K +      
Subjt:  RDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSR------

Query:  SKKVFKRSKTSESHGYSNRKRKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVL
        SKK F++SKT +      RKR Y+     K    K +T CFKCN+KGHYANRCP+  KIN M IDE+ KQSL   I++++ ++ + E S   EED +N+L
Subjt:  SKKVFKRSKTSESHGYSNRKRKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVL

Query:  LEE---SSEEPSSSENETDSEEAI--------SYFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLE--TPDQKFKDPMNFSFQAVMNLLPKEH
         EE   S EE  S  + +D E AI           G INV+T  Q+ L D+IE++ DE  ++  LL+L++ LE   P +  ++P+ +S+Q ++N +  E 
Subjt:  LEE---SSEEPSSSENETDSEEAI--------SYFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLE--TPDQKFKDPMNFSFQAVMNLLPKEH

Query:  AAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKE-------------------------------------------------------
          P++V DL+HE++ LK+E+ ENKQ++ YL+ A  A Q    SKE                                                       
Subjt:  AAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKE-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------NIET------------------------------------------------------------------------QNNFDETKTPTKARPIQ
               NIE                                                                         ++ F E + PTKARPIQ
Subjt:  -------NIET------------------------------------------------------------------------QNNFDETKTPTKARPIQ

Query:  MSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSK
        M+KDLV  C  EI++L  KGLISPSKSPWSC+AFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRI  AK+FSKFD+KS FWQIQI+  
Subjt:  MSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSK

Query:  DKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK---------
        D+YKTAFNVPFGQ++WNVM FGLKNAPSEF+KIMNDIFN++Q+FTIVYIDDV++FS T+D+HFKHL +F+N+IK+NGLVVS PK+KLFQTK         
Subjt:  DKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK---------

Query:  ----------------------------KYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE
                                    +++GCVNYIGDFI+D R+ICLP YDRLKKNPKPWT+EH+RAVQSIKSLAK IPCLSLVD +AK+II+TDAS+
Subjt:  ----------------------------KYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE

Query:  KEL--IQKQALATALRIIYRESTSQNISSQNPQVPQK
             I KQ L   + ++   S   N + +N    +K
Subjt:  KEL--IQKQALATALRIIYRESTSQNISSQNPQVPQK

A0A5D3C457 Polyprotein6.5e-27694.96Show/hide
Query:  MGSLNTIKYDSPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRRMRFTGNLRSWWHNHLTD
        MGSLNTIKYDSPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDE                         FTGNLRSWWHNHLTD
Subjt:  MGSLNTIKYDSPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRRMRFTGNLRSWWHNHLTD

Query:  ANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCRDVVWKHKYVEGLPKY
        ANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCRDVVWKHKYVEGLPKY
Subjt:  ANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCRDVVWKHKYVEGLPKY

Query:  VKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSRSKKVFKRSKTSESHGYSNRKRK
        VKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSRSKKVFKRSKTSESHGYSNRKRK
Subjt:  VKEKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSRSKKVFKRSKTSESHGYSNRKRK

Query:  YHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSENETDSEEAIS
        YHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSENETDSEEAIS
Subjt:  YHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSENETDSEEAIS

Query:  YFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVV
        YFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVV
Subjt:  YFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVV

Query:  AIQEQIISKENIETQN
        AIQEQIISKENIETQ+
Subjt:  AIQEQIISKENIETQN

A0A5D3DIF5 Uncharacterized protein5.0e-22049.8Show/hide
Query:  SLNTIKYDSPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRRMRFTGNLRSWWHNHLTDAN
        SLNTI YDSPKILPI VY+ DMKNHYKR SP DLG D LRPDFRSFDGNNIETWNID CSEGQ+MIIFQE+                   WWHNHLTDAN
Subjt:  SLNTIKYDSPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRRMRFTGNLRSWWHNHLTDAN

Query:  REAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCRDVVWKHKYVEGLPKYVK
        RE IKDAVLEKTEQ  NGDVVT QPNSVNT+VY AIKHF+GRTTLYSDQ ME LLGMKCPKMSDFKWYKDTFMSRLYN   CRDVVWKHKYVEGL KYV+
Subjt:  REAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCRDVVWKHKYVEGLPKYVK

Query:  EKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNT-PSRKKSRSKKVFKRSKTSESHGYSNRKRKY
        EKFYSTMVTNS  TDIDW  IS GDINSTIQKVCLEI QQQKHATKIAKD DYRR++GSFCKQY IDNT PSRKKSRSKK+  RSKTSESHGYSNRK KY
Subjt:  EKFYSTMVTNSGRTDIDWEVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNT-PSRKKSRSKKVFKRSKTSESHGYSNRKRKY

Query:  HGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSENETDSEEAISY
        HG++                                                   EELSSEEEE SFEKEEDLLN LLEESSEEPSSSENETD+EEAI  
Subjt:  HGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSENETDSEEAISY

Query:  FGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVA
          CINVLTSTQKGLLDIIEEVDDEAIRKKILLRL EDLETP+QKFKDPMNFSFQ V NLL K                                      
Subjt:  FGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVA

Query:  IQEQIISKENIETQNNFDETKTPTKARPIQMSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNR
                                                                                                            
Subjt:  IQEQIISKENIETQNNFDETKTPTKARPIQMSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNR

Query:  QDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFI
          LL R+                                                                                             
Subjt:  QDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFI

Query:  NIIKTNGLVVSLPKVKLFQTKKYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASEKELIQKQ
                                                      RLKKNPKPWT EHSRAVQSIKSL KGIPCLSL+D KAKMI+ETDA         
Subjt:  NIIKTNGLVVSLPKVKLFQTKKYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASEKELIQKQ

Query:  ALATALRIIYRESTSQNISSQNPQVPQKMLPNSDRPRPPRRPPPPLPSNQPSPTNSQPASSSRASSSKGKWPISQASALTPMCADNYAMDLSFQTVSRRR
                                                              NSQ ASSSR SSS GK PISQ SAL PMCA+NYAMDLSFQ VSRRR
Subjt:  ALATALRIIYRESTSQNISSQNPQVPQKMLPNSDRPRPPRRPPPPLPSNQPSPTNSQPASSSRASSSKGKWPISQASALTPMCADNYAMDLSFQTVSRRR

Query:  QGSSQRNLTNGSTPYPSTTLLRPISAATSNRLASRPQSYAQTIKPAVFMPRPPVTGYQTKTTLEDVVIEPEFDGPSVPEISSQVYPHGFNFFPEDIKK
        QGSSQRN+T GST  PST LLRPIS  TSNR +SRPQSYA+T+KPAVFMPRPPVTGYQTKTTLEDVVIEPEFDGP + EI SQVYPHGFNFF + ++K
Subjt:  QGSSQRNLTNGSTPYPSTTLLRPISAATSNRLASRPQSYAQTIKPAVFMPRPPVTGYQTKTTLEDVVIEPEFDGPSVPEISSQVYPHGFNFFPEDIKK

SwissProt top hitse value%identityAlignment
P03554 Enzymatic polyprotein1.1e-4632.02Show/hide
Query:  KVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKENIETQNNFDETKT------------PTKA---RPIQ---MSKDLVDTCFREISDLQKKG
        K+ +   EI +L +    +++++   +  +  I+E +   E + ++N  D  KT            P+KA   +P++   M ++  D   +E+ DL+   
Subjt:  KVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKENIETQNNFDETKT------------PTKA---RPIQ---MSKDLVDTCFREISDLQKKG

Query:  LISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMS
        +I PSKSP    AF VNN+AEK RG  R+V+NYK +NK      Y +PN+ +LL  I   KIFS FD KS FWQ+ ++ + +  TAF  P G YEWNV+ 
Subjt:  LISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMS

Query:  FGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLF--------------------------------
        FGLK APS F++ M++ F   + F  VY+DD+++FS   + H  H+ M +     +G+++S  K +LF                                
Subjt:  FGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLF--------------------------------

Query:  -----QTKKYVGCVNYIGDFIKDPRTICLPFYDRLKKN-PKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE
             Q ++++G + Y  D+I     I  P   +LK+N P  WT E +  +Q +K   +G P L     + K+IIETDAS+
Subjt:  -----QTKKYVGCVNYIGDFIKDPRTICLPFYDRLKKN-PKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE

P03555 Enzymatic polyprotein1.1e-4632.02Show/hide
Query:  KVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKENIETQNNFDETKT------------PTKA---RPIQ---MSKDLVDTCFREISDLQKKG
        K+ +   EI +L +    +++++   +  +  I+E +   E + ++N  D  KT            P+KA   +P++   M ++  D   +E+ DL+   
Subjt:  KVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKENIETQNNFDETKT------------PTKA---RPIQ---MSKDLVDTCFREISDLQKKG

Query:  LISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMS
        +I PSKSP    AF VNN+AEK RG  R+V+NYK +NK      Y +PN+ +LL  I   KIFS FD KS FWQ+ ++ + +  TAF  P G YEWNV+ 
Subjt:  LISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMS

Query:  FGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLF--------------------------------
        FGLK APS F++ M++ F   + F  VY+DD+++FS   + H  H+ M +     +G+++S  K +LF                                
Subjt:  FGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLF--------------------------------

Query:  -----QTKKYVGCVNYIGDFIKDPRTICLPFYDRLKKN-PKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE
             Q ++++G + Y  D+I     I  P   +LK+N P  WT E +  +Q +K   +G P L     + K+IIETDAS+
Subjt:  -----QTKKYVGCVNYIGDFIKDPRTICLPFYDRLKKN-PKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE

Q02964 Enzymatic polyprotein1.4e-4632.02Show/hide
Query:  KVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKENIETQNNFDETKT------------PTKA---RPIQ---MSKDLVDTCFREISDLQKKG
        K+ +   EI +L +    +++++   +  +  I+E +   E + ++N  D  KT            P+KA   +P++   M ++  D   +E+ DL+   
Subjt:  KVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKENIETQNNFDETKT------------PTKA---RPIQ---MSKDLVDTCFREISDLQKKG

Query:  LISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMS
        +I PSKSP    AF VNN+AEK RG  R+V+NYK +NK      Y +PN+ +LL  I   KIFS FD KS FWQ+ ++ + +  TAF  P G YEWNV+ 
Subjt:  LISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMS

Query:  FGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLF--------------------------------
        FGLK APS F++ M++ F   + F  VY+DD+++FS   + H  H+ M +     +G+++S  K +LF                                
Subjt:  FGLKNAPSEFEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLF--------------------------------

Query:  -----QTKKYVGCVNYIGDFIKDPRTICLPFYDRLKKN-PKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE
             Q ++++G + Y  D+I     I  P   +LK+N P  WT E +  +Q +K   +G P L     + K+IIETDAS+
Subjt:  -----QTKKYVGCVNYIGDFIKDPRTICLPFYDRLKKN-PKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASE

Q6XKE6 Genome polyprotein1.2e-5627.67Show/hide
Query:  IIFQELFLAATSYLRRMRFTGNLRSWWHN-HLTDANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSD
        ++ +E FL  + +  R+  +G LR WW++    D NR                   +T+Q  + N  + +   +F G  +   ++    +  MKC  +S 
Subjt:  IIFQELFLAATSYLRRMRFTGNLRSWWHN-HLTDANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSD

Query:  FKWYKDTFMSRLYNLM--TCRDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINST----IQKVC--LEIFQQQKHAT----KIAKDSD
         +   D    R+  L      D+  K  ++  LP  + E+  S ++   G +     V   GDI  T    +  +C   + F Q K  +    K    SD
Subjt:  FKWYKDTFMSRLYNLM--TCRDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINST----IQKVC--LEIFQQQKHAT----KIAKDSD

Query:  YRRKMGSFCKQYEIDNTPSRKKSRSKKVFKRSKTSESHGYSNRK-RKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRK-INLM-EIDE----DKK
           K    C  Y     PSR+  R  K FK     E  G   RK R++  KS+  +   ++  +CF C K GH++  CP  +K I L+ EI +    D +
Subjt:  YRRKMGSFCKQYEIDNTPSRKKSRSKKVFKRSKTSESHGYSNRK-RKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRK-INLM-EIDE----DKK

Query:  QSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSEN----------ETDSEEAISYFGCINVLTSTQKGLLDIIEEVDDEA---IRKKILL-
          L  V   E+  SE+  FS E  E+      + +S E   +EN          E    E       + V TS     + +I   D  A   I    +L 
Subjt:  QSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSEN----------ETDSEEAISYFGCINVLTSTQKGLLDIIEEVDDEA---IRKKILL-

Query:  ------------RLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKK----------------------EI---------NENKQ
                       + + T   K K P+   F        K   + +   DL    ++ ++                      EI         NE +Q
Subjt:  ------------RLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKK----------------------EI---------NENKQ

Query:  QISYLKNAVVAIQE-QIISKEN----------IETQNNFDETKTPTKARPIQMSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPR
            +KN + A      +SK +          I+     +E   PTKA    M+ + +    +E  +LQ+  LI PS S W+C AFYVN ++E+ RG  R
Subjt:  QISYLKNAVVAIQE-QIISKEN----------IETQNNFDETKTPTKARPIQMSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPR

Query:  LVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVY
        LVINY+PLN  L+  ++PIPN+  L   ++ AK+FSKFD+KS FWQ+ I+  ++ KT F +P   ++W VM FGLK APS F+K M  IF       +VY
Subjt:  LVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVY

Query:  IDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK-------------------------------------KYVGCVNYIGDFIKDPRTIC
        IDD+++FSET++ H K LN FI+++K  G+++S  K+ L Q K                                     +++G VNYI DFI +     
Subjt:  IDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK-------------------------------------KYVGCVNYIGDFIKDPRTIC

Query:  LPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASEK
         P  D LKK P  W      AV+ +K LA+ +  L  +  + K I++TDAS++
Subjt:  LPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASEK

Q91DM0 Genome polyprotein5.8e-5627.43Show/hide
Query:  IIFQELFLAATSYLRRMRFTGNLRSWWHN-HLTDANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSD
        ++ +E FL  + +  R+  +G LR WW++    D NR                   +T+Q  + N  + +   +F G  +   ++    +  MKC  +S 
Subjt:  IIFQELFLAATSYLRRMRFTGNLRSWWHN-HLTDANREAIKDAVLEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSD

Query:  FKWYKDTFMSRLYNLM--TCRDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINST----IQKVC--LEIFQQQKHAT----KIAKDSD
         +   D    R+  L      D+  K  ++  LP  + E+  S ++   G +     V   GDI  T    +  +C   + F Q K  +    K    SD
Subjt:  FKWYKDTFMSRLYNLM--TCRDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDWEVISDGDINST----IQKVC--LEIFQQQKHAT----KIAKDSD

Query:  YRRKMGSFCKQYEIDNTPSRKKSRSKKVFKRSKTSESHGYSNRK-RKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRK-INLM-EIDE----DKK
           K    C  Y     PSR+  R  K FK     E  G   RK R++  +S+  +   ++  +CF C K GH++  CP  +K I L+ EI +    D +
Subjt:  YRRKMGSFCKQYEIDNTPSRKKSRSKKVFKRSKTSESHGYSNRK-RKYHGKSRNKRKFFKENTTCFKCNKKGHYANRCPVVRK-INLM-EIDE----DKK

Query:  QSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSEN----------ETDSEEAISYFGCINVLTSTQKGLLDIIEEVDDEA---IRKKILL-
          L  V   E+  SE+  FS E  E+      + +S E   +EN          E    E       + + TS     + +I   D  A   I    +L 
Subjt:  QSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSEN----------ETDSEEAISYFGCINVLTSTQKGLLDIIEEVDDEA---IRKKILL-

Query:  ------------RLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKK----------------------EI---------NENKQ
                       + + T   K K P+   F        K   + +   DL    ++ ++                      EI         NE +Q
Subjt:  ------------RLREDLETPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKK----------------------EI---------NENKQ

Query:  QISYLKNAVVAIQE-QIISKEN----------IETQNNFDETKTPTKARPIQMSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPR
            +KN + A      +SK +          I+     +E   PTKA    M+ + +    +E  +LQ+  LI PS S W+C AFYVN ++E+ RG  R
Subjt:  QISYLKNAVVAIQE-QIISKEN----------IETQNNFDETKTPTKARPIQMSKDLVDTCFREISDLQKKGLISPSKSPWSCAAFYVNNQAEKERGVPR

Query:  LVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVY
        LVINY+PLN  L+  ++PIPN+  L   ++ AK+FSKFD+KS FWQ+ I+  ++ KT F +P   ++W VM FGLK APS F+K M  IF       +VY
Subjt:  LVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMSFGLKNAPSEFEKIMNDIFNKHQDFTIVY

Query:  IDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK-------------------------------------KYVGCVNYIGDFIKDPRTIC
        IDD+++FSET++ H K LN FI+++K  G+++S  K+ L Q K                                     +++G VNYI DFI +     
Subjt:  IDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTK-------------------------------------KYVGCVNYIGDFIKDPRTIC

Query:  LPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASEK
         P  D LKK P  W      AV+ +K LA+ +  L  +  + K I++TDAS++
Subjt:  LPFYDRLKKNPKPWTNEHSRAVQSIKSLAKGIPCLSLVDYKAKMIIETDASEK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCCTCAATACAATCAAGTATGATTCTCCAAAAATTCTTCCTATCAACGTTTACCACATTGATATGAAAAATCATTATAAAAGGCCATCACCACAAGATCTTGG
ATGGGATGACTTACGCCCAGACTTTCGATCATTCGATGGAAATAATATCGAAACTTGGAACATCGATGAATGTTCTGAAGGACAGATAATGATCATTTTTCAAGAACTGT
TTCTTGCTGCAACAAGCTATCTTAGAAGAATGAGGTTTACAGGAAACCTTAGAAGTTGGTGGCACAATCATCTTACTGATGCAAATAGGGAAGCCATAAAGGACGCTGTT
TTGGAAAAAACAGAACAAGGCCCTAATGGCGATGTTGTTACTACCCAACCAAACTCTGTAAACACCCTCGTTTACGTAGCTATTAAACACTTTATAGGGAGAACAACCCT
ATACAGTGATCAACCAATGGAAGGTCTTTTGGGAATGAAATGCCCCAAAATGAGTGATTTCAAGTGGTATAAAGACACTTTTATGTCACGTCTTTACAACCTTATGACAT
GCAGAGATGTGGTCTGGAAACATAAATATGTAGAAGGGCTACCCAAGTATGTTAAAGAAAAGTTCTATTCCACAATGGTGACTAATAGTGGACGAACAGATATAGATTGG
GAAGTCATCTCCGATGGAGACATCAATTCCACTATTCAAAAGGTGTGTTTGGAAATTTTTCAGCAACAAAAGCACGCTACAAAGATTGCAAAAGACTCAGATTATCGAAG
GAAAATGGGATCCTTTTGCAAGCAATATGAAATTGATAACACTCCTTCTAGAAAGAAGAGTCGTTCTAAGAAAGTCTTTAAAAGGAGTAAAACGAGTGAATCACATGGGT
ACTCTAACCGCAAAAGAAAGTACCATGGCAAAAGTAGAAACAAGCGAAAATTTTTCAAAGAAAATACCACATGTTTCAAATGCAATAAGAAGGGTCATTATGCCAATAGA
TGCCCTGTTGTCAGGAAAATTAACCTAATGGAGATTGATGAAGATAAAAAACAATCTCTGTCGAAAGTTATCAAAGCTGAAGAATTATCTTCCGAAGAAGAAGAATTCTC
CTTCGAAAAGGAAGAAGATCTTCTCAATGTCCTCCTCGAAGAATCTTCAGAGGAACCTTCGTCATCAGAAAATGAAACAGACAGCGAAGAGGCGATTTCCTATTTTGGCT
GCATTAATGTTCTCACAAGCACCCAGAAGGGTCTCTTAGACATCATCGAAGAAGTTGATGATGAAGCAATTAGAAAAAAGATACTTTTGAGACTTCGAGAGGACTTAGAG
ACTCCCGATCAAAAGTTCAAAGATCCGATGAATTTCAGCTTTCAAGCTGTGATGAATCTGTTGCCAAAAGAACATGCGGCTCCAGTTAAAGTAATAGATCTCTACCATGA
AATCGAAGTCCTGAAGAAGGAAATTAACGAAAACAAACAACAAATATCTTATCTTAAAAATGCAGTTGTTGCGATTCAAGAACAGATAATTTCAAAAGAAAACATTGAAA
CGCAAAATAATTTTGATGAAACCAAAACTCCTACTAAGGCTAGGCCTATTCAAATGAGCAAGGATCTAGTTGATACTTGCTTCAGAGAGATATCTGATCTCCAAAAGAAG
GGTTTAATAAGCCCATCAAAGAGCCCATGGTCGTGTGCGGCGTTTTATGTTAACAATCAGGCAGAAAAGGAACGAGGAGTCCCTCGCCTGGTCATCAATTATAAACCACT
AAACAAAGTGCTCAAGTGGATTAGGTACCCAATACCTAATCGCCAAGACTTGCTTAAGCGTATCGCTAATGCAAAGATCTTCTCAAAATTTGATATTAAATCTGAATTTT
GGCAAATTCAAATTAACTCAAAAGACAAATATAAAACTGCTTTCAATGTTCCTTTCGGTCAATATGAATGGAATGTTATGTCATTCGGTCTTAAGAATGCTCCCTCAGAG
TTTGAAAAGATCATGAATGACATTTTCAATAAGCATCAAGACTTCACAATAGTTTATATTGATGATGTCATGATATTTTCTGAAACCATTGATCGACATTTTAAACATCT
TAATATGTTCATCAATATCATTAAAACAAATGGCCTTGTGGTATCTTTACCAAAAGTCAAGTTGTTTCAAACAAAAAAATATGTTGGATGTGTTAATTATATCGGAGATT
TCATTAAAGACCCCCGAACAATCTGTCTGCCATTTTATGACAGACTAAAAAAGAACCCAAAACCATGGACGAATGAGCATTCTCGAGCAGTCCAATCAATCAAATCCTTA
GCCAAAGGCATTCCATGCTTATCGCTTGTGGATTACAAAGCCAAAATGATCATCGAAACCGACGCTTCTGAAAAAGAACTGATTCAAAAGCAAGCACTAGCAACTGCCTT
ACGGATTATCTATCGAGAGAGCACATCTCAAAACATAAGCTCTCAAAATCCTCAAGTACCTCAAAAAATGCTTCCGAACTCCGATCGTCCTCGGCCACCAAGGAGGCCTC
CGCCTCCTCTTCCTTCCAATCAGCCAAGTCCGACTAACTCTCAACCAGCTAGCTCTTCTAGAGCGAGCTCCTCCAAAGGCAAATGGCCAATTTCTCAAGCATCTGCACTT
ACTCCAATGTGTGCAGACAATTACGCCATGGATCTGAGTTTCCAAACGGTATCCCGGCGTCGCCAAGGTTCCTCACAAAGGAATCTAACAAATGGATCTACTCCATATCC
CTCAACAACTTTGTTGCGCCCAATAAGTGCAGCAACATCAAATAGGTTAGCCTCTAGACCTCAATCCTATGCCCAGACAATTAAACCAGCGGTGTTTATGCCAAGGCCAC
CAGTCACAGGATACCAAACTAAAACAACTCTGGAAGATGTAGTCATCGAACCAGAATTTGATGGACCTTCCGTTCCAGAGATTTCCTCTCAAGTTTACCCCCATGGGTTC
AATTTCTTCCCTGAAGATATTAAGAAAACAAGACAATTCTATGAGTTCATTTTAGTAGACACTGTCTCTGCCGAAATCACTCATATTCCAGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGCCTCAATACAATCAAGTATGATTCTCCAAAAATTCTTCCTATCAACGTTTACCACATTGATATGAAAAATCATTATAAAAGGCCATCACCACAAGATCTTGG
ATGGGATGACTTACGCCCAGACTTTCGATCATTCGATGGAAATAATATCGAAACTTGGAACATCGATGAATGTTCTGAAGGACAGATAATGATCATTTTTCAAGAACTGT
TTCTTGCTGCAACAAGCTATCTTAGAAGAATGAGGTTTACAGGAAACCTTAGAAGTTGGTGGCACAATCATCTTACTGATGCAAATAGGGAAGCCATAAAGGACGCTGTT
TTGGAAAAAACAGAACAAGGCCCTAATGGCGATGTTGTTACTACCCAACCAAACTCTGTAAACACCCTCGTTTACGTAGCTATTAAACACTTTATAGGGAGAACAACCCT
ATACAGTGATCAACCAATGGAAGGTCTTTTGGGAATGAAATGCCCCAAAATGAGTGATTTCAAGTGGTATAAAGACACTTTTATGTCACGTCTTTACAACCTTATGACAT
GCAGAGATGTGGTCTGGAAACATAAATATGTAGAAGGGCTACCCAAGTATGTTAAAGAAAAGTTCTATTCCACAATGGTGACTAATAGTGGACGAACAGATATAGATTGG
GAAGTCATCTCCGATGGAGACATCAATTCCACTATTCAAAAGGTGTGTTTGGAAATTTTTCAGCAACAAAAGCACGCTACAAAGATTGCAAAAGACTCAGATTATCGAAG
GAAAATGGGATCCTTTTGCAAGCAATATGAAATTGATAACACTCCTTCTAGAAAGAAGAGTCGTTCTAAGAAAGTCTTTAAAAGGAGTAAAACGAGTGAATCACATGGGT
ACTCTAACCGCAAAAGAAAGTACCATGGCAAAAGTAGAAACAAGCGAAAATTTTTCAAAGAAAATACCACATGTTTCAAATGCAATAAGAAGGGTCATTATGCCAATAGA
TGCCCTGTTGTCAGGAAAATTAACCTAATGGAGATTGATGAAGATAAAAAACAATCTCTGTCGAAAGTTATCAAAGCTGAAGAATTATCTTCCGAAGAAGAAGAATTCTC
CTTCGAAAAGGAAGAAGATCTTCTCAATGTCCTCCTCGAAGAATCTTCAGAGGAACCTTCGTCATCAGAAAATGAAACAGACAGCGAAGAGGCGATTTCCTATTTTGGCT
GCATTAATGTTCTCACAAGCACCCAGAAGGGTCTCTTAGACATCATCGAAGAAGTTGATGATGAAGCAATTAGAAAAAAGATACTTTTGAGACTTCGAGAGGACTTAGAG
ACTCCCGATCAAAAGTTCAAAGATCCGATGAATTTCAGCTTTCAAGCTGTGATGAATCTGTTGCCAAAAGAACATGCGGCTCCAGTTAAAGTAATAGATCTCTACCATGA
AATCGAAGTCCTGAAGAAGGAAATTAACGAAAACAAACAACAAATATCTTATCTTAAAAATGCAGTTGTTGCGATTCAAGAACAGATAATTTCAAAAGAAAACATTGAAA
CGCAAAATAATTTTGATGAAACCAAAACTCCTACTAAGGCTAGGCCTATTCAAATGAGCAAGGATCTAGTTGATACTTGCTTCAGAGAGATATCTGATCTCCAAAAGAAG
GGTTTAATAAGCCCATCAAAGAGCCCATGGTCGTGTGCGGCGTTTTATGTTAACAATCAGGCAGAAAAGGAACGAGGAGTCCCTCGCCTGGTCATCAATTATAAACCACT
AAACAAAGTGCTCAAGTGGATTAGGTACCCAATACCTAATCGCCAAGACTTGCTTAAGCGTATCGCTAATGCAAAGATCTTCTCAAAATTTGATATTAAATCTGAATTTT
GGCAAATTCAAATTAACTCAAAAGACAAATATAAAACTGCTTTCAATGTTCCTTTCGGTCAATATGAATGGAATGTTATGTCATTCGGTCTTAAGAATGCTCCCTCAGAG
TTTGAAAAGATCATGAATGACATTTTCAATAAGCATCAAGACTTCACAATAGTTTATATTGATGATGTCATGATATTTTCTGAAACCATTGATCGACATTTTAAACATCT
TAATATGTTCATCAATATCATTAAAACAAATGGCCTTGTGGTATCTTTACCAAAAGTCAAGTTGTTTCAAACAAAAAAATATGTTGGATGTGTTAATTATATCGGAGATT
TCATTAAAGACCCCCGAACAATCTGTCTGCCATTTTATGACAGACTAAAAAAGAACCCAAAACCATGGACGAATGAGCATTCTCGAGCAGTCCAATCAATCAAATCCTTA
GCCAAAGGCATTCCATGCTTATCGCTTGTGGATTACAAAGCCAAAATGATCATCGAAACCGACGCTTCTGAAAAAGAACTGATTCAAAAGCAAGCACTAGCAACTGCCTT
ACGGATTATCTATCGAGAGAGCACATCTCAAAACATAAGCTCTCAAAATCCTCAAGTACCTCAAAAAATGCTTCCGAACTCCGATCGTCCTCGGCCACCAAGGAGGCCTC
CGCCTCCTCTTCCTTCCAATCAGCCAAGTCCGACTAACTCTCAACCAGCTAGCTCTTCTAGAGCGAGCTCCTCCAAAGGCAAATGGCCAATTTCTCAAGCATCTGCACTT
ACTCCAATGTGTGCAGACAATTACGCCATGGATCTGAGTTTCCAAACGGTATCCCGGCGTCGCCAAGGTTCCTCACAAAGGAATCTAACAAATGGATCTACTCCATATCC
CTCAACAACTTTGTTGCGCCCAATAAGTGCAGCAACATCAAATAGGTTAGCCTCTAGACCTCAATCCTATGCCCAGACAATTAAACCAGCGGTGTTTATGCCAAGGCCAC
CAGTCACAGGATACCAAACTAAAACAACTCTGGAAGATGTAGTCATCGAACCAGAATTTGATGGACCTTCCGTTCCAGAGATTTCCTCTCAAGTTTACCCCCATGGGTTC
AATTTCTTCCCTGAAGATATTAAGAAAACAAGACAATTCTATGAGTTCATTTTAGTAGACACTGTCTCTGCCGAAATCACTCATATTCCAGACTGA
Protein sequenceShow/hide protein sequence
MGSLNTIKYDSPKILPINVYHIDMKNHYKRPSPQDLGWDDLRPDFRSFDGNNIETWNIDECSEGQIMIIFQELFLAATSYLRRMRFTGNLRSWWHNHLTDANREAIKDAV
LEKTEQGPNGDVVTTQPNSVNTLVYVAIKHFIGRTTLYSDQPMEGLLGMKCPKMSDFKWYKDTFMSRLYNLMTCRDVVWKHKYVEGLPKYVKEKFYSTMVTNSGRTDIDW
EVISDGDINSTIQKVCLEIFQQQKHATKIAKDSDYRRKMGSFCKQYEIDNTPSRKKSRSKKVFKRSKTSESHGYSNRKRKYHGKSRNKRKFFKENTTCFKCNKKGHYANR
CPVVRKINLMEIDEDKKQSLSKVIKAEELSSEEEEFSFEKEEDLLNVLLEESSEEPSSSENETDSEEAISYFGCINVLTSTQKGLLDIIEEVDDEAIRKKILLRLREDLE
TPDQKFKDPMNFSFQAVMNLLPKEHAAPVKVIDLYHEIEVLKKEINENKQQISYLKNAVVAIQEQIISKENIETQNNFDETKTPTKARPIQMSKDLVDTCFREISDLQKK
GLISPSKSPWSCAAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRIANAKIFSKFDIKSEFWQIQINSKDKYKTAFNVPFGQYEWNVMSFGLKNAPSE
FEKIMNDIFNKHQDFTIVYIDDVMIFSETIDRHFKHLNMFINIIKTNGLVVSLPKVKLFQTKKYVGCVNYIGDFIKDPRTICLPFYDRLKKNPKPWTNEHSRAVQSIKSL
AKGIPCLSLVDYKAKMIIETDASEKELIQKQALATALRIIYRESTSQNISSQNPQVPQKMLPNSDRPRPPRRPPPPLPSNQPSPTNSQPASSSRASSSKGKWPISQASAL
TPMCADNYAMDLSFQTVSRRRQGSSQRNLTNGSTPYPSTTLLRPISAATSNRLASRPQSYAQTIKPAVFMPRPPVTGYQTKTTLEDVVIEPEFDGPSVPEISSQVYPHGF
NFFPEDIKKTRQFYEFILVDTVSAEITHIPD