; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g17680 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g17680
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionEnzymatic polyprotein
Genome locationchr7:12609607..12618465
RNA-Seq ExpressionMoc07g17680
SyntenyMoc07g17680
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001995 - Peptidase A2A, retrovirus, catalytic
IPR018061 - Retropepsins
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052109.1 Enzymatic polyprotein [Cucumis melo var. makuwa]2.2e-12038.9Show/hide
Query:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDR-GDSSKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE
        AVERIENP LP  +K+P IPQ++P  PIFQPNSF IG LKE+ S+ L EIN+RL+++S+++   ++ +    K IN++    +  QAS   IL V    +
Subjt:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDR-GDSSKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE

Query:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------
        MK+HYPQPSPPDLGWDDL H++R YDG S++TW +                                                                 
Subjt:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------

Query:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA
                        D++N             T+I+ +L  EALLGL+  KMS YKWYKDTF+ RL+T+TTCGADIWKQKFVEGLPHYI+Q+FYQT  A
Subjt:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA

Query:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------
        NS   QI+W+ LT GDI++T+Q I VNLC ENKHT KVIKD DYRKELGTFCKQYG   GP++E+KKKKK                              
Subjt:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI
                                                                        N+VKGEAK PIQ+EDL++EV TLKREVA +KQRL  
Subjt:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI

Query:  LQYTFRKFQE--LKPTEGETS------STPERTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSG
        L+  F+ FQE  +     ETS          + LL      IN ISKV N+KWMSKI+FK++DFQLET ALIDSGAD+NVIQEGLVPSKYFEKTKE LS 
Subjt:  LQYTFRKFQE--LKPTEGETS------STPERTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSG

Query:  AGGNLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS
        A GN LNI++KLS+VHICK DVCL+ NTFILVKNLNEG+ILG+
Subjt:  AGGNLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS

KAA0056776.1 Enzymatic polyprotein [Cucumis melo var. makuwa]1.0e-12539.46Show/hide
Query:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE
        AVER+EN   P   K+P IPQ++P  PIFQPNSF IGSL+E++S+ L EINRRL+++S+++G   + +    K+IN++    +  QAS S IL V    +
Subjt:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE

Query:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------
        MK+HYPQPSPPDLGWDDL H++R YDG S++TW +                                                                 
Subjt:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------

Query:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA
                        D++N             T+I+ +L  EALLGL+C KMS YKWYKDTF+ RL+T+TTCGADIWKQKFVEGLPHYI+Q+FYQT  A
Subjt:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA

Query:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------
        NS   QI+W+ LT GDI++T+Q ICVNLC ENKHT KVIKD DYRKELGTFCKQYG   GP++E+KKKKK                              
Subjt:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI
                                                                        N+VKGEAK PIQ+EDL++EV TLKREVA +KQRL  
Subjt:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI

Query:  LQYTFRKFQELKPTEGETSSTPER-----TLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGAGG
        L+  F+ FQ  + ++ E++S  ER      LL      IN ISK+QNQKWMSKI+FK++DFQLE  ALIDSGAD+NVIQEGLVPS+YFEKTKE LSGA G
Subjt:  LQYTFRKFQELKPTEGETSSTPER-----TLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGAGG

Query:  NLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS
        N LNI++KLSKVHICK DVCL+ NTFILVKNLNEG+ILG+
Subjt:  NLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS

KAA0057417.1 Enzymatic polyprotein [Cucumis melo var. makuwa]1.8e-11137.64Show/hide
Query:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE
        AVE IENP LP  +K+P IPQ++P  PIFQPNSF IG LKE+ S+ L EIN+RL+++S+++    + +    K IN++       Q S+S IL V    +
Subjt:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE

Query:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------
        MK+HYPQPSPPDLGWDDL H++R YDG S++TW                                                                   
Subjt:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------

Query:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA
                        D++N             T+I+ +L  EALLGL+C KMS YKWYKDTF+ RL+T+TTCGADIWKQKFVEGLPHYI+Q+FYQT   
Subjt:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA

Query:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKSS----------------------------
        NS   QI+W+ LT GDI++T+Q ICVNLC ENKHT KVIKD DYRKELGTFCKQYG   GP++E+KKKKK S                            
Subjt:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKSS----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------NKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSIL
                                                                       N+VKGEAK PIQ+EDL++EV  LKREV  +KQRL  L
Subjt:  ---------------------------------------------------------------NKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSIL

Query:  QYTFRKFQELKP-TEGETSSTPE-------RTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGA
        +  F+ FQE +   E   +ST +       + LL      IN ISKV N+KWMSKI+FK++DFQLE  ALIDSGAD+NVIQE LVPSKYFEKTKE LSGA
Subjt:  QYTFRKFQELKP-TEGETSSTPE-------RTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGA

Query:  GGNLLNIKYKLSKVHICKDD
        GGN LNI++KLSKVHICK D
Subjt:  GGNLLNIKYKLSKVHICKDD

TYJ97599.1 Enzymatic polyprotein [Cucumis melo var. makuwa]1.7e-12539.32Show/hide
Query:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE
        AVER+EN   P   K+P IPQ++P  PIFQPNSF IGSL+E++S+ L EINRRL+++S+++G   + +    K+IN++    +  QAS S IL V    +
Subjt:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE

Query:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------
        MK+HYPQPSPPDLGWDDL H++R YDG S++TW +                                                                 
Subjt:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------

Query:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA
                        D++N             T+I+ +L  EALLGL+C KMS YKWYKDTF+ RL+T+TTCGADIWKQKFVEGLPHYI+Q+FYQT  A
Subjt:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA

Query:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------
        NS   QI+W+ LT GDI++T+Q ICVNLC ENKHT KVIKD DYRKELGTFCKQYG   GP++E+KKKKK                              
Subjt:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI
                                                                        N+VKGEAK PIQ+EDL++EV TLKREVA +KQRL  
Subjt:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI

Query:  LQYTFRKFQELKPTEGETSSTPER-----TLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGAGG
        L+  F+ FQ  + ++ E++S  ER      LL      IN IS++QNQKWMSKI+FK++DFQLE  ALIDSGAD+NVIQEGLVPS+YFEKTKE LSGA G
Subjt:  LQYTFRKFQELKPTEGETSSTPER-----TLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGAGG

Query:  NLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS
        N LNI++KLSKVHICK DVCL+ NTFILVKNLNEG+ILG+
Subjt:  NLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS

TYJ98087.1 Enzymatic polyprotein [Cucumis melo var. makuwa]2.2e-12038.9Show/hide
Query:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDR-GDSSKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE
        AVERIENP LP  +K+P IPQ++P  PIFQPNSF IG LKE+ S+ L EIN+RL+++S+++   ++ +    K IN++    +  QAS   IL V    +
Subjt:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDR-GDSSKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE

Query:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------
        MK+HYPQPSPPDLGWDDL H++R YDG S++TW +                                                                 
Subjt:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------

Query:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA
                        D++N             T+I+ +L  EALLGL+  KMS YKWYKDTF+ RL+T+TTCGADIWKQKFVEGLPHYI+Q+FYQT  A
Subjt:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA

Query:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------
        NS   QI+W+ LT GDI++T+Q I VNLC ENKHT KVIKD DYRKELGTFCKQYG   GP++E+KKKKK                              
Subjt:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI
                                                                        N+VKGEAK PIQ+EDL++EV TLKREVA +KQRL  
Subjt:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI

Query:  LQYTFRKFQE--LKPTEGETS------STPERTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSG
        L+  F+ FQE  +     ETS          + LL      IN ISKV N+KWMSKI+FK++DFQLET ALIDSGAD+NVIQEGLVPSKYFEKTKE LS 
Subjt:  LQYTFRKFQE--LKPTEGETS------STPERTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSG

Query:  AGGNLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS
        A GN LNI++KLS+VHICK DVCL+ NTFILVKNLNEG+ILG+
Subjt:  AGGNLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS

TrEMBL top hitse value%identityAlignment
A0A5A7UF59 Enzymatic polyprotein1.0e-12038.9Show/hide
Query:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDR-GDSSKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE
        AVERIENP LP  +K+P IPQ++P  PIFQPNSF IG LKE+ S+ L EIN+RL+++S+++   ++ +    K IN++    +  QAS   IL V    +
Subjt:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDR-GDSSKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE

Query:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------
        MK+HYPQPSPPDLGWDDL H++R YDG S++TW +                                                                 
Subjt:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------

Query:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA
                        D++N             T+I+ +L  EALLGL+  KMS YKWYKDTF+ RL+T+TTCGADIWKQKFVEGLPHYI+Q+FYQT  A
Subjt:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA

Query:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------
        NS   QI+W+ LT GDI++T+Q I VNLC ENKHT KVIKD DYRKELGTFCKQYG   GP++E+KKKKK                              
Subjt:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI
                                                                        N+VKGEAK PIQ+EDL++EV TLKREVA +KQRL  
Subjt:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI

Query:  LQYTFRKFQE--LKPTEGETS------STPERTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSG
        L+  F+ FQE  +     ETS          + LL      IN ISKV N+KWMSKI+FK++DFQLET ALIDSGAD+NVIQEGLVPSKYFEKTKE LS 
Subjt:  LQYTFRKFQE--LKPTEGETS------STPERTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSG

Query:  AGGNLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS
        A GN LNI++KLS+VHICK DVCL+ NTFILVKNLNEG+ILG+
Subjt:  AGGNLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS

A0A5A7UR29 Enzymatic polyprotein4.8e-12639.46Show/hide
Query:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE
        AVER+EN   P   K+P IPQ++P  PIFQPNSF IGSL+E++S+ L EINRRL+++S+++G   + +    K+IN++    +  QAS S IL V    +
Subjt:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE

Query:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------
        MK+HYPQPSPPDLGWDDL H++R YDG S++TW +                                                                 
Subjt:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------

Query:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA
                        D++N             T+I+ +L  EALLGL+C KMS YKWYKDTF+ RL+T+TTCGADIWKQKFVEGLPHYI+Q+FYQT  A
Subjt:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA

Query:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------
        NS   QI+W+ LT GDI++T+Q ICVNLC ENKHT KVIKD DYRKELGTFCKQYG   GP++E+KKKKK                              
Subjt:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI
                                                                        N+VKGEAK PIQ+EDL++EV TLKREVA +KQRL  
Subjt:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI

Query:  LQYTFRKFQELKPTEGETSSTPER-----TLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGAGG
        L+  F+ FQ  + ++ E++S  ER      LL      IN ISK+QNQKWMSKI+FK++DFQLE  ALIDSGAD+NVIQEGLVPS+YFEKTKE LSGA G
Subjt:  LQYTFRKFQELKPTEGETSSTPER-----TLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGAGG

Query:  NLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS
        N LNI++KLSKVHICK DVCL+ NTFILVKNLNEG+ILG+
Subjt:  NLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS

A0A5A7URX9 Enzymatic polyprotein8.8e-11237.64Show/hide
Query:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE
        AVE IENP LP  +K+P IPQ++P  PIFQPNSF IG LKE+ S+ L EIN+RL+++S+++    + +    K IN++       Q S+S IL V    +
Subjt:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE

Query:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------
        MK+HYPQPSPPDLGWDDL H++R YDG S++TW                                                                   
Subjt:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------

Query:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA
                        D++N             T+I+ +L  EALLGL+C KMS YKWYKDTF+ RL+T+TTCGADIWKQKFVEGLPHYI+Q+FYQT   
Subjt:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA

Query:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKSS----------------------------
        NS   QI+W+ LT GDI++T+Q ICVNLC ENKHT KVIKD DYRKELGTFCKQYG   GP++E+KKKKK S                            
Subjt:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKSS----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------NKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSIL
                                                                       N+VKGEAK PIQ+EDL++EV  LKREV  +KQRL  L
Subjt:  ---------------------------------------------------------------NKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSIL

Query:  QYTFRKFQELKP-TEGETSSTPE-------RTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGA
        +  F+ FQE +   E   +ST +       + LL      IN ISKV N+KWMSKI+FK++DFQLE  ALIDSGAD+NVIQE LVPSKYFEKTKE LSGA
Subjt:  QYTFRKFQELKP-TEGETSSTPE-------RTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGA

Query:  GGNLLNIKYKLSKVHICKDD
        GGN LNI++KLSKVHICK D
Subjt:  GGNLLNIKYKLSKVHICKDD

A0A5D3BEY3 Enzymatic polyprotein8.2e-12639.32Show/hide
Query:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE
        AVER+EN   P   K+P IPQ++P  PIFQPNSF IGSL+E++S+ L EINRRL+++S+++G   + +    K+IN++    +  QAS S IL V    +
Subjt:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDS-SKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE

Query:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------
        MK+HYPQPSPPDLGWDDL H++R YDG S++TW +                                                                 
Subjt:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------

Query:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA
                        D++N             T+I+ +L  EALLGL+C KMS YKWYKDTF+ RL+T+TTCGADIWKQKFVEGLPHYI+Q+FYQT  A
Subjt:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA

Query:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------
        NS   QI+W+ LT GDI++T+Q ICVNLC ENKHT KVIKD DYRKELGTFCKQYG   GP++E+KKKKK                              
Subjt:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI
                                                                        N+VKGEAK PIQ+EDL++EV TLKREVA +KQRL  
Subjt:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI

Query:  LQYTFRKFQELKPTEGETSSTPER-----TLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGAGG
        L+  F+ FQ  + ++ E++S  ER      LL      IN IS++QNQKWMSKI+FK++DFQLE  ALIDSGAD+NVIQEGLVPS+YFEKTKE LSGA G
Subjt:  LQYTFRKFQELKPTEGETSSTPER-----TLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGAGG

Query:  NLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS
        N LNI++KLSKVHICK DVCL+ NTFILVKNLNEG+ILG+
Subjt:  NLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS

A0A5D3BG41 Enzymatic polyprotein1.0e-12038.9Show/hide
Query:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDR-GDSSKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE
        AVERIENP LP  +K+P IPQ++P  PIFQPNSF IG LKE+ S+ L EIN+RL+++S+++   ++ +    K IN++    +  QAS   IL V    +
Subjt:  AVERIENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDR-GDSSKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTE

Query:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------
        MK+HYPQPSPPDLGWDDL H++R YDG S++TW +                                                                 
Subjt:  MKSHYPQPSPPDLGWDDLRHDQRAYDGTSIVTWKL-----------------------------------------------------------------

Query:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA
                        D++N             T+I+ +L  EALLGL+  KMS YKWYKDTF+ RL+T+TTCGADIWKQKFVEGLPHYI+Q+FYQT  A
Subjt:  ---------------MDILN-------------TKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVA

Query:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------
        NS   QI+W+ LT GDI++T+Q I VNLC ENKHT KVIKD DYRKELGTFCKQYG   GP++E+KKKKK                              
Subjt:  NSTTNQINWSELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKS-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI
                                                                        N+VKGEAK PIQ+EDL++EV TLKREVA +KQRL  
Subjt:  ---------------------------------------------------------------SNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSI

Query:  LQYTFRKFQE--LKPTEGETS------STPERTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSG
        L+  F+ FQE  +     ETS          + LL      IN ISKV N+KWMSKI+FK++DFQLET ALIDSGAD+NVIQEGLVPSKYFEKTKE LS 
Subjt:  LQYTFRKFQE--LKPTEGETS------STPERTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSG

Query:  AGGNLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS
        A GN LNI++KLS+VHICK DVCL+ NTFILVKNLNEG+ILG+
Subjt:  AGGNLLNIKYKLSKVHICKDDVCLINNTFILVKNLNEGVILGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTCGTGTAGGAGGGAGCGACGCCGTGGGACACCGGAAACTAAGTCGCCGCCGCTGCTGCTCGAACGCCGCCGAGGAGAAGGTACCGCCGCCGTTGCTATG
GATCGTCGGGACGTCGGGTCGAAGTCGCGCCGCCGCTGGATGTGTCGCCAGCCGCCGCTGCTGCTATTACGAACCCGCCGTCTCGGGATGTGGTCGTCGGTCGTC
GAACTCCAACAACGCGGGTGGGGGCTGAAATCTCGTCGCACCGCCGCAGAGAAGGACGTTGGAGTTGCTGCTACCCCGTGCGCTGCTGCAGCTGTTGAAAGAATT
GAGAATCCGGTTCTTCCTACAGTCTCAAAAAGCCCAGGGATCCCACAAGTAGACCCCTGTCATCCAATCTTTCAACCAAACAGTTTTAAGATTGGATCTCTCAAA
GAAAATCTCTCAAATCTCTTGCTTGAGATCAACAGAAGACTTTCTTCTTTGTCTATCGATAGAGGAGACTCTTCCAAGAAGAATGATGTGGTTAAAATCATAAAC
GTAGTTGCTGCGATACCAACTACAACTCAAGCCTCATCTTCAACAATCCTTACAGTCACCATGCATACGGAAATGAAGAGCCATTATCCTCAACCATCTCCTCCT
GATCTAGGATGGGACGATCTCCGCCATGATCAAAGGGCTTATGACGGAACATCTATAGTCACTTGGAAATTGATGGATATTCTGAACACTAAAATCTACTCCGAC
CTCAATGCCGAAGCTCTTTTAGGCCTCCGATGTCGAAAGATGAGCAACTACAAATGGTATAAAGACACCTTCCTAACACGTCTTCATACTATTACAACATGCGGA
GCAGACATCTGGAAACAAAAATTTGTTGAAGGACTTCCGCATTATATTGCTCAAAGATTTTATCAGACGGCAGTAGCAAACTCTACAACCAATCAGATCAATTGG
TCAGAGTTAACCATTGGAGATATTACTGCCACAATTCAAGGAATATGCGTCAATCTCTGCCTGGAGAATAAGCATACAGCCAAAGTAATCAAAGATCCTGACTAC
CGAAAGGAATTGGGAACTTTCTGCAAACAATATGGTTTTGATTGTGGACCCAGAGATGAAAGGAAGAAGAAAAAGAAATCTTCGAACAAGGTCAAAGGAGAAGCT
AAAAAGCCTATCCAAATTGAAGATCTCTACAATGAAGTGAACACTCTCAAAAGGGAAGTTGCCAATAGTAAGCAACGTCTTTCTATTCTTCAATACACCTTTAGA
AAGTTTCAAGAGTTAAAACCCACGGAAGGAGAAACTTCCTCAACACCTGAAAGAACATTACTAACTGGTTCACCAAGCGGAATCAATTACATTAGTAAAGTTCAA
AACCAGAAGTGGATGTCCAAGATTATCTTCAAAATCAGAGACTTCCAACTGGAGACTTTCGCTCTTATCGACTCTGGAGCTGATAAGAACGTCATCCAAGAAGGT
TTAGTTCCTTCTAAATACTTCGAGAAAACCAAAGAAGTTCTTAGCGGAGCTGGTGGAAATCTATTGAACATCAAGTACAAATTATCTAAGGTCCATATCTGCAAA
GACGACGTGTGCCTTATCAACAACACCTTCATCCTAGTCAAAAACCTTAATGAAGGAGTCATACTAGGAAGTGAGAATTCTCTTGCTGATTACCTCTCAAGAGAA
CACCTCTTGAAGACTCTTAAATCAGCCCTGACCTCTCTTCCTTCTGATGGAACCTCCTCCCAGCTGAAGGCGGCTGAACACTCAGCGGCCCCGCCGCAGAACAAC
CAGCAACCCCCATCGCCGAGAAATGAAAGCATCATTTCTCCTCAAGCCGTAACATCCTCTTCAAGAGCTGCTACCTCAAAGGGCAAAAGGCCCATCTCTCAATCA
TCTGTACCATCTCCGATGAGTGCAGAAAACTACGCCGTGCATATTCAGTTCAAGACTGTATCCAGACGTCAACAAGGTTCTTCACAAAAAACCTTGTCGATTCAA
TCAGGCCCTTTTAGCCTTCCAATTCCCTCAAGTACGTTGTTACGCCCTTTCGGCACTACAACGAAAAATAGGCGTCCTGCTACAGCAACCACCTCTTCAAAACCT
ACTGTTCCGAGGAATCCTTCCTCGTTCTCTCAAATCGTCAGACCAAAGGTTTTTCAACCAAGGCCTCCAATCAATGGTTATTTCACCAAAACTACCATGGTGGAC
TCAACCATCGAACCAGAATTCAACGGACCCTCGGTCCAAGAAGGTATGTTTGTAGGGAAGAAACTTACAAGGCCCTTCCAATCACAAACCTACAACTATCGCGAC
TACATAAAGGCGTGGTATGTTGTTTTCTGTCTTCAAGGCTACAATCACTCTTGGTTTGTGACTTTTTCTGAAAAGATGAAAAACTGGTTCAAGACCAATGTTCAT
CTTCAAGACATGACGAGGCAAGAAGACGAGAGTTTCCTTCTAGCCAAAAACGCCGTCATGAGCTCACTAGCTGGAGCCGGATCTCAAGCCGTCTTCAACTCAGTT
CTCAATACAGTCGCGGTTCAAATCTCTTATGCAGACCACATTCAGATGGATGTTGATTCCTCCGCCTCTGTCAATAATGATGTTGAAGACGACGAAGACGACTTT
GATCCCTTCGAAGGCTACGACATCAATGATCCATACCTAGATTCACAGCCCTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGTCGTGTAGGAGGGAGCGACGCCGTGGGACACCGGAAACTAAGTCGCCGCCGCTGCTGCTCGAACGCCGCCGAGGAGAAGGTACCGCCGCCGTTGCTATG
GATCGTCGGGACGTCGGGTCGAAGTCGCGCCGCCGCTGGATGTGTCGCCAGCCGCCGCTGCTGCTATTACGAACCCGCCGTCTCGGGATGTGGTCGTCGGTCGTC
GAACTCCAACAACGCGGGTGGGGGCTGAAATCTCGTCGCACCGCCGCAGAGAAGGACGTTGGAGTTGCTGCTACCCCGTGCGCTGCTGCAGCTGTTGAAAGAATT
GAGAATCCGGTTCTTCCTACAGTCTCAAAAAGCCCAGGGATCCCACAAGTAGACCCCTGTCATCCAATCTTTCAACCAAACAGTTTTAAGATTGGATCTCTCAAA
GAAAATCTCTCAAATCTCTTGCTTGAGATCAACAGAAGACTTTCTTCTTTGTCTATCGATAGAGGAGACTCTTCCAAGAAGAATGATGTGGTTAAAATCATAAAC
GTAGTTGCTGCGATACCAACTACAACTCAAGCCTCATCTTCAACAATCCTTACAGTCACCATGCATACGGAAATGAAGAGCCATTATCCTCAACCATCTCCTCCT
GATCTAGGATGGGACGATCTCCGCCATGATCAAAGGGCTTATGACGGAACATCTATAGTCACTTGGAAATTGATGGATATTCTGAACACTAAAATCTACTCCGAC
CTCAATGCCGAAGCTCTTTTAGGCCTCCGATGTCGAAAGATGAGCAACTACAAATGGTATAAAGACACCTTCCTAACACGTCTTCATACTATTACAACATGCGGA
GCAGACATCTGGAAACAAAAATTTGTTGAAGGACTTCCGCATTATATTGCTCAAAGATTTTATCAGACGGCAGTAGCAAACTCTACAACCAATCAGATCAATTGG
TCAGAGTTAACCATTGGAGATATTACTGCCACAATTCAAGGAATATGCGTCAATCTCTGCCTGGAGAATAAGCATACAGCCAAAGTAATCAAAGATCCTGACTAC
CGAAAGGAATTGGGAACTTTCTGCAAACAATATGGTTTTGATTGTGGACCCAGAGATGAAAGGAAGAAGAAAAAGAAATCTTCGAACAAGGTCAAAGGAGAAGCT
AAAAAGCCTATCCAAATTGAAGATCTCTACAATGAAGTGAACACTCTCAAAAGGGAAGTTGCCAATAGTAAGCAACGTCTTTCTATTCTTCAATACACCTTTAGA
AAGTTTCAAGAGTTAAAACCCACGGAAGGAGAAACTTCCTCAACACCTGAAAGAACATTACTAACTGGTTCACCAAGCGGAATCAATTACATTAGTAAAGTTCAA
AACCAGAAGTGGATGTCCAAGATTATCTTCAAAATCAGAGACTTCCAACTGGAGACTTTCGCTCTTATCGACTCTGGAGCTGATAAGAACGTCATCCAAGAAGGT
TTAGTTCCTTCTAAATACTTCGAGAAAACCAAAGAAGTTCTTAGCGGAGCTGGTGGAAATCTATTGAACATCAAGTACAAATTATCTAAGGTCCATATCTGCAAA
GACGACGTGTGCCTTATCAACAACACCTTCATCCTAGTCAAAAACCTTAATGAAGGAGTCATACTAGGAAGTGAGAATTCTCTTGCTGATTACCTCTCAAGAGAA
CACCTCTTGAAGACTCTTAAATCAGCCCTGACCTCTCTTCCTTCTGATGGAACCTCCTCCCAGCTGAAGGCGGCTGAACACTCAGCGGCCCCGCCGCAGAACAAC
CAGCAACCCCCATCGCCGAGAAATGAAAGCATCATTTCTCCTCAAGCCGTAACATCCTCTTCAAGAGCTGCTACCTCAAAGGGCAAAAGGCCCATCTCTCAATCA
TCTGTACCATCTCCGATGAGTGCAGAAAACTACGCCGTGCATATTCAGTTCAAGACTGTATCCAGACGTCAACAAGGTTCTTCACAAAAAACCTTGTCGATTCAA
TCAGGCCCTTTTAGCCTTCCAATTCCCTCAAGTACGTTGTTACGCCCTTTCGGCACTACAACGAAAAATAGGCGTCCTGCTACAGCAACCACCTCTTCAAAACCT
ACTGTTCCGAGGAATCCTTCCTCGTTCTCTCAAATCGTCAGACCAAAGGTTTTTCAACCAAGGCCTCCAATCAATGGTTATTTCACCAAAACTACCATGGTGGAC
TCAACCATCGAACCAGAATTCAACGGACCCTCGGTCCAAGAAGGTATGTTTGTAGGGAAGAAACTTACAAGGCCCTTCCAATCACAAACCTACAACTATCGCGAC
TACATAAAGGCGTGGTATGTTGTTTTCTGTCTTCAAGGCTACAATCACTCTTGGTTTGTGACTTTTTCTGAAAAGATGAAAAACTGGTTCAAGACCAATGTTCAT
CTTCAAGACATGACGAGGCAAGAAGACGAGAGTTTCCTTCTAGCCAAAAACGCCGTCATGAGCTCACTAGCTGGAGCCGGATCTCAAGCCGTCTTCAACTCAGTT
CTCAATACAGTCGCGGTTCAAATCTCTTATGCAGACCACATTCAGATGGATGTTGATTCCTCCGCCTCTGTCAATAATGATGTTGAAGACGACGAAGACGACTTT
GATCCCTTCGAAGGCTACGACATCAATGATCCATACCTAGATTCACAGCCCTGCTGA
Protein sequenceShow/hide protein sequence
MWSCRRERRRGTPETKSPPLLLERRRGEGTAAVAMDRRDVGSKSRRRWMCRQPPLLLLRTRRLGMWSSVVELQQRGWGLKSRRTAAEKDVGVAATPCAAAAVERI
ENPVLPTVSKSPGIPQVDPCHPIFQPNSFKIGSLKENLSNLLLEINRRLSSLSIDRGDSSKKNDVVKIINVVAAIPTTTQASSSTILTVTMHTEMKSHYPQPSPP
DLGWDDLRHDQRAYDGTSIVTWKLMDILNTKIYSDLNAEALLGLRCRKMSNYKWYKDTFLTRLHTITTCGADIWKQKFVEGLPHYIAQRFYQTAVANSTTNQINW
SELTIGDITATIQGICVNLCLENKHTAKVIKDPDYRKELGTFCKQYGFDCGPRDERKKKKKSSNKVKGEAKKPIQIEDLYNEVNTLKREVANSKQRLSILQYTFR
KFQELKPTEGETSSTPERTLLTGSPSGINYISKVQNQKWMSKIIFKIRDFQLETFALIDSGADKNVIQEGLVPSKYFEKTKEVLSGAGGNLLNIKYKLSKVHICK
DDVCLINNTFILVKNLNEGVILGSENSLADYLSREHLLKTLKSALTSLPSDGTSSQLKAAEHSAAPPQNNQQPPSPRNESIISPQAVTSSSRAATSKGKRPISQS
SVPSPMSAENYAVHIQFKTVSRRQQGSSQKTLSIQSGPFSLPIPSSTLLRPFGTTTKNRRPATATTSSKPTVPRNPSSFSQIVRPKVFQPRPPINGYFTKTTMVD
STIEPEFNGPSVQEGMFVGKKLTRPFQSQTYNYRDYIKAWYVVFCLQGYNHSWFVTFSEKMKNWFKTNVHLQDMTRQEDESFLLAKNAVMSSLAGAGSQAVFNSV
LNTVAVQISYADHIQMDVDSSASVNNDVEDDEDDFDPFEGYDINDPYLDSQPC