; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005960 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005960
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionEnzymatic polyprotein
Genome locationchr6:34572516..34574400
RNA-Seq ExpressionLag0005960
SyntenyLag0005960
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR001995 - Peptidase A2A, retrovirus, catalytic
IPR018061 - Retropepsins
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052109.1 Enzymatic polyprotein [Cucumis melo var. makuwa]3.7e-13444.39Show/hide
Query:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE
        MKNHY +PSPPDLGWDDL  + RT+D +SL TWNIDG SE Q++  F+EM  AA  Y+ K S  +T QIL+ GF G LRSWWHN LT ++R+ IL     
Subjt:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE

Query:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------
         +             P+ VN L+YT  K+F+G T ++ + AT ALL LK  K+S +KWYKDTF+  LY LTTC   +WK K+VEGLP             
Subjt:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------

Query:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFK
                                                             QYGL   P      K+K  S KK F+K K    E+P+RR+    + K
Subjt:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFK

Query:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP
         KKK S+K  T+CFKC + GHYANRCPL  K+N L IDE+TKQSLL   R+ +D SS+T+SSS  D     IN + EE SS E  F   S+ S+D+GAIP
Subjt:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP

Query:  --------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKT
                CSG INV+T  Q+ L  LI++IPD+EA+++ LLKL+ SL+ Q  +   +NP+ +S+Q ILN+++ E    +++ D+  E+K+LK+EVAENK 
Subjt:  --------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKT

Query:  RITNLEEAFYLIQEKTIFQDIQKGPSNE-------SGPSSQNEDGINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQ
        R+  LE AF   QE  + ++  +   N+             +   IN+ISKV NKKW+S I  K++D +LET+ALIDSGADQNVI+EGL+P++Y E TK+
Subjt:  RITNLEEAFYLIQEKTIFQDIQKGPSNE-------SGPSSQNEDGINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQ

Query:  DLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK
         L  A   PL I +KLS VHIC  +VC V+TF++VK+L E +ILGTPFLTQLYPF V++K
Subjt:  DLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK

KAA0056776.1 Enzymatic polyprotein [Cucumis melo var. makuwa]9.7e-13544.48Show/hide
Query:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE
        MKNHY +PSPPDLGWDDL  + RT+D +SL TWNIDG SE Q++  F+EM  AA  Y+ K S  +T QIL+ GF G LRSWWHN LT ++R+ IL     
Subjt:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE

Query:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------
         +             P+ VN L+YT  K+F+G T ++ + AT ALL LKC K+S +KWYKDTF+  LY LTTC   +WK K+VEGLP             
Subjt:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------

Query:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQRF----K
                                                             QYGL   P      K+K  S KK F+K K    E+PRRR+R     K
Subjt:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQRF----K

Query:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL-RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP--
        SKK  S+K  T+CFKC + GHYANRCPL  K+N + IDE+TKQSLL  I +D  + + + SSS+ED   IN + EE SS E  F   S+ S+D+GAIP  
Subjt:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL-RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP--

Query:  ------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTRI
              CSG INV+T  Q+ L DLI++IPD+EA+++ LLKL+ SL+ Q  +   +NP+ +S+Q ILN+++ E    +++ D+  E+K+LK+EVAENK R+
Subjt:  ------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTRI

Query:  TNLEEAFYLIQEKTIFQDIQKGPSNESGPSSQNEDG----------INAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTK
          LE AF        FQ  Q      +    +   G          IN+ISK+ N+KW+S I  K++D +LE +ALIDSGADQNVI+EGL+P+RY E TK
Subjt:  TNLEEAFYLIQEKTIFQDIQKGPSNESGPSSQNEDG----------INAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTK

Query:  QDLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK
        + L GA   PL I +KLS VHIC  +VC V+TF++VK+L E +ILGTPFLTQLYPF V++K
Subjt:  QDLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK

KAA0057417.1 Enzymatic polyprotein [Cucumis melo var. makuwa]7.7e-11642.67Show/hide
Query:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE
        MKNHY +PSPPDLGWDDL  + RT+D +SL TWN DG  E Q++  F+EM  AA  Y+ K S  +T QIL+ GF G LRSWWHN LT ++R+ IL     
Subjt:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE

Query:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------
         +             P+ VN L+YT  K+F+G T ++ + AT ALL LKC K+S +KWYKDTF+  LY LTTC   +WK K+VEGLP             
Subjt:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------

Query:  ----------------------------------------------------SQYGLEDLP---PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFKS
                                                             QYGL   P     K+K  S KK F++ K    E+PRRR+    + K 
Subjt:  ----------------------------------------------------SQYGLEDLP---PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFKS

Query:  KKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIPC
        KK+ S+K  T+CFKC + GHYANRCPL  K+N L IDE TKQS+L   R  +D SS+T+SSS  D     IN + EE SS E  F   S+ S+D+GAIPC
Subjt:  KKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIPC

Query:  SG-----C---INVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQ--EVRPRNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTR
        +G     C   INV+T  Q+ L DLI++I D+EA+++ LLKL+ SL+ Q  +   +N + + +Q I N+++ E    +++ D+  E+K LK+EV ENK R
Subjt:  SG-----C---INVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQ--EVRPRNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTR

Query:  ITNLEEAFYLIQEKTIFQDIQKGPSNE-----SGPSSQNED--GINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQD
        +  LE AF   QE  + ++  +  +N+     +G +   ED   IN+ISKV NKKW+S I  K++D +LE +ALIDSGADQNVI+E L+P++Y E TK+ 
Subjt:  ITNLEEAFYLIQEKTIFQDIQKGPSNE-----SGPSSQNED--GINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQD

Query:  LRGAGNYPLKISYKLSNVHIC
        L GAG  PL I +KLS VHIC
Subjt:  LRGAGNYPLKISYKLSNVHIC

TYJ97599.1 Enzymatic polyprotein [Cucumis melo var. makuwa]3.7e-13444.18Show/hide
Query:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE
        MKNHY +PSPPDLGWDDL  + RT+D +SL TWNIDG SE Q++  F+EM  AA  Y+ K S  +T QIL+ GF G LRSWWHN LT ++R+ IL     
Subjt:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE

Query:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------
         +             P+ VN L+YT  K+F+G T ++ + AT ALL LKC K+S +KWYKDTF+  LY LTTC   +WK K+VEGLP             
Subjt:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------

Query:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQRF----K
                                                             QYGL   P      K+K  S KK F+K K    E+P+RR+R     K
Subjt:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQRF----K

Query:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL-RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP--
        SKK  S+K  T+CFKC + GHYANRCPL  K+N + IDE+TKQSLL  I +D  + + + SSS+ED   IN + EE SS E  F   S+ S+D+GAIP  
Subjt:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL-RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP--

Query:  ------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTRI
              CSG INV+T  Q+ L DLI++IPD+EA+++ LLKL+ SL+ Q  +   +NP+ +S+Q ILN+++ E    +++ D+  E+K+LK+EVAENK R+
Subjt:  ------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTRI

Query:  TNLEEAFYLIQEKTIFQDIQKGPSNESGPSSQNEDG----------INAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTK
          LE AF        FQ  Q      +    +   G          IN+IS++ N+KW+S I  K++D +LE +ALIDSGADQNVI+EGL+P+RY E TK
Subjt:  TNLEEAFYLIQEKTIFQDIQKGPSNESGPSSQNEDG----------INAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTK

Query:  QDLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK
        + L GA   PL I +KLS VHIC  +VC V+TF++VK+L E +ILGTPFLTQLYPF V++K
Subjt:  QDLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK

TYJ98087.1 Enzymatic polyprotein [Cucumis melo var. makuwa]3.7e-13444.39Show/hide
Query:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE
        MKNHY +PSPPDLGWDDL  + RT+D +SL TWNIDG SE Q++  F+EM  AA  Y+ K S  +T QIL+ GF G LRSWWHN LT ++R+ IL     
Subjt:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE

Query:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------
         +             P+ VN L+YT  K+F+G T ++ + AT ALL LK  K+S +KWYKDTF+  LY LTTC   +WK K+VEGLP             
Subjt:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------

Query:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFK
                                                             QYGL   P      K+K  S KK F+K K    E+P+RR+    + K
Subjt:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFK

Query:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP
         KKK S+K  T+CFKC + GHYANRCPL  K+N L IDE+TKQSLL   R+ +D SS+T+SSS  D     IN + EE SS E  F   S+ S+D+GAIP
Subjt:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP

Query:  --------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKT
                CSG INV+T  Q+ L  LI++IPD+EA+++ LLKL+ SL+ Q  +   +NP+ +S+Q ILN+++ E    +++ D+  E+K+LK+EVAENK 
Subjt:  --------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKT

Query:  RITNLEEAFYLIQEKTIFQDIQKGPSNE-------SGPSSQNEDGINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQ
        R+  LE AF   QE  + ++  +   N+             +   IN+ISKV NKKW+S I  K++D +LET+ALIDSGADQNVI+EGL+P++Y E TK+
Subjt:  RITNLEEAFYLIQEKTIFQDIQKGPSNE-------SGPSSQNEDGINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQ

Query:  DLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK
         L  A   PL I +KLS VHIC  +VC V+TF++VK+L E +ILGTPFLTQLYPF V++K
Subjt:  DLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK

TrEMBL top hitse value%identityAlignment
A0A5A7UF59 Enzymatic polyprotein1.8e-13444.39Show/hide
Query:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE
        MKNHY +PSPPDLGWDDL  + RT+D +SL TWNIDG SE Q++  F+EM  AA  Y+ K S  +T QIL+ GF G LRSWWHN LT ++R+ IL     
Subjt:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE

Query:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------
         +             P+ VN L+YT  K+F+G T ++ + AT ALL LK  K+S +KWYKDTF+  LY LTTC   +WK K+VEGLP             
Subjt:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------

Query:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFK
                                                             QYGL   P      K+K  S KK F+K K    E+P+RR+    + K
Subjt:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFK

Query:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP
         KKK S+K  T+CFKC + GHYANRCPL  K+N L IDE+TKQSLL   R+ +D SS+T+SSS  D     IN + EE SS E  F   S+ S+D+GAIP
Subjt:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP

Query:  --------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKT
                CSG INV+T  Q+ L  LI++IPD+EA+++ LLKL+ SL+ Q  +   +NP+ +S+Q ILN+++ E    +++ D+  E+K+LK+EVAENK 
Subjt:  --------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKT

Query:  RITNLEEAFYLIQEKTIFQDIQKGPSNE-------SGPSSQNEDGINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQ
        R+  LE AF   QE  + ++  +   N+             +   IN+ISKV NKKW+S I  K++D +LET+ALIDSGADQNVI+EGL+P++Y E TK+
Subjt:  RITNLEEAFYLIQEKTIFQDIQKGPSNE-------SGPSSQNEDGINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQ

Query:  DLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK
         L  A   PL I +KLS VHIC  +VC V+TF++VK+L E +ILGTPFLTQLYPF V++K
Subjt:  DLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK

A0A5A7UR29 Enzymatic polyprotein4.7e-13544.48Show/hide
Query:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE
        MKNHY +PSPPDLGWDDL  + RT+D +SL TWNIDG SE Q++  F+EM  AA  Y+ K S  +T QIL+ GF G LRSWWHN LT ++R+ IL     
Subjt:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE

Query:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------
         +             P+ VN L+YT  K+F+G T ++ + AT ALL LKC K+S +KWYKDTF+  LY LTTC   +WK K+VEGLP             
Subjt:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------

Query:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQRF----K
                                                             QYGL   P      K+K  S KK F+K K    E+PRRR+R     K
Subjt:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQRF----K

Query:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL-RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP--
        SKK  S+K  T+CFKC + GHYANRCPL  K+N + IDE+TKQSLL  I +D  + + + SSS+ED   IN + EE SS E  F   S+ S+D+GAIP  
Subjt:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL-RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP--

Query:  ------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTRI
              CSG INV+T  Q+ L DLI++IPD+EA+++ LLKL+ SL+ Q  +   +NP+ +S+Q ILN+++ E    +++ D+  E+K+LK+EVAENK R+
Subjt:  ------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTRI

Query:  TNLEEAFYLIQEKTIFQDIQKGPSNESGPSSQNEDG----------INAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTK
          LE AF        FQ  Q      +    +   G          IN+ISK+ N+KW+S I  K++D +LE +ALIDSGADQNVI+EGL+P+RY E TK
Subjt:  TNLEEAFYLIQEKTIFQDIQKGPSNESGPSSQNEDG----------INAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTK

Query:  QDLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK
        + L GA   PL I +KLS VHIC  +VC V+TF++VK+L E +ILGTPFLTQLYPF V++K
Subjt:  QDLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK

A0A5A7URX9 Enzymatic polyprotein3.7e-11642.67Show/hide
Query:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE
        MKNHY +PSPPDLGWDDL  + RT+D +SL TWN DG  E Q++  F+EM  AA  Y+ K S  +T QIL+ GF G LRSWWHN LT ++R+ IL     
Subjt:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE

Query:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------
         +             P+ VN L+YT  K+F+G T ++ + AT ALL LKC K+S +KWYKDTF+  LY LTTC   +WK K+VEGLP             
Subjt:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------

Query:  ----------------------------------------------------SQYGLEDLP---PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFKS
                                                             QYGL   P     K+K  S KK F++ K    E+PRRR+    + K 
Subjt:  ----------------------------------------------------SQYGLEDLP---PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFKS

Query:  KKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIPC
        KK+ S+K  T+CFKC + GHYANRCPL  K+N L IDE TKQS+L   R  +D SS+T+SSS  D     IN + EE SS E  F   S+ S+D+GAIPC
Subjt:  KKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIPC

Query:  SG-----C---INVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQ--EVRPRNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTR
        +G     C   INV+T  Q+ L DLI++I D+EA+++ LLKL+ SL+ Q  +   +N + + +Q I N+++ E    +++ D+  E+K LK+EV ENK R
Subjt:  SG-----C---INVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQ--EVRPRNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTR

Query:  ITNLEEAFYLIQEKTIFQDIQKGPSNE-----SGPSSQNED--GINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQD
        +  LE AF   QE  + ++  +  +N+     +G +   ED   IN+ISKV NKKW+S I  K++D +LE +ALIDSGADQNVI+E L+P++Y E TK+ 
Subjt:  ITNLEEAFYLIQEKTIFQDIQKGPSNE-----SGPSSQNED--GINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQD

Query:  LRGAGNYPLKISYKLSNVHIC
        L GAG  PL I +KLS VHIC
Subjt:  LRGAGNYPLKISYKLSNVHIC

A0A5D3BEY3 Enzymatic polyprotein1.8e-13444.18Show/hide
Query:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE
        MKNHY +PSPPDLGWDDL  + RT+D +SL TWNIDG SE Q++  F+EM  AA  Y+ K S  +T QIL+ GF G LRSWWHN LT ++R+ IL     
Subjt:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE

Query:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------
         +             P+ VN L+YT  K+F+G T ++ + AT ALL LKC K+S +KWYKDTF+  LY LTTC   +WK K+VEGLP             
Subjt:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------

Query:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQRF----K
                                                             QYGL   P      K+K  S KK F+K K    E+P+RR+R     K
Subjt:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQRF----K

Query:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL-RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP--
        SKK  S+K  T+CFKC + GHYANRCPL  K+N + IDE+TKQSLL  I +D  + + + SSS+ED   IN + EE SS E  F   S+ S+D+GAIP  
Subjt:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL-RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP--

Query:  ------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTRI
              CSG INV+T  Q+ L DLI++IPD+EA+++ LLKL+ SL+ Q  +   +NP+ +S+Q ILN+++ E    +++ D+  E+K+LK+EVAENK R+
Subjt:  ------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTRI

Query:  TNLEEAFYLIQEKTIFQDIQKGPSNESGPSSQNEDG----------INAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTK
          LE AF        FQ  Q      +    +   G          IN+IS++ N+KW+S I  K++D +LE +ALIDSGADQNVI+EGL+P+RY E TK
Subjt:  TNLEEAFYLIQEKTIFQDIQKGPSNESGPSSQNEDG----------INAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTK

Query:  QDLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK
        + L GA   PL I +KLS VHIC  +VC V+TF++VK+L E +ILGTPFLTQLYPF V++K
Subjt:  QDLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK

A0A5D3BG41 Enzymatic polyprotein1.8e-13444.39Show/hide
Query:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE
        MKNHY +PSPPDLGWDDL  + RT+D +SL TWNIDG SE Q++  F+EM  AA  Y+ K S  +T QIL+ GF G LRSWWHN LT ++R+ IL     
Subjt:  MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIE

Query:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------
         +             P+ VN L+YT  K+F+G T ++ + AT ALL LK  K+S +KWYKDTF+  LY LTTC   +WK K+VEGLP             
Subjt:  EIAKGVDGIEYTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLP-------------

Query:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFK
                                                             QYGL   P      K+K  S KK F+K K    E+P+RR+    + K
Subjt:  ----------------------------------------------------SQYGLEDLP----PSKRKNGSFKKTFKKQKNIISEAPRRRQ----RFK

Query:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP
         KKK S+K  T+CFKC + GHYANRCPL  K+N L IDE+TKQSLL   R+ +D SS+T+SSS  D     IN + EE SS E  F   S+ S+D+GAIP
Subjt:  SKKKQSTK-ETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLL---RILNDYSSETDSSSSSDEDLGRINEILEE-SSGESSF--SSEKSEDDGAIP

Query:  --------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKT
                CSG INV+T  Q+ L  LI++IPD+EA+++ LLKL+ SL+ Q  +   +NP+ +S+Q ILN+++ E    +++ D+  E+K+LK+EVAENK 
Subjt:  --------CSGCINVLTSYQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRP--RNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKT

Query:  RITNLEEAFYLIQEKTIFQDIQKGPSNE-------SGPSSQNEDGINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQ
        R+  LE AF   QE  + ++  +   N+             +   IN+ISKV NKKW+S I  K++D +LET+ALIDSGADQNVI+EGL+P++Y E TK+
Subjt:  RITNLEEAFYLIQEKTIFQDIQKGPSNE-------SGPSSQNEDGINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQ

Query:  DLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK
         L  A   PL I +KLS VHIC  +VC V+TF++VK+L E +ILGTPFLTQLYPF V++K
Subjt:  DLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTPFLTQLYPFIVSEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACCATTACAGACGACCATCTCCTCCAGATCTGGGATGGGATGATCTCGCTCCTGATCATAGGACCTTCGATCCGAAATCCCTAACGACATGGAATATCGACGG
CTGTTCAGAAGGTCAAATTGTTAGACTGTTTGAAGAAATGTTTGCCGCAGCAATAGTCTATACTCAAAAATTTAGTCAAAAGGATACAACTCAAATCCTAGTTTCGGGAT
TCACAGGTATTCTAAGAAGTTGGTGGCACAACCACCTTACAGCTGAGAATCGAGAGTTGATTCTCAATCACCAAATAGAAGAAATAGCAAAAGGCGTCGATGGTATCGAA
TATACCACGATGGCTCCTAATGCAGTCAATGCCCTTATCTATACGACATTAAAGAACTTTGTAGGACACACGACCCTATACAGTGATCAAGCAACAGCTGCTCTGCTAAG
CCTAAAATGTCCTAAGATTAGTGACTTCAAATGGTATAAAGACACTTTTCTTATGTGTCTTTACATTCTCACTACATGCCATGATGTCCTATGGAAACACAAATATGTTG
AAGGACTTCCAAGCCAGTATGGCCTCGAAGATCTTCCTCCTTCCAAAAGAAAGAATGGCTCTTTCAAAAAGACCTTCAAAAAGCAAAAGAATATTATTTCTGAAGCTCCT
CGCAGACGACAGAGGTTCAAAAGCAAAAAGAAGCAATCTACAAAAGAAACCGTATGCTTCAAATGCAAAAAACCAGGACACTATGCCAATCGGTGCCCATTGGTCAAGAA
AGTTAACAAATTAGAGATCGACGAAGATACTAAACAGTCTCTTCTTCGAATCTTAAACGATTATTCTTCAGAGACAGACTCCTCCTCCTCCTCCGATGAAGACTTGGGAA
GAATAAATGAAATTCTCGAAGAATCCTCAGGAGAATCTTCATTTTCTAGTGAAAAGTCTGAAGATGACGGTGCTATTCCCTGTAGTGGATGCATCAATGTTCTGACTTCT
TATCAAAAAGATCTTTTGGATCTAATTGATGAAATCCCTGATAAGGAAGCTCAAAAATCAATTCTCCTAAAGCTTCGATCAAGTTTAAAAGCTCAGGAGGTTCGACCCAG
AAATCCGGTAGATTTCTCTTTTCAGAGTATCTTGAACCAGCTCCAAAGTGAAGGTACGACTTCAGTCAAAATTCCAGATATGCAACAAGAAATCAAATCCTTGAAGAAGG
AGGTTGCAGAAAACAAAACAAGAATTACCAATCTTGAGGAAGCCTTCTATCTTATCCAAGAAAAGACAATCTTCCAAGATATTCAAAAGGGGCCTAGCAATGAGTCAGGT
CCATCTTCTCAAAATGAGGATGGCATAAATGCCATCAGCAAAGTCTTCAATAAAAAATGGTTATCCAATATCACACTGAAGATTCAAGACTCTAAGCTCGAAACTATTGC
TCTAATAGATTCCGGAGCCGATCAAAATGTCATTCGCGAAGGATTGATACCAACAAGATATTGCGAAATGACAAAACAAGATCTTAGAGGAGCAGGCAACTATCCACTAA
AGATCAGTTACAAATTATCCAACGTTCATATCTGCAATGAAGAAGTGTGTTTCGTAAGCACATTTCTTATTGTAAAGGATCTGACGGAAGAAGTAATTCTGGGTACTCCT
TTCTTAACCCAGTTATATCCATTCATAGTTTCTGAGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAACCATTACAGACGACCATCTCCTCCAGATCTGGGATGGGATGATCTCGCTCCTGATCATAGGACCTTCGATCCGAAATCCCTAACGACATGGAATATCGACGG
CTGTTCAGAAGGTCAAATTGTTAGACTGTTTGAAGAAATGTTTGCCGCAGCAATAGTCTATACTCAAAAATTTAGTCAAAAGGATACAACTCAAATCCTAGTTTCGGGAT
TCACAGGTATTCTAAGAAGTTGGTGGCACAACCACCTTACAGCTGAGAATCGAGAGTTGATTCTCAATCACCAAATAGAAGAAATAGCAAAAGGCGTCGATGGTATCGAA
TATACCACGATGGCTCCTAATGCAGTCAATGCCCTTATCTATACGACATTAAAGAACTTTGTAGGACACACGACCCTATACAGTGATCAAGCAACAGCTGCTCTGCTAAG
CCTAAAATGTCCTAAGATTAGTGACTTCAAATGGTATAAAGACACTTTTCTTATGTGTCTTTACATTCTCACTACATGCCATGATGTCCTATGGAAACACAAATATGTTG
AAGGACTTCCAAGCCAGTATGGCCTCGAAGATCTTCCTCCTTCCAAAAGAAAGAATGGCTCTTTCAAAAAGACCTTCAAAAAGCAAAAGAATATTATTTCTGAAGCTCCT
CGCAGACGACAGAGGTTCAAAAGCAAAAAGAAGCAATCTACAAAAGAAACCGTATGCTTCAAATGCAAAAAACCAGGACACTATGCCAATCGGTGCCCATTGGTCAAGAA
AGTTAACAAATTAGAGATCGACGAAGATACTAAACAGTCTCTTCTTCGAATCTTAAACGATTATTCTTCAGAGACAGACTCCTCCTCCTCCTCCGATGAAGACTTGGGAA
GAATAAATGAAATTCTCGAAGAATCCTCAGGAGAATCTTCATTTTCTAGTGAAAAGTCTGAAGATGACGGTGCTATTCCCTGTAGTGGATGCATCAATGTTCTGACTTCT
TATCAAAAAGATCTTTTGGATCTAATTGATGAAATCCCTGATAAGGAAGCTCAAAAATCAATTCTCCTAAAGCTTCGATCAAGTTTAAAAGCTCAGGAGGTTCGACCCAG
AAATCCGGTAGATTTCTCTTTTCAGAGTATCTTGAACCAGCTCCAAAGTGAAGGTACGACTTCAGTCAAAATTCCAGATATGCAACAAGAAATCAAATCCTTGAAGAAGG
AGGTTGCAGAAAACAAAACAAGAATTACCAATCTTGAGGAAGCCTTCTATCTTATCCAAGAAAAGACAATCTTCCAAGATATTCAAAAGGGGCCTAGCAATGAGTCAGGT
CCATCTTCTCAAAATGAGGATGGCATAAATGCCATCAGCAAAGTCTTCAATAAAAAATGGTTATCCAATATCACACTGAAGATTCAAGACTCTAAGCTCGAAACTATTGC
TCTAATAGATTCCGGAGCCGATCAAAATGTCATTCGCGAAGGATTGATACCAACAAGATATTGCGAAATGACAAAACAAGATCTTAGAGGAGCAGGCAACTATCCACTAA
AGATCAGTTACAAATTATCCAACGTTCATATCTGCAATGAAGAAGTGTGTTTCGTAAGCACATTTCTTATTGTAAAGGATCTGACGGAAGAAGTAATTCTGGGTACTCCT
TTCTTAACCCAGTTATATCCATTCATAGTTTCTGAGAAATGA
Protein sequenceShow/hide protein sequence
MKNHYRRPSPPDLGWDDLAPDHRTFDPKSLTTWNIDGCSEGQIVRLFEEMFAAAIVYTQKFSQKDTTQILVSGFTGILRSWWHNHLTAENRELILNHQIEEIAKGVDGIE
YTTMAPNAVNALIYTTLKNFVGHTTLYSDQATAALLSLKCPKISDFKWYKDTFLMCLYILTTCHDVLWKHKYVEGLPSQYGLEDLPPSKRKNGSFKKTFKKQKNIISEAP
RRRQRFKSKKKQSTKETVCFKCKKPGHYANRCPLVKKVNKLEIDEDTKQSLLRILNDYSSETDSSSSSDEDLGRINEILEESSGESSFSSEKSEDDGAIPCSGCINVLTS
YQKDLLDLIDEIPDKEAQKSILLKLRSSLKAQEVRPRNPVDFSFQSILNQLQSEGTTSVKIPDMQQEIKSLKKEVAENKTRITNLEEAFYLIQEKTIFQDIQKGPSNESG
PSSQNEDGINAISKVFNKKWLSNITLKIQDSKLETIALIDSGADQNVIREGLIPTRYCEMTKQDLRGAGNYPLKISYKLSNVHICNEEVCFVSTFLIVKDLTEEVILGTP
FLTQLYPFIVSEK