; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0006175 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0006175
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr04:10448991..10451083
RNA-Seq ExpressionPay0006175
SyntenyPay0006175
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035966.1 F5J5.1 [Cucumis melo var. makuwa]1.2e-18555.74Show/hide
Query:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL
        +I VNGVSVPKPE+DWT+AEEQASVGN RA+N IFNGVDLNVFKLI+SC+T KEAWK LEVAYEGTSKVKISRLQLITSKFEAL+M EDE+ S+YN+RVL
Subjt:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL

Query:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL
        +IAN+SLLLGEKI +SKIVRKVLRS+ RKFD+KVTAIEEA DITTLKLDELFGSLLTFEMA  DRESKK K I+FKS +   N       +AN DESIAL
Subjt:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL

Query:  ITKLFSKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDD----------------------------HELTFEELK
        +TK F+   R  K+ N                   ECPT+L++QKKN+  TLSDE+S D  DD                             EL+ E+LK
Subjt:  ITKLFSKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDD----------------------------HELTFEELK

Query:  ILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKR--SYTV
         L KED EAR IQKE IQDL++ENE LM +ISSLK+KL+EVQN                  LDSIL +G NGS +YG+GF +S    K T E +    ++
Subjt:  ILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKR--SYTV

Query:  KLLRDRRHQQ---RPKLVNQHNQYRIIKRNNDVRG----------------------THLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYF
        ++  D  H +   R  + +    Y    +   +R                         ++WR+K  E+C IAFT+VQT  D WYFDSGCSRHMTGNR +
Subjt:  KLLRDRRHQQ---RPKLVNQHNQYRIIKRNNDVRG----------------------THLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYF

Query:  FTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNIC
        F  L +C   HVTFGDGAKGKIIAKGNI+K +LP LN+VRYVDGL ANL S+++LCDQGY V+F++  C+V +K NQ+ MSG+ +ADNCYHW+SN S+ C
Subjt:  FTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNIC

Query:  HLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK
         L + DQ WLWHRKL +  +R L+KVI+N+AVVGIP L +NG FFCGDCQ+GK
Subjt:  HLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK

KAA0058389.1 F5J5.1 [Cucumis melo var. makuwa]8.0e-22569.32Show/hide
Query:  GVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVLEIAND
        G S  +P +   DAEEQASVGNARAIN IFNGVDLNVFKLI+ CT  KEAWKILEVAYEGTSKVKISRLQLITSKFEALKM EDE+ SEYNERVLEIAND
Subjt:  GVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVLEIAND

Query:  SLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIALITKLF
        SLLLGEKI  SKIVRKVLRSLP KFD+KVTAIEEAQD+TTLKLD+LFGSLLTFEMAM DRESKK                                    
Subjt:  SLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIALITKLF

Query:  SKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDD-----------------------EDDHELTFEELKILRKEDSEAR
                     G++ +  +  GFGHY AEC TYL+RQKKNYCATLSDEDSDD                        ++D ELT EELK+LRKEDSEAR
Subjt:  SKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDD-----------------------EDDHELTFEELKILRKEDSEAR

Query:  AIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKRSYTVKLLRDRRHQQRP
        AIQKERIQDLMDENERLMG+ISSLKVKLKEVQNV                 LDSILSS QN SSKYG+GFDTSTRGVKITPE+                 
Subjt:  AIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKRSYTVKLLRDRRHQQRP

Query:  KLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLN
              NQ R+ KRNN+VRGTH+IWRVKT EKCN+AFTTVQTH DAWYFDSGC RHMTGNR FFTELEEC+S  VTFGDGAK KIIAKGNIDKSNLPCLN
Subjt:  KLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLN

Query:  EVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNICHLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPS
        EVRYVDGL ANL S+S+LCDQGY+VNFNN  C+VTDKNNQVFMSGR EADNCYHWSSN SN CHLTK+DQ WLWHRKLG+I LRSLDKVIRNEAVVGIPS
Subjt:  EVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNICHLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPS

Query:  LHINGKFFCGDCQVGK
        L INGKFFCGDCQVGK
Subjt:  LHINGKFFCGDCQVGK

TYK30437.1 F5J5.1 [Cucumis melo var. makuwa]1.2e-18555.74Show/hide
Query:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL
        +I VNGVSVPKPE+DWT+AEEQASVGN RA+N IFNGVDLNVFKLI+SC+T KEAWK LEVAYEGTSKVKISRLQLITSKFEAL+M EDE+ S+YN+RVL
Subjt:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL

Query:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL
        +IAN+SLLLGEKI +SKIVRKVLRS+ RKFD+KVTAIEEA DITTLKLDELFGSLLTFEMA  DRESKK K I+FKS +   N       +AN DESIAL
Subjt:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL

Query:  ITKLFSKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDD----------------------------HELTFEELK
        +TK F+   R  K+ N                   ECPT+L++QKKN+  TLSDE+S D  DD                             EL+ E+LK
Subjt:  ITKLFSKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDD----------------------------HELTFEELK

Query:  ILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKR--SYTV
         L KED EAR IQKE IQDL++ENE LM +ISSLK+KL+EVQN                  LDSIL +G NGS +YG+GF +S    K T E +    ++
Subjt:  ILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKR--SYTV

Query:  KLLRDRRHQQ---RPKLVNQHNQYRIIKRNNDVRG----------------------THLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYF
        ++  D  H +   R  + +    Y    +   +R                         ++WR+K  E+C IAFT+VQT  D WYFDSGCSRHMTGNR +
Subjt:  KLLRDRRHQQ---RPKLVNQHNQYRIIKRNNDVRG----------------------THLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYF

Query:  FTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNIC
        F  L +C   HVTFGDGAKGKIIAKGNI+K +LP LN+VRYVDGL ANL S+++LCDQGY V+F++  C+V +K NQ+ MSG+ +ADNCYHW+SN S+ C
Subjt:  FTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNIC

Query:  HLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK
         L + DQ WLWHRKL +  +R L+KVI+N+AVVGIP L +NG FFCGDCQ+GK
Subjt:  HLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK

XP_008454684.1 PREDICTED: uncharacterized protein LOC103495039 [Cucumis melo]0.0e+0094.85Show/hide
Query:  EIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVLEIANDSLLLGEK
        +IDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTT KEAWKILEVAYEGTSKVKI RLQL+TSKFEALKMIEDETASEYNERVLEIANDSLLLGEK
Subjt:  EIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVLEIANDSLLLGEK

Query:  IHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIALITKLFSKMARKF
        IHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQ  NEANQDESIALI +LFSKMARKF
Subjt:  IHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIALITKLFSKMARKF

Query:  KSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDDH-------------ELTFEELKILRKEDSEARAIQKERIQDLMDENERL
        KSLNTAGKTEKMEDVM      AECPTYLKRQKKNYCATLSDEDSDDDEDDH             ELTFEELKILRKEDSEARAIQKERIQDLMDENERL
Subjt:  KSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDDH-------------ELTFEELKILRKEDSEARAIQKERIQDLMDENERL

Query:  MGIISSLKVKLKEVQNVLDSILSSGQNGSSKYGIGFDTSTRGVKITPEKRSYTVKLLRDRRHQQRPKLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCN
        MGIISSLKVKLKEVQNVLDSILSSGQNGSSKYGIGFDTSTRGVKITPEKRSYT+KLLRDRRHQQRPKL NQHNQYRIIKRNNDVRGTHLIWRVKTSEKCN
Subjt:  MGIISSLKVKLKEVQNVLDSILSSGQNGSSKYGIGFDTSTRGVKITPEKRSYTVKLLRDRRHQQRPKLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCN

Query:  IAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIV
        IAFTTVQTHFDAWYFDSGCSRHMT NRYFFTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIV
Subjt:  IAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIV

Query:  TDKNNQVFMSGRWEADNCYHWSSNGSNICHLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK
        TDKNNQVFMSGRWEADNCYHWSSNGSNICHLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK
Subjt:  TDKNNQVFMSGRWEADNCYHWSSNGSNICHLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK

XP_016903608.1 PREDICTED: uncharacterized protein LOC107992254 [Cucumis melo]5.6e-19474.9Show/hide
Query:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL
        MITVNGVSVPKPEIDWTDAEE+ASVGNARAIN +FNGVDLN+FKLI+S TT KEAWKILEVAYEGTSKVKISRLQLITSKFEALKM EDET SEYNERVL
Subjt:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL

Query:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL
        EIANDSLLL EKI ESKIV KVLRSLPRKFD+KVTAIEEAQDITTLKLDELFGSLLTFEMA+ DRESKK KRIAFKSVYDQENTVNQS NEANQDES+AL
Subjt:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL

Query:  ITKLFSKMARK-----FKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDDH---------------------------ELTF
        +TK FSKMARK      K     G + +  +  G GHY AECP YL RQKKNYCATLSDE+SD+DEDDH                           ELT 
Subjt:  ITKLFSKMARK-----FKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDDH---------------------------ELTF

Query:  EELKILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEK---
        EELKILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLK+VQNV                 LDSILSSGQNGSSKY        R  ++  +K   
Subjt:  EELKILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEK---

Query:  RSYTVKLLRDRRHQQRPKLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKG
        RS+  KLLRDRRHQQRPK  N+ NQYR IKRNNDVRGTH IWRV TS KCN+AFTTVQTH DAWYFDSGCSR MTGNR FFTELEEC+S HVTF DGAKG
Subjt:  RSYTVKLLRDRRHQQRPKLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKG

Query:  KIIAKGNIDK
        KIIAKGNIDK
Subjt:  KIIAKGNIDK

TrEMBL top hitse value%identityAlignment
A0A1S3BZ69 uncharacterized protein LOC1034950390.0e+0094.85Show/hide
Query:  EIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVLEIANDSLLLGEK
        +IDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTT KEAWKILEVAYEGTSKVKI RLQL+TSKFEALKMIEDETASEYNERVLEIANDSLLLGEK
Subjt:  EIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVLEIANDSLLLGEK

Query:  IHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIALITKLFSKMARKF
        IHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQ  NEANQDESIALI +LFSKMARKF
Subjt:  IHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIALITKLFSKMARKF

Query:  KSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDDH-------------ELTFEELKILRKEDSEARAIQKERIQDLMDENERL
        KSLNTAGKTEKMEDVM      AECPTYLKRQKKNYCATLSDEDSDDDEDDH             ELTFEELKILRKEDSEARAIQKERIQDLMDENERL
Subjt:  KSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDDH-------------ELTFEELKILRKEDSEARAIQKERIQDLMDENERL

Query:  MGIISSLKVKLKEVQNVLDSILSSGQNGSSKYGIGFDTSTRGVKITPEKRSYTVKLLRDRRHQQRPKLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCN
        MGIISSLKVKLKEVQNVLDSILSSGQNGSSKYGIGFDTSTRGVKITPEKRSYT+KLLRDRRHQQRPKL NQHNQYRIIKRNNDVRGTHLIWRVKTSEKCN
Subjt:  MGIISSLKVKLKEVQNVLDSILSSGQNGSSKYGIGFDTSTRGVKITPEKRSYTVKLLRDRRHQQRPKLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCN

Query:  IAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIV
        IAFTTVQTHFDAWYFDSGCSRHMT NRYFFTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIV
Subjt:  IAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIV

Query:  TDKNNQVFMSGRWEADNCYHWSSNGSNICHLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK
        TDKNNQVFMSGRWEADNCYHWSSNGSNICHLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK
Subjt:  TDKNNQVFMSGRWEADNCYHWSSNGSNICHLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK

A0A1S4E5V5 uncharacterized protein LOC1079922542.7e-19474.9Show/hide
Query:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL
        MITVNGVSVPKPEIDWTDAEE+ASVGNARAIN +FNGVDLN+FKLI+S TT KEAWKILEVAYEGTSKVKISRLQLITSKFEALKM EDET SEYNERVL
Subjt:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL

Query:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL
        EIANDSLLL EKI ESKIV KVLRSLPRKFD+KVTAIEEAQDITTLKLDELFGSLLTFEMA+ DRESKK KRIAFKSVYDQENTVNQS NEANQDES+AL
Subjt:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL

Query:  ITKLFSKMARK-----FKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDDH---------------------------ELTF
        +TK FSKMARK      K     G + +  +  G GHY AECP YL RQKKNYCATLSDE+SD+DEDDH                           ELT 
Subjt:  ITKLFSKMARK-----FKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDDH---------------------------ELTF

Query:  EELKILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEK---
        EELKILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLK+VQNV                 LDSILSSGQNGSSKY        R  ++  +K   
Subjt:  EELKILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEK---

Query:  RSYTVKLLRDRRHQQRPKLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKG
        RS+  KLLRDRRHQQRPK  N+ NQYR IKRNNDVRGTH IWRV TS KCN+AFTTVQTH DAWYFDSGCSR MTGNR FFTELEEC+S HVTF DGAKG
Subjt:  RSYTVKLLRDRRHQQRPKLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKG

Query:  KIIAKGNIDK
        KIIAKGNIDK
Subjt:  KIIAKGNIDK

A0A5A7T169 F5J5.16.0e-18655.74Show/hide
Query:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL
        +I VNGVSVPKPE+DWT+AEEQASVGN RA+N IFNGVDLNVFKLI+SC+T KEAWK LEVAYEGTSKVKISRLQLITSKFEAL+M EDE+ S+YN+RVL
Subjt:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL

Query:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL
        +IAN+SLLLGEKI +SKIVRKVLRS+ RKFD+KVTAIEEA DITTLKLDELFGSLLTFEMA  DRESKK K I+FKS +   N       +AN DESIAL
Subjt:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL

Query:  ITKLFSKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDD----------------------------HELTFEELK
        +TK F+   R  K+ N                   ECPT+L++QKKN+  TLSDE+S D  DD                             EL+ E+LK
Subjt:  ITKLFSKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDD----------------------------HELTFEELK

Query:  ILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKR--SYTV
         L KED EAR IQKE IQDL++ENE LM +ISSLK+KL+EVQN                  LDSIL +G NGS +YG+GF +S    K T E +    ++
Subjt:  ILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKR--SYTV

Query:  KLLRDRRHQQ---RPKLVNQHNQYRIIKRNNDVRG----------------------THLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYF
        ++  D  H +   R  + +    Y    +   +R                         ++WR+K  E+C IAFT+VQT  D WYFDSGCSRHMTGNR +
Subjt:  KLLRDRRHQQ---RPKLVNQHNQYRIIKRNNDVRG----------------------THLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYF

Query:  FTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNIC
        F  L +C   HVTFGDGAKGKIIAKGNI+K +LP LN+VRYVDGL ANL S+++LCDQGY V+F++  C+V +K NQ+ MSG+ +ADNCYHW+SN S+ C
Subjt:  FTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNIC

Query:  HLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK
         L + DQ WLWHRKL +  +R L+KVI+N+AVVGIP L +NG FFCGDCQ+GK
Subjt:  HLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK

A0A5A7UXZ3 F5J5.13.9e-22569.32Show/hide
Query:  GVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVLEIAND
        G S  +P +   DAEEQASVGNARAIN IFNGVDLNVFKLI+ CT  KEAWKILEVAYEGTSKVKISRLQLITSKFEALKM EDE+ SEYNERVLEIAND
Subjt:  GVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVLEIAND

Query:  SLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIALITKLF
        SLLLGEKI  SKIVRKVLRSLP KFD+KVTAIEEAQD+TTLKLD+LFGSLLTFEMAM DRESKK                                    
Subjt:  SLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIALITKLF

Query:  SKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDD-----------------------EDDHELTFEELKILRKEDSEAR
                     G++ +  +  GFGHY AEC TYL+RQKKNYCATLSDEDSDD                        ++D ELT EELK+LRKEDSEAR
Subjt:  SKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDD-----------------------EDDHELTFEELKILRKEDSEAR

Query:  AIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKRSYTVKLLRDRRHQQRP
        AIQKERIQDLMDENERLMG+ISSLKVKLKEVQNV                 LDSILSS QN SSKYG+GFDTSTRGVKITPE+                 
Subjt:  AIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKRSYTVKLLRDRRHQQRP

Query:  KLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLN
              NQ R+ KRNN+VRGTH+IWRVKT EKCN+AFTTVQTH DAWYFDSGC RHMTGNR FFTELEEC+S  VTFGDGAK KIIAKGNIDKSNLPCLN
Subjt:  KLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYFFTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLN

Query:  EVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNICHLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPS
        EVRYVDGL ANL S+S+LCDQGY+VNFNN  C+VTDKNNQVFMSGR EADNCYHWSSN SN CHLTK+DQ WLWHRKLG+I LRSLDKVIRNEAVVGIPS
Subjt:  EVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNICHLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPS

Query:  LHINGKFFCGDCQVGK
        L INGKFFCGDCQVGK
Subjt:  LHINGKFFCGDCQVGK

A0A5D3E2Y4 F5J5.16.0e-18655.74Show/hide
Query:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL
        +I VNGVSVPKPE+DWT+AEEQASVGN RA+N IFNGVDLNVFKLI+SC+T KEAWK LEVAYEGTSKVKISRLQLITSKFEAL+M EDE+ S+YN+RVL
Subjt:  MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVL

Query:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL
        +IAN+SLLLGEKI +SKIVRKVLRS+ RKFD+KVTAIEEA DITTLKLDELFGSLLTFEMA  DRESKK K I+FKS +   N       +AN DESIAL
Subjt:  EIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIAL

Query:  ITKLFSKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDD----------------------------HELTFEELK
        +TK F+   R  K+ N                   ECPT+L++QKKN+  TLSDE+S D  DD                             EL+ E+LK
Subjt:  ITKLFSKMARKFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDD----------------------------HELTFEELK

Query:  ILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKR--SYTV
         L KED EAR IQKE IQDL++ENE LM +ISSLK+KL+EVQN                  LDSIL +G NGS +YG+GF +S    K T E +    ++
Subjt:  ILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQNV-----------------LDSILSSGQNGSSKYGIGFDTSTRGVKITPEKR--SYTV

Query:  KLLRDRRHQQ---RPKLVNQHNQYRIIKRNNDVRG----------------------THLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYF
        ++  D  H +   R  + +    Y    +   +R                         ++WR+K  E+C IAFT+VQT  D WYFDSGCSRHMTGNR +
Subjt:  KLLRDRRHQQ---RPKLVNQHNQYRIIKRNNDVRG----------------------THLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSRHMTGNRYF

Query:  FTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNIC
        F  L +C   HVTFGDGAKGKIIAKGNI+K +LP LN+VRYVDGL ANL S+++LCDQGY V+F++  C+V +K NQ+ MSG+ +ADNCYHW+SN S+ C
Subjt:  FTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGSNIC

Query:  HLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK
         L + DQ WLWHRKL +  +R L+KVI+N+AVVGIP L +NG FFCGDCQ+GK
Subjt:  HLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein2.1e-0526.79Show/hide
Query:  ITVNGV---SVPKPEIDWTDAEEQAS------VGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLIT-----SKFEALKMI
        + VNGV       PE+  T   E+ S      V +A+A+ ++ + +  +VF+   S ++ K+ W +L    +G  +  I RL+ +T      + E LKM+
Subjt:  ITVNGV---SVPKPEIDWTDAEEQAS------VGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLIT-----SKFEALKMI

Query:  EDETASEYNERVLEIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDI---TTLKLDELF
        + E+ S Y ++ LEI         +  + +I + V  +L   FD   + +EE  D+   T+  L E F
Subjt:  EDETASEYNERVLEIANDSLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDI---TTLKLDELF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTACTGTGAATGGAGTATCAGTGCCAAAACCAGAAATTGACTGGACAGATGCTGAAGAACAAGCTTCAGTTGGAAATGCAAGAGCCATAAATGTTATCTTC
AATGGTGTCGATTTAAACGTGTTCAAACTTATCGATTCCTGCACTACTGGTAAAGAGGCGTGGAAAATACTGGAAGTTGCATATGAAGGAACTTCTAAAGTGAAG
ATATCCAGACTGCAGTTGATAACTTCAAAATTCGAAGCCTTGAAAATGATTGAAGATGAGACAGCCTCTGAATACAATGAGAGGGTCCTAGAGATAGCTAATGAT
TCGTTACTACTTGGTGAAAAGATACATGAGTCTAAAATTGTTCGCAAAGTGTTGCGCTCCTTACCCAGAAAGTTTGACATAAAGGTCACTGCCATAGAAGAAGCC
CAAGATATAACGACGTTAAAACTTGACGAGCTATTTGGGTCACTACTTACGTTTGAAATGGCTATGTTGGATAGAGAAAGTAAGAAAGATAAGAGGATAGCATTC
AAATCAGTTTATGATCAGGAGAACACTGTAAATCAATCTAGTAATGAAGCTAATCAAGATGAGTCAATAGCTCTCATAACAAAACTATTCTCTAAGATGGCCAGA
AAGTTCAAAAGTTTGAATACTGCTGGAAAAACTGAAAAAATGGAAGACGTGATGGGGTTTGGTCATTATCTGGCTGAATGTCCTACTTATCTCAAAAGACAAAAG
AAAAATTATTGTGCTACCCTGTCTGATGAGGATTCAGATGATGATGAAGATGATCATGAGCTAACGTTTGAAGAACTCAAAATACTGAGGAAGGAAGACTCAGAA
GCCAGAGCTATTCAAAAAGAAAGAATTCAAGACTTAATGGATGAAAATGAACGATTGATGGGGATTATATCATCTCTGAAAGTAAAGTTGAAAGAAGTTCAGAAT
GTCTTAGACTCAATCCTGAGTTCAGGGCAAAATGGTTCAAGTAAATATGGCATCGGATTTGATACTTCAACTAGGGGTGTCAAGATTACTCCAGAAAAGAGGTCA
TATACGGTCAAATTACTGAGAGATAGGAGACATCAGCAAAGACCAAAACTTGTAAACCAGCATAATCAGTATAGGATCATCAAAAGGAACAATGATGTAAGGGGA
ACTCACTTGATCTGGAGGGTGAAGACTTCTGAGAAGTGCAATATAGCATTTACAACAGTTCAAACCCATTTTGATGCTTGGTACTTTGACAGTGGATGCTCAAGA
CATATGACTGGCAATCGATATTTCTTTACTGAGTTAGAAGAATGTTCATCAAGTCATGTCACTTTTGGAGATGGGGCCAAAGGAAAAATTATTGCGAAAGGAAAC
ATTGACAAAAGTAATCTACCCTGTCTTAATGAAGTTAGATATGTGGATGGACTGAATGCAAACTTGAGTAGTGTAAGTCGACTATGTGACCAAGGATACAACGTA
AACTTTAACAATACTTGTTGTATAGTTACAGACAAGAATAATCAAGTGTTCATGAGTGGCAGATGGGAAGCAGATAACTGTTATCATTGGAGTTCCAATGGTTCA
AACATATGTCACTTAACTAAAGTTGATCAGAACTGGTTGTGGCATAGGAAATTGGGGAACATCGGCTTGAGAAGTTTAGATAAAGTTATCAGAAACGAGGCGGTT
GTAGGCATTCCTTCGTTACACATAAATGGCAAATTCTTTTGTGGTGACTGTCAAGTTGGAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGATTACTGTGAATGGAGTATCAGTGCCAAAACCAGAAATTGACTGGACAGATGCTGAAGAACAAGCTTCAGTTGGAAATGCAAGAGCCATAAATGTTATCTTC
AATGGTGTCGATTTAAACGTGTTCAAACTTATCGATTCCTGCACTACTGGTAAAGAGGCGTGGAAAATACTGGAAGTTGCATATGAAGGAACTTCTAAAGTGAAG
ATATCCAGACTGCAGTTGATAACTTCAAAATTCGAAGCCTTGAAAATGATTGAAGATGAGACAGCCTCTGAATACAATGAGAGGGTCCTAGAGATAGCTAATGAT
TCGTTACTACTTGGTGAAAAGATACATGAGTCTAAAATTGTTCGCAAAGTGTTGCGCTCCTTACCCAGAAAGTTTGACATAAAGGTCACTGCCATAGAAGAAGCC
CAAGATATAACGACGTTAAAACTTGACGAGCTATTTGGGTCACTACTTACGTTTGAAATGGCTATGTTGGATAGAGAAAGTAAGAAAGATAAGAGGATAGCATTC
AAATCAGTTTATGATCAGGAGAACACTGTAAATCAATCTAGTAATGAAGCTAATCAAGATGAGTCAATAGCTCTCATAACAAAACTATTCTCTAAGATGGCCAGA
AAGTTCAAAAGTTTGAATACTGCTGGAAAAACTGAAAAAATGGAAGACGTGATGGGGTTTGGTCATTATCTGGCTGAATGTCCTACTTATCTCAAAAGACAAAAG
AAAAATTATTGTGCTACCCTGTCTGATGAGGATTCAGATGATGATGAAGATGATCATGAGCTAACGTTTGAAGAACTCAAAATACTGAGGAAGGAAGACTCAGAA
GCCAGAGCTATTCAAAAAGAAAGAATTCAAGACTTAATGGATGAAAATGAACGATTGATGGGGATTATATCATCTCTGAAAGTAAAGTTGAAAGAAGTTCAGAAT
GTCTTAGACTCAATCCTGAGTTCAGGGCAAAATGGTTCAAGTAAATATGGCATCGGATTTGATACTTCAACTAGGGGTGTCAAGATTACTCCAGAAAAGAGGTCA
TATACGGTCAAATTACTGAGAGATAGGAGACATCAGCAAAGACCAAAACTTGTAAACCAGCATAATCAGTATAGGATCATCAAAAGGAACAATGATGTAAGGGGA
ACTCACTTGATCTGGAGGGTGAAGACTTCTGAGAAGTGCAATATAGCATTTACAACAGTTCAAACCCATTTTGATGCTTGGTACTTTGACAGTGGATGCTCAAGA
CATATGACTGGCAATCGATATTTCTTTACTGAGTTAGAAGAATGTTCATCAAGTCATGTCACTTTTGGAGATGGGGCCAAAGGAAAAATTATTGCGAAAGGAAAC
ATTGACAAAAGTAATCTACCCTGTCTTAATGAAGTTAGATATGTGGATGGACTGAATGCAAACTTGAGTAGTGTAAGTCGACTATGTGACCAAGGATACAACGTA
AACTTTAACAATACTTGTTGTATAGTTACAGACAAGAATAATCAAGTGTTCATGAGTGGCAGATGGGAAGCAGATAACTGTTATCATTGGAGTTCCAATGGTTCA
AACATATGTCACTTAACTAAAGTTGATCAGAACTGGTTGTGGCATAGGAAATTGGGGAACATCGGCTTGAGAAGTTTAGATAAAGTTATCAGAAACGAGGCGGTT
GTAGGCATTCCTTCGTTACACATAAATGGCAAATTCTTTTGTGGTGACTGTCAAGTTGGAAAGTAA
Protein sequenceShow/hide protein sequence
MITVNGVSVPKPEIDWTDAEEQASVGNARAINVIFNGVDLNVFKLIDSCTTGKEAWKILEVAYEGTSKVKISRLQLITSKFEALKMIEDETASEYNERVLEIAND
SLLLGEKIHESKIVRKVLRSLPRKFDIKVTAIEEAQDITTLKLDELFGSLLTFEMAMLDRESKKDKRIAFKSVYDQENTVNQSSNEANQDESIALITKLFSKMAR
KFKSLNTAGKTEKMEDVMGFGHYLAECPTYLKRQKKNYCATLSDEDSDDDEDDHELTFEELKILRKEDSEARAIQKERIQDLMDENERLMGIISSLKVKLKEVQN
VLDSILSSGQNGSSKYGIGFDTSTRGVKITPEKRSYTVKLLRDRRHQQRPKLVNQHNQYRIIKRNNDVRGTHLIWRVKTSEKCNIAFTTVQTHFDAWYFDSGCSR
HMTGNRYFFTELEECSSSHVTFGDGAKGKIIAKGNIDKSNLPCLNEVRYVDGLNANLSSVSRLCDQGYNVNFNNTCCIVTDKNNQVFMSGRWEADNCYHWSSNGS
NICHLTKVDQNWLWHRKLGNIGLRSLDKVIRNEAVVGIPSLHINGKFFCGDCQVGK