CuGenDBv2

Moc01g01880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g01880
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr1:1264118..1270164
RNA-Seq Expression: Moc01g01880
Synteny: Moc01g01880
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily
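The genome location field above can be split into its parts programmatically. The following is a minimal sketch, assuming the `chrom:start..end` layout shown in this record and assuming the coordinates are 1-based and inclusive (the record does not state this); the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr1:1264118..1270164")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 6047 bp on chr1.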


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 8.8e-236; % identity: 79.77)
Query:  LKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QV+ALKA+CE+KE P +DGDLGESPFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADET
        DAIKCRAF+IALTGSARLWYRRLPA  ISTYSQLR++F++ FSSRHYD+KT THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADE 
Subjt:  DAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADET

Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE--------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNI
        LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE                          + R EYR +E+GPT+SRPYER+TPTTIPIS+ILTNI
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE--------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNI

Query:  EESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSG
        EESGMEKLLKR EKLRG PE+R+KDKYCRFHR+HGHNTS+ WELKRQIE+LIQDGYFKKFVGK R++S EK+EERKRSRTPPRR DRPAVINTIFG  SG
Subjt:  EESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSG

Query:  GQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
        GQSG KRKELAR ARREVC+IREQ+PTC ITF  ADLE VHLPHNDALVIAPL+DHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCIDLSVTIEQDATQVTQMAEFV
        ESV PEG IDL VT+ QD TQVTQMAEFV
Subjt:  ESVSPEGCIDLSVTIEQDATQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value: 5.7e-203; % identity: 68.75)
Query:  NSNLKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITREEFD ++ + + QV+ALKA+CE+KE P +DGDLGESPFTSD++EA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AA+DAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE-------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILT
        DE LTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPE                         + R E+R + +GPT+SRPYER+TPTTIPIS+ILT
Subjt:  DETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE-------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILT

Query:  NIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSL
        NIEESGMEKLLKR EKLRG PE+RNKDKYCRFHR+H HNTS+ WELKRQIEDLIQD YFKKFVGK R++S EK+EERK SRTP RR DRPAVINTIFG  
Subjt:  NIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSL

Query:  SGGQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF
        SGGQSG+KRKELAR ARREVC+IREQ+PTC ITF  ADLE VHLPHNDALVIAPL+DHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGF
Subjt:  SGGQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF

Query:  SGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTLQK
        S ESV PEGCIDL VT+  D TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTL +
Subjt:  SGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTLQK

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 4.2e-246; % identity: 81)
Query:  KAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+ P  VITREEFD +K +FD QV+ALKARCEKKES FDDGDLGE  F+SDI+EA IPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETL
        AIKC AFQIALTGSARLWYRRLPARLISTYSQLRK+FISQFSSRHYDRKT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADETL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE---------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNI
        TVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE                           +SR +YR S S   QSRPYE YTPTTIPI +ILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE---------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNI

Query:  EESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSG
        EE+GMEKLLKR EKLRGDPEKRN DKYCRFHRDHGHNTSN WELKRQIEDLIQDGYFKKFVGK RSNS+EK+EERKR RTPPRRDDRPAVI         
Subjt:  EESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSG

Query:  GQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
            NK+KELAREARREVC+IREQ+PT SI F  ADLEGVHLPHNDALVIAPL+D V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTLQK
        ES+S EGCIDL V+I QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFR VPSTL +
Subjt:  ESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTLQK

XP_022153957.1 uncharacterized protein LOC111021344 [Momordica charantia] (e value: 2.0e-192; % identity: 78.45)
Query:  MDFQAATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAATDAIKCRAFQIALT SARLWYRRLPAR ISTYSQLRK+ ISQFSSRHYDRKT THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEN---------------------------SRTEYRGSESGPTQSRPYERYTPTTIP
        TGLADETLTVKLGEEAPATFAEVL+KAKKVIDGQELLRTKTGRPE                            SRTEYR SESGP++SRPYERYT TTIP
Subjt:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEN---------------------------SRTEYRGSESGPTQSRPYERYTPTTIP

Query:  ISKILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVIN
        IS+ILTNIEESGMEKLLKR EKLRGD EKRNKDKYCRFHRDHGHNT++CWELKRQIEDLIQD YFK                                  
Subjt:  ISKILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVIN

Query:  TIFGSLSGGQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
              +GGQSGNKRK+LAREARREVC+IREQKPTC I F D+DLEGVHLPHNDALVIAPL+DHV VRRVLVDG ASANILSLPTYLALGWTR QLKKSP
Subjt:  TIFGSLSGGQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTL
        TP VGFSGESVSPEGCIDL VTI QDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSF  VPSTL
Subjt:  TPLVGFSGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTL

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia] (e value: 7.4e-235; % identity: 72.55)
Query:  PGEPGEKGAPSIQPGDGEPIPNNEGVDYSLRDNDLRKHLTEKKKRASWEPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALK
        PG PGEKGAPSIQPG+ EPIPN+EGVDYSLRDNDLRKHLT+KKK+ASWEPEDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDLMKHRFDEQV+ALK
Subjt:  PGEPGEKGAPSIQPGDGEPIPNNEGVDYSLRDNDLRKHLTEKKKRASWEPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALK

Query:  ARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRK
        ARCEKKESPFDD DLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLPAR ISTYSQLRK
Subjt:  ARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRK

Query:  KFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR
        +FI QFS RHYDRKT THLATIRQKE                                   DETLTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT R
Subjt:  KFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR

Query:  PE---------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHG
        PE                            SRTEYR SESGP++SRPYER                                                  
Subjt:  PE---------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHG

Query:  HNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSGGQSGNKRKELAREARREVCVIREQKPTCSITFGDA
             CWELKRQIEDLIQD YFKKFVGK RSNS+EK+EERKRSRTPPRR+DRPAVINTIFG  SGGQ  NKRKELA EARR+V +IREQKPTCSITF D 
Subjt:  HNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSGGQSGNKRKELAREARREVCVIREQKPTCSITFGDA

Query:  DLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRS
        DLEGVHLPHNDALVIAPL+DHV+VRRVLVDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCIDL VTI QD+TQVTQMAEFVVIDGR 
Subjt:  DLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRS

Query:  AYNAIFGRPIIHSFRTVPSTLQK
        AYNAIF RPIIHSF+ VPS L +
Subjt:  AYNAIFGRPIIHSFRTVPSTLQK
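The % identity column can be approximated from the middle line of an alignment block, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. This is a rough sketch assuming that BLAST-style convention applies to the blocks shown here; `percent_identity` is a hypothetical helper, and values computed from the displayed segments need not reproduce the reported figures exactly.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns whose midline character is a residue letter."""
    if not query:
        raise ValueError("empty query line")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    # Trailing spaces are often lost when a page is copied, so pad them back.
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)


# Last block of the first GenBank hit above.
query = "ESVSPEGCIDLSVTIEQDATQVTQMAEFV"
midline = "ESV PEG IDL VT+ QD TQVTQMAEFV"
print(round(percent_identity(query, midline), 2))
```

To score a whole hit, the query and middle lines of every block would be concatenated before calling the function.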

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 4.2e-236; % identity: 79.77)
Query:  LKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QV+ALKA+CE+KE P +DGDLGESPFTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADET
        DAIKCRAF+IALTGSARLWYRRLPA  ISTYSQLR++F++ FSSRHYD+KT THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADE 
Subjt:  DAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADET

Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE--------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNI
        LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE                          + R EYR +E+GPT+SRPYER+TPTTIPIS+ILTNI
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE--------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNI

Query:  EESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSG
        EESGMEKLLKR EKLRG PE+R+KDKYCRFHR+HGHNTS+ WELKRQIE+LIQDGYFKKFVGK R++S EK+EERKRSRTPPRR DRPAVINTIFG  SG
Subjt:  EESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSG

Query:  GQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
        GQSG KRKELAR ARREVC+IREQ+PTC ITF  ADLE VHLPHNDALVIAPL+DHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCIDLSVTIEQDATQVTQMAEFV
        ESV PEG IDL VT+ QD TQVTQMAEFV
Subjt:  ESVSPEGCIDLSVTIEQDATQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value: 2.8e-203; % identity: 68.75)
Query:  NSNLKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITREEFD ++ + + QV+ALKA+CE+KE P +DGDLGESPFTSD++EA        PT+K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AA+DAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE-------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILT
        DE LTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPE                         + R E+R + +GPT+SRPYER+TPTTIPIS+ILT
Subjt:  DETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE-------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILT

Query:  NIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSL
        NIEESGMEKLLKR EKLRG PE+RNKDKYCRFHR+H HNTS+ WELKRQIEDLIQD YFKKFVGK R++S EK+EERK SRTP RR DRPAVINTIFG  
Subjt:  NIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSL

Query:  SGGQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF
        SGGQSG+KRKELAR ARREVC+IREQ+PTC ITF  ADLE VHLPHNDALVIAPL+DHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGF
Subjt:  SGGQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGF

Query:  SGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTLQK
        S ESV PEGCIDL VT+  D TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTL +
Subjt:  SGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTLQK

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 2.0e-246; % identity: 81)
Query:  KAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+ P  VITREEFD +K +FD QV+ALKARCEKKES FDDGDLGE  F+SDI+EA IPPKFKTPTMKPYDGSKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETL
        AIKC AFQIALTGSARLWYRRLPARLISTYSQLRK+FISQFSSRHYDRKT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADETL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE---------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNI
        TVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE                           +SR +YR S S   QSRPYE YTPTTIPI +ILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE---------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNI

Query:  EESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSG
        EE+GMEKLLKR EKLRGDPEKRN DKYCRFHRDHGHNTSN WELKRQIEDLIQDGYFKKFVGK RSNS+EK+EERKR RTPPRRDDRPAVI         
Subjt:  EESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSG

Query:  GQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
            NK+KELAREARREVC+IREQ+PT SI F  ADLEGVHLPHNDALVIAPL+D V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTLQK
        ES+S EGCIDL V+I QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFR VPSTL +
Subjt:  ESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTLQK

A0A6J1DKD3 uncharacterized protein LOC111021344 (e value: 9.9e-193; % identity: 78.45)
Query:  MDFQAATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAATDAIKCRAFQIALT SARLWYRRLPAR ISTYSQLRK+ ISQFSSRHYDRKT THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKKFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEN---------------------------SRTEYRGSESGPTQSRPYERYTPTTIP
        TGLADETLTVKLGEEAPATFAEVL+KAKKVIDGQELLRTKTGRPE                            SRTEYR SESGP++SRPYERYT TTIP
Subjt:  TGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEN---------------------------SRTEYRGSESGPTQSRPYERYTPTTIP

Query:  ISKILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVIN
        IS+ILTNIEESGMEKLLKR EKLRGD EKRNKDKYCRFHRDHGHNT++CWELKRQIEDLIQD YFK                                  
Subjt:  ISKILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVIN

Query:  TIFGSLSGGQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
              +GGQSGNKRK+LAREARREVC+IREQKPTC I F D+DLEGVHLPHNDALVIAPL+DHV VRRVLVDG ASANILSLPTYLALGWTR QLKKSP
Subjt:  TIFGSLSGGQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTL
        TP VGFSGESVSPEGCIDL VTI QDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSF  VPSTL
Subjt:  TPLVGFSGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTL

A0A6J1DPC9 uncharacterized protein LOC111022280 (e value: 3.6e-235; % identity: 72.55)
Query:  PGEPGEKGAPSIQPGDGEPIPNNEGVDYSLRDNDLRKHLTEKKKRASWEPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALK
        PG PGEKGAPSIQPG+ EPIPN+EGVDYSLRDNDLRKHLT+KKK+ASWEPEDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDLMKHRFDEQV+ALK
Subjt:  PGEPGEKGAPSIQPGDGEPIPNNEGVDYSLRDNDLRKHLTEKKKRASWEPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHRFDEQVDALK

Query:  ARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRK
        ARCEKKESPFDD DLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW RRLPAR ISTYSQLRK
Subjt:  ARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRK

Query:  KFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR
        +FI QFS RHYDRKT THLATIRQKE                                   DETLTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT R
Subjt:  KFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR

Query:  PE---------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHG
        PE                            SRTEYR SESGP++SRPYER                                                  
Subjt:  PE---------------------------NSRTEYRGSESGPTQSRPYERYTPTTIPISKILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHG

Query:  HNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSGGQSGNKRKELAREARREVCVIREQKPTCSITFGDA
             CWELKRQIEDLIQD YFKKFVGK RSNS+EK+EERKRSRTPPRR+DRPAVINTIFG  SGGQ  NKRKELA EARR+V +IREQKPTCSITF D 
Subjt:  HNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQEERKRSRTPPRRDDRPAVINTIFGSLSGGQSGNKRKELAREARREVCVIREQKPTCSITFGDA

Query:  DLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRS
        DLEGVHLPHNDALVIAPL+DHV+VRRVLVDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCIDL VTI QD+TQVTQMAEFVVIDGR 
Subjt:  DLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRS

Query:  AYNAIFGRPIIHSFRTVPSTLQK
        AYNAIF RPIIHSF+ VPS L +
Subjt:  AYNAIFGRPIIHSFRTVPSTLQK

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGTTAAAGGCAAGGTCCACTCTAGTGTTCAGGTCAGAACCGGAGACCGGGTTCGAGTTCAATTCGTGAAGAACCGTTGGCAAGAGGGAGATCTGCCGCGCAGA
TCTGCCCACCATGCGAACCAAGAGCTACCACCTACTCACCCGAAACCCTCAAAAGCCAACAGAGGCCGAGGAGGGACAGCGAGAAAGACCTCCCAAAAGGCCAAC
CAGGCAGCAGACCCTGAAGCTCTATCTACTCTCCAACGCGAGTTGGATGATATGCGTCATCGGTTGCGCACAATGGAAGAAATGTACGCCGAGGCAACGCGTGCT
AACCGAACTGCATCTCCCTCTATAGCCCCGGGCGAACCCGGTGAAAAGGGAGCTCCATCTATCCAACCTGGCGATGGCGAGCCCATTCCTAACAATGAAGGGGTG
GATTACAGCTTGCGAGACAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTTGGGAGCCGGAAGACTCTCCTTCCTACTCTCGAGAATTCTCC
AACTCGAACCTAAAGGCTCAATCAAAATACAAGCCCCTAGCACCAGAAGCTGTGATCACCAGGGAAGAATTCGACTTGATGAAACACAGGTTCGACGAGCAGGTC
GATGCACTCAAAGCCAGGTGCGAGAAGAAGGAGAGCCCGTTTGACGATGGCGACTTGGGAGAATCGCCATTCACCTCGGATATTATGGAGGCTCCAATCCCTCCG
AAGTTTAAGACTCCTACCATGAAGCCCTATGATGGGTCTAAGGACCCCAAAGACTATGTTGAGGTCTTCGAGGGCCTCATGGATTTTCAAGCGGCAACGGATGCA
ATAAAATGCCGCGCCTTCCAGATCGCTCTTACCGGCAGCGCGCGCCTGTGGTACCGGAGACTGCCGGCTAGGTTGATATCGACCTATTCTCAGCTGAGAAAGAAG
TTCATTAGCCAGTTCTCTTCTCGGCATTACGATAGAAAAACAACGACTCACCTTGCCACCATCAGACAGAAGGAAGGAGAGACGTTGAGAGAATATGTCACACGG
TTCCAGGAGGAGCAGCTTAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCGGCCTGGCCGATGAGACCTTGACAGTAAAACTTGGAGAG
GAGGCTCCAGCCACCTTCGCCGAAGTATTGCAAAAAGCGAAGAAAGTCATTGATGGGCAGGAGCTCCTCCGAACCAAGACTGGCCGACCTGAAAATAGCAGAACA
GAGTACCGTGGGTCGGAGAGCGGCCCTACCCAGAGCCGACCTTATGAACGGTACACCCCAACCACCATCCCCATCTCCAAGATACTCACGAACATCGAGGAGAGC
GGGATGGAAAAGCTCCTCAAGCGACTTGAGAAGCTCCGAGGAGACCCAGAAAAGCGCAACAAAGATAAGTACTGTCGTTTTCATCGCGATCACGGCCACAATACG
TCAAATTGCTGGGAGTTAAAACGCCAGATTGAAGACCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGTAAACTGAGGTCTAACTCGATTGAAAAGCAAGAA
GAGAGGAAGCGTTCAAGAACGCCGCCTCGCCGGGATGACCGACCTGCGGTCATCAACACTATTTTCGGGAGCCTGAGTGGGGGCCAGTCCGGAAACAAGAGGAAG
GAGCTAGCTCGCGAGGCCAGGCGCGAGGTATGCGTCATCAGGGAGCAGAAGCCTACTTGCTCCATCACTTTCGGCGACGCCGACTTGGAGGGGGTCCACTTGCCT
CACAATGACGCGCTTGTGATCGCCCCTCTCGTTGATCACGTCGTGGTCCGAAGAGTATTGGTTGATGGAGGTGCATCTGCCAACATCTTGTCCCTCCCAACATAT
CTAGCATTGGGATGGACCAGGTCACAATTGAAAAAAAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCCCCAGAAGGGTGCATCGACCTGTCGGTA
ACTATCGAGCAAGATGCTACCCAAGTAACGCAGATGGCCGAGTTCGTAGTGATCGACGGTAGATCGGCCTATAACGCCATTTTCGGGAGACCCATCATCCACTCA
TTTCGGACCGTCCCCTCCACACTACAGAAAGCTCGAGCTTCATACGAGACCGACCTGGCTAGATCGGTCCCGGTCGAAATCTTGGACACTCCTTCAATCTTGGAG
CCAGATGTAATGGAGGTTGATACTCCATCACCCACTTGGATGGACCGAATCGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAAGGAGCAAAAGAAGATG
GCACGAAGAGCAGCTCGGTTCACACTCCGAGAAGGAATGTTGTACCGACAGTATCAGAACAGAATGGCCAGACATTACAATGCCCGAGTTCGACCTCGAAGCTTC
CAAGTTGGACATTTGGTCTTGAGAAAAATTCAGAGTCATGTTGGCACCCTTGACCCAAGTTGGGAGGGACCATTCGAAGTCAAAGGCATAGTCCGACCTGGAACT
TATATGCTGGCTGACCTGGAAGGAAGAGTGCTTGCGCATCCATGGAACGCGGAGCACTTGAAGTGCTATTACCCCTGA
mRNA sequence
ATGGTTAAAGGCAAGGTCCACTCTAGTGTTCAGGTCAGAACCGGAGACCGGGTTCGAGTTCAATTCGTGAAGAACCGTTGGCAAGAGGGAGATCTGCCGCGCAGA
TCTGCCCACCATGCGAACCAAGAGCTACCACCTACTCACCCGAAACCCTCAAAAGCCAACAGAGGCCGAGGAGGGACAGCGAGAAAGACCTCCCAAAAGGCCAAC
CAGGCAGCAGACCCTGAAGCTCTATCTACTCTCCAACGCGAGTTGGATGATATGCGTCATCGGTTGCGCACAATGGAAGAAATGTACGCCGAGGCAACGCGTGCT
AACCGAACTGCATCTCCCTCTATAGCCCCGGGCGAACCCGGTGAAAAGGGAGCTCCATCTATCCAACCTGGCGATGGCGAGCCCATTCCTAACAATGAAGGGGTG
GATTACAGCTTGCGAGACAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTTGGGAGCCGGAAGACTCTCCTTCCTACTCTCGAGAATTCTCC
AACTCGAACCTAAAGGCTCAATCAAAATACAAGCCCCTAGCACCAGAAGCTGTGATCACCAGGGAAGAATTCGACTTGATGAAACACAGGTTCGACGAGCAGGTC
GATGCACTCAAAGCCAGGTGCGAGAAGAAGGAGAGCCCGTTTGACGATGGCGACTTGGGAGAATCGCCATTCACCTCGGATATTATGGAGGCTCCAATCCCTCCG
AAGTTTAAGACTCCTACCATGAAGCCCTATGATGGGTCTAAGGACCCCAAAGACTATGTTGAGGTCTTCGAGGGCCTCATGGATTTTCAAGCGGCAACGGATGCA
ATAAAATGCCGCGCCTTCCAGATCGCTCTTACCGGCAGCGCGCGCCTGTGGTACCGGAGACTGCCGGCTAGGTTGATATCGACCTATTCTCAGCTGAGAAAGAAG
TTCATTAGCCAGTTCTCTTCTCGGCATTACGATAGAAAAACAACGACTCACCTTGCCACCATCAGACAGAAGGAAGGAGAGACGTTGAGAGAATATGTCACACGG
TTCCAGGAGGAGCAGCTTAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCGGCCTGGCCGATGAGACCTTGACAGTAAAACTTGGAGAG
GAGGCTCCAGCCACCTTCGCCGAAGTATTGCAAAAAGCGAAGAAAGTCATTGATGGGCAGGAGCTCCTCCGAACCAAGACTGGCCGACCTGAAAATAGCAGAACA
GAGTACCGTGGGTCGGAGAGCGGCCCTACCCAGAGCCGACCTTATGAACGGTACACCCCAACCACCATCCCCATCTCCAAGATACTCACGAACATCGAGGAGAGC
GGGATGGAAAAGCTCCTCAAGCGACTTGAGAAGCTCCGAGGAGACCCAGAAAAGCGCAACAAAGATAAGTACTGTCGTTTTCATCGCGATCACGGCCACAATACG
TCAAATTGCTGGGAGTTAAAACGCCAGATTGAAGACCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGTAAACTGAGGTCTAACTCGATTGAAAAGCAAGAA
GAGAGGAAGCGTTCAAGAACGCCGCCTCGCCGGGATGACCGACCTGCGGTCATCAACACTATTTTCGGGAGCCTGAGTGGGGGCCAGTCCGGAAACAAGAGGAAG
GAGCTAGCTCGCGAGGCCAGGCGCGAGGTATGCGTCATCAGGGAGCAGAAGCCTACTTGCTCCATCACTTTCGGCGACGCCGACTTGGAGGGGGTCCACTTGCCT
CACAATGACGCGCTTGTGATCGCCCCTCTCGTTGATCACGTCGTGGTCCGAAGAGTATTGGTTGATGGAGGTGCATCTGCCAACATCTTGTCCCTCCCAACATAT
CTAGCATTGGGATGGACCAGGTCACAATTGAAAAAAAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCCCCAGAAGGGTGCATCGACCTGTCGGTA
ACTATCGAGCAAGATGCTACCCAAGTAACGCAGATGGCCGAGTTCGTAGTGATCGACGGTAGATCGGCCTATAACGCCATTTTCGGGAGACCCATCATCCACTCA
TTTCGGACCGTCCCCTCCACACTACAGAAAGCTCGAGCTTCATACGAGACCGACCTGGCTAGATCGGTCCCGGTCGAAATCTTGGACACTCCTTCAATCTTGGAG
CCAGATGTAATGGAGGTTGATACTCCATCACCCACTTGGATGGACCGAATCGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAAGGAGCAAAAGAAGATG
GCACGAAGAGCAGCTCGGTTCACACTCCGAGAAGGAATGTTGTACCGACAGTATCAGAACAGAATGGCCAGACATTACAATGCCCGAGTTCGACCTCGAAGCTTC
CAAGTTGGACATTTGGTCTTGAGAAAAATTCAGAGTCATGTTGGCACCCTTGACCCAAGTTGGGAGGGACCATTCGAAGTCAAAGGCATAGTCCGACCTGGAACT
TATATGCTGGCTGACCTGGAAGGAAGAGTGCTTGCGCATCCATGGAACGCGGAGCACTTGAAGTGCTATTACCCCTGA
Protein sequence
MVKGKVHSSVQVRTGDRVRVQFVKNRWQEGDLPRRSAHHANQELPPTHPKPSKANRGRGGTARKTSQKANQAADPEALSTLQRELDDMRHRLRTMEEMYAEATRA
NRTASPSIAPGEPGEKGAPSIQPGDGEPIPNNEGVDYSLRDNDLRKHLTEKKKRASWEPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHRFDEQV
DALKARCEKKESPFDDGDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLPARLISTYSQLRKK
FISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPENSRT
EYRGSESGPTQSRPYERYTPTTIPISKILTNIEESGMEKLLKRLEKLRGDPEKRNKDKYCRFHRDHGHNTSNCWELKRQIEDLIQDGYFKKFVGKLRSNSIEKQE
ERKRSRTPPRRDDRPAVINTIFGSLSGGQSGNKRKELAREARREVCVIREQKPTCSITFGDADLEGVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTY
LALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLSVTIEQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRTVPSTLQKARASYETDLARSVPVEILDTPSILE
PDVMEVDTPSPTWMDRIVEFIKGNPPQDPKEQKKMARRAARFTLREGMLYRQYQNRMARHYNARVRPRSFQVGHLVLRKIQSHVGTLDPSWEGPFEVKGIVRPGT
YMLADLEGRVLAHPWNAEHLKCYYP
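The protein sequence above should correspond to a translation of the CDS. As a quick consistency check, here is a minimal sketch that translates a coding sequence with the standard genetic code; the use of the standard nuclear code is an assumption (the record does not name a translation table), and `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 27 bases of the CDS listed above.
print(translate("ATGGTTAAAGGCAAGGTCCACTCTAGT"))
```

For these first nine codons the result is `MVKGKVHSS`, which matches the start of the protein sequence; pasting the full CDS into `translate` allows the same comparison over the whole record.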