; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g14960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g14960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:11556222..11564043
RNA-Seq ExpressionMoc08g14960
SyntenyMoc08g14960
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.5e-22377.36Show/hide
Query:  LKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QVEALKA+CE+KE   +DGDLGESPFTSD++EAPIPPKFK P++KPYDGSKDPKDYVEVF+ +MDFQAA+
Subjt:  LKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAAT

Query:  DAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADET
        DAIK RAF+IALTGSARLWYRRL A SISTYSQLR+EF++ FSSRHYD+KT THLATIRQKEGETLREYVTR  EEQLKVAHCSDDSAM YFLTGLADE 
Subjt:  DAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADET

Query:  LTVKLGEEAPATFAEVLQKAKK----------ETRRP----------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTN
        LTVKLGEEAPATFAEVLQKAKK          +T RP                D KSKDKG SFSS R EYRR++ GP RSRPYER+TPT IPISEILTN
Subjt:  LTVKLGEEAPATFAEVLQKAKK----------ETRRP----------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTN

Query:  IEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS
        IEE+ MEKLLKRPEKLRG PE+R+KDK CRFHR+HGHNTS  WE KRQIE+LIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPS
Subjt:  IEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS

Query:  GGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFS
        GGQSG KRKELAR ARREVCIIREQ+PTC I+F  ADLE VHLPHNDALVIAPLIDHV+V RVL+DGG SANIL LPTYLALGWTRSQLKKSPTPLVGFS
Subjt:  GGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFS

Query:  GESVSPEGCIDLPVTIGQDATQ-TDLARSV
        GESV PEG IDLPVT+GQD TQ T +A  V
Subjt:  GESVSPEGCIDLPVTIGQDATQ-TDLARSV

XP_022149377.1 uncharacterized protein LOC111017807 [Momordica charantia]1.7e-18381.26Show/hide
Query:  KDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKV
        +DPKDYVEVF+G+MDFQAATDAIK RAFQIALTG ARLWYRRL ARSISTYSQLRKEFISQF SRHYDRKT THLATIRQKE ETLREYVTR  EEQLKV
Subjt:  KDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKV

Query:  AHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK--------------------------ETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNR
         HCSDDSAM YFLTGLADETLTVKLGEEAPATFAEVLQKAKK                          E R+ D KS+DKG S S+SR E+RR + GP+R
Subjt:  AHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK--------------------------ETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNR

Query:  SRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR
        SRPYERYTPT I ISEILTNIEE+ MEKLLK PEKLRGDPEKR+KDK+CRFHRDH HNT+SCWE KRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR
Subjt:  SRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR

Query:  TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYL
        TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSI+F DADLEGVHLPHNDALVIAPLIDHVLV  +L+DGGASANIL LPTYL
Subjt:  TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYL

Query:  ALGWTRSQLKKSPTPLVGFSGESVSPE
        ALGWTR QLKKSPT  +  S E+ SP+
Subjt:  ALGWTRSQLKKSPTPLVGFSGESVSPE

XP_022149377.1 uncharacterized protein LOC111017807 [Momordica charantia]1.8e-0797.06Show/hide
Query:  MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQ
        MVHPANSANTTEQRGVNADNGP+RDLGARIVEDQ
Subjt:  MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQ

XP_022149377.1 uncharacterized protein LOC111017807 [Momordica charantia]9.3e-17777.54Show/hide
Query:  KESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQ
        K+ S +DGDLGES FTSD++EAPIPPKFK P++KPYDGSKDPKDYVEVF+G+MDF AA+DAIK RAFQIALTGSARLWYRRL ARSISTYSQLR+EF++Q
Subjt:  KESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQ

Query:  FSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAK--------------------
        FSSR Y +KT THLATIRQKEG TLREYVTR  EEQLKVAHCSDDSAM YFLTGLADE LTVKLGE+AP TFAEVLQKAK                    
Subjt:  FSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAK--------------------

Query:  ------KETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSS
              K+  R D KSKDKG SFSS R EYRR++ GP +SRPYER+TPT IPISEILTNIEE+ MEKLLKRPEKLRG PE+R+KDK CRFHR+HGHNTS 
Subjt:  ------KETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSS

Query:  CWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGV
        CWE KRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG+KRKELAR ARREVCIIREQ PTC I+F  AD E V
Subjt:  CWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGV

Query:  HLPHNDALVIAPLIDHVLVRRVL
        HLPHNDA VIAPLIDHV+VRRVL
Subjt:  HLPHNDALVIAPLIDHVLVRRVL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.0e-21978.2Show/hide
Query:  KAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATD
        KA+S Y P+ P  VITREEFD +K +FD QVEALKARCEKKESSFDDGDLGE  F+SDI+EA IPPKFKTP+MKPYDGSKDPKDYVEVF+ +MDFQAATD
Subjt:  KAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATD

Query:  AIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETL
        AIK  AFQIALTGSARLWYRRL AR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTR  EEQLKVAHCSDDSAM YFLTGLADETL
Subjt:  AIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETL

Query:  TVKLGEEAPATFAEVLQKAKK----------ETRRP----------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNI
        TVKL EEAPATFAEVLQK KK          +T RP                D KS+DKGPS SSSR++YRRS+   N+SRPYE YTPT IPI EILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKK----------ETRRP----------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNI

Query:  EETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSG
        EET MEKLLKRPEKLRGDPEKRN DK CRFHRDHGHNTS+ WE KRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKR RTPPRRDDRPAVI         
Subjt:  EETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSG

Query:  GQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFSG
            NK+KELAREARREVCIIREQ+PT SI+F  ADLEGVHLPHNDALVIAPLID VLVRR+L+DGGASANIL L TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE
        ES+S EGCIDLPV+I QD TQ T +A  V+++
Subjt:  ESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.3e-0130.67Show/hide
Query:  MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPLKANRGRGGTSRKTSQRANQTADPEALSTLQRELDYMR
        MV PANS NT ++R + A++G +R++GA +VE Q       +   RSAR     LPPAHPKP KA       +     R       +  S    +++ ++
Subjt:  MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPLKANRGRGGTSRKTSQRANQTADPEALSTLQRELDYMR

Query:  HRLRTVEEMYAEATRANRTASPSI--ALGAPDEKGAPSIQPGDREPIPND
         R    E  + +      + S  I  AL  P  K  P+++P D    P D
Subjt:  HRLRTVEEMYAEATRANRTASPSI--ALGAPDEKGAPSIQPGDREPIPND

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]4.7e-21365.75Show/hide
Query:  GAPDEKGAPSIQPGDREPIPNDEGVDYNLRDNDLRKHLTEKKKRASREPEDSSSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKA
        GAP EKGAPSIQPG+REPIPNDEGVDY+LRDNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDL+KHRFDEQVEALKA
Subjt:  GAPDEKGAPSIQPGDREPIPNDEGVDYNLRDNDLRKHLTEKKKRASREPEDSSSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKA

Query:  RCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKE
        RCEKKES FDD DLGESPFTSDIMEAPIPPKFKTP+MKPYDGSKDPKDYVEVF+G+MDFQAATDAIK  AFQIALTGSARLW RRL ARSISTYSQLRKE
Subjt:  RCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKE

Query:  FISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK---------------
        FI QFS RHYDRKT THLATIRQKE                                   DETLTVKLGEEAPATFAEVLQ AKK               
Subjt:  FISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK---------------

Query:  -----------ETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGH
                   + R+ D KSKDKG S S SR EYRRS+ GP+RSRPYER                                                   
Subjt:  -----------ETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGH

Query:  NTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDAD
            CWE KRQIEDLIQD YFKKFVGKPRSNSVEKKEERKRSRTPPRR+DRPAVINTIFGGPSGGQ  NKRKELA EARR+V IIREQKPTCSI+F D D
Subjt:  NTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDAD

Query:  LEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE-ILD
        LEGVHLPHNDALVIAPLIDHVLVRRVL+DGGASANIL LPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCIDLPVTIGQD+TQ T +A  V+++  L 
Subjt:  LEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE-ILD

Query:  THSILEPDVMEVNTPSPTWMDPIVEFIKGNPPQDPKEQKKMVRRAARFTLR
         ++I E  ++      P+ +  ++++   N     + ++K  R      L+
Subjt:  THSILEPDVMEVNTPSPTWMDPIVEFIKGNPPQDPKEQKKMVRRAARFTLR

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.2e-22377.36Show/hide
Query:  LKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QVEALKA+CE+KE   +DGDLGESPFTSD++EAPIPPKFK P++KPYDGSKDPKDYVEVF+ +MDFQAA+
Subjt:  LKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAAT

Query:  DAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADET
        DAIK RAF+IALTGSARLWYRRL A SISTYSQLR+EF++ FSSRHYD+KT THLATIRQKEGETLREYVTR  EEQLKVAHCSDDSAM YFLTGLADE 
Subjt:  DAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADET

Query:  LTVKLGEEAPATFAEVLQKAKK----------ETRRP----------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTN
        LTVKLGEEAPATFAEVLQKAKK          +T RP                D KSKDKG SFSS R EYRR++ GP RSRPYER+TPT IPISEILTN
Subjt:  LTVKLGEEAPATFAEVLQKAKK----------ETRRP----------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTN

Query:  IEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS
        IEE+ MEKLLKRPEKLRG PE+R+KDK CRFHR+HGHNTS  WE KRQIE+LIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPS
Subjt:  IEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPS

Query:  GGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFS
        GGQSG KRKELAR ARREVCIIREQ+PTC I+F  ADLE VHLPHNDALVIAPLIDHV+V RVL+DGG SANIL LPTYLALGWTRSQLKKSPTPLVGFS
Subjt:  GGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFS

Query:  GESVSPEGCIDLPVTIGQDATQ-TDLARSV
        GESV PEG IDLPVT+GQD TQ T +A  V
Subjt:  GESVSPEGCIDLPVTIGQDATQ-TDLARSV

A0A6J1D7S8 uncharacterized protein LOC1110178078.4e-18481.26Show/hide
Query:  KDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKV
        +DPKDYVEVF+G+MDFQAATDAIK RAFQIALTG ARLWYRRL ARSISTYSQLRKEFISQF SRHYDRKT THLATIRQKE ETLREYVTR  EEQLKV
Subjt:  KDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKV

Query:  AHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK--------------------------ETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNR
         HCSDDSAM YFLTGLADETLTVKLGEEAPATFAEVLQKAKK                          E R+ D KS+DKG S S+SR E+RR + GP+R
Subjt:  AHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK--------------------------ETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNR

Query:  SRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR
        SRPYERYTPT I ISEILTNIEE+ MEKLLK PEKLRGDPEKR+KDK+CRFHRDH HNT+SCWE KRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR
Subjt:  SRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSR

Query:  TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYL
        TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSI+F DADLEGVHLPHNDALVIAPLIDHVLV  +L+DGGASANIL LPTYL
Subjt:  TPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYL

Query:  ALGWTRSQLKKSPTPLVGFSGESVSPE
        ALGWTR QLKKSPT  +  S E+ SP+
Subjt:  ALGWTRSQLKKSPTPLVGFSGESVSPE

A0A6J1D7S8 uncharacterized protein LOC1110178078.9e-0897.06Show/hide
Query:  MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQ
        MVHPANSANTTEQRGVNADNGP+RDLGARIVEDQ
Subjt:  MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQ

A0A6J1D7S8 uncharacterized protein LOC1110178074.5e-17777.54Show/hide
Query:  KESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQ
        K+ S +DGDLGES FTSD++EAPIPPKFK P++KPYDGSKDPKDYVEVF+G+MDF AA+DAIK RAFQIALTGSARLWYRRL ARSISTYSQLR+EF++Q
Subjt:  KESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQ

Query:  FSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAK--------------------
        FSSR Y +KT THLATIRQKEG TLREYVTR  EEQLKVAHCSDDSAM YFLTGLADE LTVKLGE+AP TFAEVLQKAK                    
Subjt:  FSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAK--------------------

Query:  ------KETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSS
              K+  R D KSKDKG SFSS R EYRR++ GP +SRPYER+TPT IPISEILTNIEE+ MEKLLKRPEKLRG PE+R+KDK CRFHR+HGHNTS 
Subjt:  ------KETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSS

Query:  CWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGV
        CWE KRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG+KRKELAR ARREVCIIREQ PTC I+F  AD E V
Subjt:  CWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGV

Query:  HLPHNDALVIAPLIDHVLVRRVL
        HLPHNDA VIAPLIDHV+VRRVL
Subjt:  HLPHNDALVIAPLIDHVLVRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204799.5e-22078.2Show/hide
Query:  KAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATD
        KA+S Y P+ P  VITREEFD +K +FD QVEALKARCEKKESSFDDGDLGE  F+SDI+EA IPPKFKTP+MKPYDGSKDPKDYVEVF+ +MDFQAATD
Subjt:  KAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATD

Query:  AIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETL
        AIK  AFQIALTGSARLWYRRL AR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTR  EEQLKVAHCSDDSAM YFLTGLADETL
Subjt:  AIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETL

Query:  TVKLGEEAPATFAEVLQKAKK----------ETRRP----------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNI
        TVKL EEAPATFAEVLQK KK          +T RP                D KS+DKGPS SSSR++YRRS+   N+SRPYE YTPT IPI EILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKK----------ETRRP----------------DVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNI

Query:  EETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSG
        EET MEKLLKRPEKLRGDPEKRN DK CRFHRDHGHNTS+ WE KRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKR RTPPRRDDRPAVI         
Subjt:  EETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSG

Query:  GQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFSG
            NK+KELAREARREVCIIREQ+PT SI+F  ADLEGVHLPHNDALVIAPLID VLVRR+L+DGGASANIL L TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE
        ES+S EGCIDLPV+I QD TQ T +A  V+++
Subjt:  ESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE

A0A6J1DHB3 uncharacterized protein LOC1110204791.6e-0130.67Show/hide
Query:  MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPLKANRGRGGTSRKTSQRANQTADPEALSTLQRELDYMR
        MV PANS NT ++R + A++G +R++GA +VE Q       +   RSAR     LPPAHPKP KA       +     R       +  S    +++ ++
Subjt:  MVHPANSANTTEQRGVNADNGPRRDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPLKANRGRGGTSRKTSQRANQTADPEALSTLQRELDYMR

Query:  HRLRTVEEMYAEATRANRTASPSI--ALGAPDEKGAPSIQPGDREPIPND
         R    E  + +      + S  I  AL  P  K  P+++P D    P D
Subjt:  HRLRTVEEMYAEATRANRTASPSI--ALGAPDEKGAPSIQPGDREPIPND

A0A6J1DHB3 uncharacterized protein LOC1110204792.3e-21365.75Show/hide
Query:  GAPDEKGAPSIQPGDREPIPNDEGVDYNLRDNDLRKHLTEKKKRASREPEDSSSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKA
        GAP EKGAPSIQPG+REPIPNDEGVDY+LRDNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDL+KHRFDEQVEALKA
Subjt:  GAPDEKGAPSIQPGDREPIPNDEGVDYNLRDNDLRKHLTEKKKRASREPEDSSSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKA

Query:  RCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKE
        RCEKKES FDD DLGESPFTSDIMEAPIPPKFKTP+MKPYDGSKDPKDYVEVF+G+MDFQAATDAIK  AFQIALTGSARLW RRL ARSISTYSQLRKE
Subjt:  RCEKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKE

Query:  FISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK---------------
        FI QFS RHYDRKT THLATIRQKE                                   DETLTVKLGEEAPATFAEVLQ AKK               
Subjt:  FISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKK---------------

Query:  -----------ETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGH
                   + R+ D KSKDKG S S SR EYRRS+ GP+RSRPYER                                                   
Subjt:  -----------ETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYERYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGH

Query:  NTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDAD
            CWE KRQIEDLIQD YFKKFVGKPRSNSVEKKEERKRSRTPPRR+DRPAVINTIFGGPSGGQ  NKRKELA EARR+V IIREQKPTCSI+F D D
Subjt:  NTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDAD

Query:  LEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE-ILD
        LEGVHLPHNDALVIAPLIDHVLVRRVL+DGGASANIL LPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCIDLPVTIGQD+TQ T +A  V+++  L 
Subjt:  LEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCIDLPVTIGQDATQ-TDLARSVLVE-ILD

Query:  THSILEPDVMEVNTPSPTWMDPIVEFIKGNPPQDPKEQKKMVRRAARFTLR
         ++I E  ++      P+ +  ++++   N     + ++K  R      L+
Subjt:  THSILEPDVMEVNTPSPTWMDPIVEFIKGNPPQDPKEQKKMVRRAARFTLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCTATCTTTAGCACCATGGTGTTATAGACGCAAAAGATGTCTGCACACGTCTAGATGGCGTCGTGATGCTGCCTCCTCATCCAAAATGACCTTGCAGCCTGCTTC
TTCAATTCTACTTCATTCTTGCCTCGTCCCGACCCGTTCTGATGTTGTCCACTCTAGTGTTCAGGTCGGAACCGGAGATCGGGTTCGAGCTCGATTCGTGGAGAACCGTT
GTGCAAATTCCTGCATAAACATTTGGCGCCGTCTGTGGGGAAGACATCTTAAGTCATCCCGATCTAAAAAAAAAATATACGCAAAAATGGTGCATCCAGCAAACTCTGCC
AATACGACAGAACAGAGGGGTGTGAATGCTGATAACGGCCCTCGGCGAGACCTCGGTGCAAGAATAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGCG
CAGATCTGCCCGCCATGCGAACCAAGAGCTACCACCTGCTCACCCAAAACCCTTAAAAGCCAACAGAGGCCGAGGAGGGACGTCGAGAAAGACCTCCCAAAGGGCCAACC
AGACAGCAGACCCTGAAGCTCTGTCTACTCTCCAGCGCGAGTTGGATTATATGCGCCATCGGTTGCGCACAGTGGAAGAGATGTACGCCGAGGCAACGCGTGCTAACCGA
ACTGCGTCTCCCTCTATAGCCCTGGGGGCACCCGATGAAAAGGGAGCTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCCAACGATGAAGGAGTGGATTACAACTT
GCGGGATAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTTCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGG
CTCAATCAAAATACAAGCCTCTAGCACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGGTGAAACACAGGTTCGATGAGCAGGTCGAGGCACTCAAAGCCAGGTGC
GAGAAGAAGGAGAGCTCGTTTGACGATGGCGACTTGGGAGAATCGCCATTCACCTCGGATATTATGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCAGCATGAAGCC
CTATGATGGGTCTAAGGACCCCAAAGACTATGTTGAGGTATTTAAAGGCATCATGGATTTTCAAGCGGCAACGGATGCAATAAAATACCGCGCCTTCCAGATCGCGCTTA
CCGGCAGCGCGCGCCTGTGGTACCGGAGACTGTCGGCTAGGTCGATATCAACCTATTCTCAGCTGAGAAAGGAGTTCATAAGCCAATTCTCTTCTCGGCACTACGATAGG
AAAACAACGACTCACCTCGCCACCATCAGACAGAAGGAAGGAGAGACGCTGAGAGAATATGTCACACGGCTCCATGAGGAGCAGCTGAAGGTTGCACACTGCTCCGATGA
TTCGGCCATGTACTACTTCCTCACCGGCTTGGCCGATGAGACCTTGACAGTAAAACTTGGAGAGGAGGCTCCAGCCACCTTCGCCGAAGTATTGCAGAAGGCGAAAAAAG
AGACGAGGAGGCCTGATGTCAAGTCCAAGGATAAGGGACCATCCTTCTCCAGTAGCCGAATTGAGTATCGCAGGTCGGACGGTGGCCCCAACCGAAGCCGACCTTACGAA
CGTTATACCCCGACCATCATCCCAATCTCTGAAATACTTACAAACATTGAGGAGACTGAGATGGAAAAGCTCCTCAAGCGACCCGAGAAGCTCCGGGGAGACCCAGAAAA
ACGTAACAAAGACAAGGACTGCCGTTTTCATCGCGATCACGGCCACAATACGTCAAGTTGCTGGGAATTTAAGCGCCAGATTGAAGACCTCATTCAAGACGGCTACTTCA
AAAAATTTGTGGGCAAACCAAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACGCCGCCTCGTCGGGATGACCGACCTGCGGTCATCAACACTATT
TTCGGGGGCCCGAGTGGGGGCCAGTCCGGAAACAAAAGGAAAGAGTTAGCTCGCGAGGCCAGACGCGAGGTATGTATCATCAGGGAGCAGAAGCCCACTTGCTCCATCAG
TTTCGGCGATGCCGATCTAGAGGGGGTCCATTTGCCCCATAATGACGCGCTTGTGATCGCTCCTCTCATCGACCACGTCCTGGTCCGAAGAGTACTGATCGATGGAGGCG
CATCTGCCAACATCTTGTGTCTCCCAACATATCTTGCCTTGGGATGGACCAGGTCACAGTTGAAGAAGAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCC
CCAGAAGGGTGCATTGATCTGCCGGTAACTATCGGGCAAGATGCTACCCAGACCGACCTGGCTAGATCAGTCCTGGTCGAGATCTTGGACACTCATTCAATCTTGGAGCC
AGATGTAATGGAGGTTAATACTCCATCACCTACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAAGGAGCAAAAGAAGATGGTGCGAA
GAGCAGCTCGGTTCACACTTCGAGAAGGAATGTTGTTTAAGAAATTCAATCCTCTAAACCTACGGGTTCGAGGTGCGATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCTATCTTTAGCACCATGGTGTTATAGACGCAAAAGATGTCTGCACACGTCTAGATGGCGTCGTGATGCTGCCTCCTCATCCAAAATGACCTTGCAGCCTGCTTC
TTCAATTCTACTTCATTCTTGCCTCGTCCCGACCCGTTCTGATGTTGTCCACTCTAGTGTTCAGGTCGGAACCGGAGATCGGGTTCGAGCTCGATTCGTGGAGAACCGTT
GTGCAAATTCCTGCATAAACATTTGGCGCCGTCTGTGGGGAAGACATCTTAAGTCATCCCGATCTAAAAAAAAAATATACGCAAAAATGGTGCATCCAGCAAACTCTGCC
AATACGACAGAACAGAGGGGTGTGAATGCTGATAACGGCCCTCGGCGAGACCTCGGTGCAAGAATAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGCG
CAGATCTGCCCGCCATGCGAACCAAGAGCTACCACCTGCTCACCCAAAACCCTTAAAAGCCAACAGAGGCCGAGGAGGGACGTCGAGAAAGACCTCCCAAAGGGCCAACC
AGACAGCAGACCCTGAAGCTCTGTCTACTCTCCAGCGCGAGTTGGATTATATGCGCCATCGGTTGCGCACAGTGGAAGAGATGTACGCCGAGGCAACGCGTGCTAACCGA
ACTGCGTCTCCCTCTATAGCCCTGGGGGCACCCGATGAAAAGGGAGCTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCCAACGATGAAGGAGTGGATTACAACTT
GCGGGATAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTTCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGG
CTCAATCAAAATACAAGCCTCTAGCACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGGTGAAACACAGGTTCGATGAGCAGGTCGAGGCACTCAAAGCCAGGTGC
GAGAAGAAGGAGAGCTCGTTTGACGATGGCGACTTGGGAGAATCGCCATTCACCTCGGATATTATGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCAGCATGAAGCC
CTATGATGGGTCTAAGGACCCCAAAGACTATGTTGAGGTATTTAAAGGCATCATGGATTTTCAAGCGGCAACGGATGCAATAAAATACCGCGCCTTCCAGATCGCGCTTA
CCGGCAGCGCGCGCCTGTGGTACCGGAGACTGTCGGCTAGGTCGATATCAACCTATTCTCAGCTGAGAAAGGAGTTCATAAGCCAATTCTCTTCTCGGCACTACGATAGG
AAAACAACGACTCACCTCGCCACCATCAGACAGAAGGAAGGAGAGACGCTGAGAGAATATGTCACACGGCTCCATGAGGAGCAGCTGAAGGTTGCACACTGCTCCGATGA
TTCGGCCATGTACTACTTCCTCACCGGCTTGGCCGATGAGACCTTGACAGTAAAACTTGGAGAGGAGGCTCCAGCCACCTTCGCCGAAGTATTGCAGAAGGCGAAAAAAG
AGACGAGGAGGCCTGATGTCAAGTCCAAGGATAAGGGACCATCCTTCTCCAGTAGCCGAATTGAGTATCGCAGGTCGGACGGTGGCCCCAACCGAAGCCGACCTTACGAA
CGTTATACCCCGACCATCATCCCAATCTCTGAAATACTTACAAACATTGAGGAGACTGAGATGGAAAAGCTCCTCAAGCGACCCGAGAAGCTCCGGGGAGACCCAGAAAA
ACGTAACAAAGACAAGGACTGCCGTTTTCATCGCGATCACGGCCACAATACGTCAAGTTGCTGGGAATTTAAGCGCCAGATTGAAGACCTCATTCAAGACGGCTACTTCA
AAAAATTTGTGGGCAAACCAAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACGCCGCCTCGTCGGGATGACCGACCTGCGGTCATCAACACTATT
TTCGGGGGCCCGAGTGGGGGCCAGTCCGGAAACAAAAGGAAAGAGTTAGCTCGCGAGGCCAGACGCGAGGTATGTATCATCAGGGAGCAGAAGCCCACTTGCTCCATCAG
TTTCGGCGATGCCGATCTAGAGGGGGTCCATTTGCCCCATAATGACGCGCTTGTGATCGCTCCTCTCATCGACCACGTCCTGGTCCGAAGAGTACTGATCGATGGAGGCG
CATCTGCCAACATCTTGTGTCTCCCAACATATCTTGCCTTGGGATGGACCAGGTCACAGTTGAAGAAGAGTCCAACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCC
CCAGAAGGGTGCATTGATCTGCCGGTAACTATCGGGCAAGATGCTACCCAGACCGACCTGGCTAGATCAGTCCTGGTCGAGATCTTGGACACTCATTCAATCTTGGAGCC
AGATGTAATGGAGGTTAATACTCCATCACCTACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAAGGAGCAAAAGAAGATGGTGCGAA
GAGCAGCTCGGTTCACACTTCGAGAAGGAATGTTGTTTAAGAAATTCAATCCTCTAAACCTACGGGTTCGAGGTGCGATGTGA
Protein sequenceShow/hide protein sequence
MLLSLAPWCYRRKRCLHTSRWRRDAASSSKMTLQPASSILLHSCLVPTRSDVVHSSVQVGTGDRVRARFVENRCANSCINIWRRLWGRHLKSSRSKKKIYAKMVHPANSA
NTTEQRGVNADNGPRRDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPLKANRGRGGTSRKTSQRANQTADPEALSTLQRELDYMRHRLRTVEEMYAEATRANR
TASPSIALGAPDEKGAPSIQPGDREPIPNDEGVDYNLRDNDLRKHLTEKKKRASREPEDSSSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLVKHRFDEQVEALKARC
EKKESSFDDGDLGESPFTSDIMEAPIPPKFKTPSMKPYDGSKDPKDYVEVFKGIMDFQAATDAIKYRAFQIALTGSARLWYRRLSARSISTYSQLRKEFISQFSSRHYDR
KTTTHLATIRQKEGETLREYVTRLHEEQLKVAHCSDDSAMYYFLTGLADETLTVKLGEEAPATFAEVLQKAKKETRRPDVKSKDKGPSFSSSRIEYRRSDGGPNRSRPYE
RYTPTIIPISEILTNIEETEMEKLLKRPEKLRGDPEKRNKDKDCRFHRDHGHNTSSCWEFKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRDDRPAVINTI
FGGPSGGQSGNKRKELAREARREVCIIREQKPTCSISFGDADLEGVHLPHNDALVIAPLIDHVLVRRVLIDGGASANILCLPTYLALGWTRSQLKKSPTPLVGFSGESVS
PEGCIDLPVTIGQDATQTDLARSVLVEILDTHSILEPDVMEVNTPSPTWMDPIVEFIKGNPPQDPKEQKKMVRRAARFTLREGMLFKKFNPLNLRVRGAM