; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g02400 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g02400
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:1937884..1943948
RNA-Seq ExpressionMoc07g02400
SyntenyMoc07g02400
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.2e-23080.12Show/hide
Query:  LKAQSKYKPLAPEAVVTREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTAT
        +KA+S   P  P  V+TREEFD ++ + D QVEALKA+CE+K+   +DGDLGESPFT+D+LEAPIPPKFK   +KPYDGSKDPKDYVEVFE LMDFQ A+
Subjt:  LKAQSKYKPLAPEAVVTREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTAT

Query:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADET
        DAIKCRAF+IALTGSARLWYRRLPA SISTYSQLR+EF++ FSSRHYD+KTATHLATIRQ+EGETLREYVTRFQEEQLKV+HCSD SA+CYFLTGLADE 
Subjt:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADET

Query:  LTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTN
        LTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++    D KSKDKG  S S R EYRR E+GPTRSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPS
        IEESGMEKLLKRPEKLRG  E+ SKDKYCRFHR+H H+T+  WELKRQIE LIQDGYFKKFVGKPR +S EKKEERKRSRTPPRR  RP VINTIFGGPS
Subjt:  IEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPS

Query:  GGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFS
        GGQSG KRKELAR ARREVCIIREQ+PTC ITF  A+LE VHLPHNDALVIAPLIDHV+V RVLVD G SANILSLPTYLALGWTRSQLK+SPTPLVGFS
Subjt:  GGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFS

Query:  GESVSPEGCINLP
        GESV PEG I+LP
Subjt:  GESVSPEGCINLP

XP_022149377.1 uncharacterized protein LOC111017807 [Momordica charantia]6.2e-20387.82Show/hide
Query:  KDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQ ATDAIKCRAFQIALTG ARLWYRRLPARSISTYSQLRKEFISQF SRHYDRKTATHLATIRQ+E ETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKV

Query:  SHCSDYSAICYFLTGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTR
         HCSD SA+CYFLTGLADETLTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKL QE R+ D KS+DKG SS +SR E+RR ESGP+R
Subjt:  SHCSDYSAICYFLTGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTR

Query:  SRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSR
        SRPYERYTPTTI ISEILTNIEESGMEKLLK PEKLRGD EK SKDK CRFHRDHDH+TT CWELKRQIE+LIQDGYFKKFVGKPR NSVEKKEERKRSR
Subjt:  SRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSR

Query:  TPPRRDGRPVVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYL
        TPPRRD RP VINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITF +A+LEGVHLPHNDALVIAPLIDHVLV  +LVD GASANILSLPTYL
Subjt:  TPPRRDGRPVVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYL

Query:  ALGWTRSQLKRSPTPLVGFSGESVSPE
        ALGWTR QLK+SPT  +  S E+ SP+
Subjt:  ALGWTRSQLKRSPTPLVGFSGESVSPE

XP_022149377.1 uncharacterized protein LOC111017807 [Momordica charantia]8.9e-0897.06Show/hide
Query:  MVHPANSANTTEQRGVNADNGPQRDLGARVVEDQ
        MVHPANSANTTEQRGVNADNGPQRDLGAR+VEDQ
Subjt:  MVHPANSANTTEQRGVNADNGPQRDLGARVVEDQ

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.2e-22463.07Show/hide
Query:  MVHPANSANTTEQRGVNADNGPQRDLGARVVEDQVRAGQEDDLPRRSARHANQELPPAHPKPSKANRGRGRTSRKTSQRASQGADPEALSTLQRELDDMR
        MV PANS NT ++R + A++G QR++GA VVE Q       +   RSAR     LPPAHPKPS                                     
Subjt:  MVHPANSANTTEQRGVNADNGPQRDLGARVVEDQVRAGQEDDLPRRSARHANQELPPAHPKPSKANRGRGRTSRKTSQRASQGADPEALSTLQRELDDMR

Query:  HRLRTMEEMYAEAKRANRTASPSMAPGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASWEPEDSSSYSREFSNSNLKAQSKYKPLA
                                                                                                  KA+S Y P+ 
Subjt:  HRLRTMEEMYAEAKRANRTASPSMAPGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASWEPEDSSSYSREFSNSNLKAQSKYKPLA

Query:  PEAVVTREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIA
        P  V+TREEFD +K KFD QVEALKARCEKK+ SFDDGDLGE  F++DILEA IPPKFKT  MKPYDGSKDPKDYVEVFE LMDFQ ATDAIKC AFQIA
Subjt:  PEAVVTREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADETLTVKLGEEAPT
        LTGSARLWYRRLPAR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQ+EGETLREYVTRF EEQLKV+HCSD SA+CYFLTGLADETLTVKL EEAP 
Subjt:  LTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADETLTVKLGEEAPT

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTNIEESGMEKLLK
        TFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  ++  + D KS+DKGPSS SSR +YRR+ S   +SRPYE YTPTTIPI EILTNIEE+GMEKLLK
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTNIEESGMEKLLK

Query:  RPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPSGGQSGNKRKEL
        RPEKLRGD EK + DKYCRFHRDH H+T+  WELKRQIE+LIQDGYFKKFVGKPR NSVEKKEERKR RTPPRRD RP VI             NK+KEL
Subjt:  RPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPSGGQSGNKRKEL

Query:  AREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVSPEGCIN
        AREARREVCIIREQ+PT SI F +A+LEGVHLPHNDALVIAPLID VLVRR+LVD GASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+S EGCI+
Subjt:  AREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVSPEGCIN

Query:  LPGS
        LP S
Subjt:  LPGS

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]3.4e-22574.18Show/hide
Query:  PGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASWEPEDSSSYSREFSNSNLKAQSKYKPLAPEAVVTREEFDLMKHKFDEQVEALK
        PGAPGEKGAPSIQPG+REPIPND GVDYSLRDNDLRKHLT+KKK+ASWEPEDS SYSREFSNSNLKAQSKYKPL PEAV+ REEFDLMKH+FDEQVEALK
Subjt:  PGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASWEPEDSSSYSREFSNSNLKAQSKYKPLAPEAVVTREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRK
        ARCEKK+  FDD DLGESPFT+DI+EAPIPPKFKT  MKPYDGSKDPKDYVEVFEGLMDFQ ATDAIKC AFQIALTGSARLW RRLPARSISTYSQLRK
Subjt:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRK

Query:  EFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGR
        EFI QFS RHYDRKTATHLATIRQ+E                                   DETLTVKLGEEAP TFAEVLQ AKKVIDGQELLRTKT R
Subjt:  EFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGR

Query:  PEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHD
        PEKQIDQK+L+Q+ R+ D KSKDKG SS  SRTEYRR+ESGP+RSRPYER                                                  
Subjt:  PEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHD

Query:  HDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNA
             CWELKRQIE+LIQD YFKKFVGKPR NSVEKKEERKRSRTPPRR+ RP VINTIFGGPSGGQ  NKRKELA EARR+V IIREQKPTCSITF + 
Subjt:  HDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNA

Query:  NLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVSPEGCINLP
        +LEGVHLPHNDALVIAPLIDHVLVRRVLVD GASANILSLPTYLAL  TRSQLK+SPTPLVGFS ESVSPEGCI+LP
Subjt:  NLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVSPEGCINLP

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]8.7e-20577.67Show/hide
Query:  MDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFL
        MDFQ ATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSS HYDRKTATHLATIRQ+E ETLREYVTRFQEEQLKV+HCSD SA+CYFL
Subjt:  MDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFL

Query:  TGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIP
        T LADETLTVKLGEEAPTTF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKL+QE R+ D KS+DKG SS +SRTEYRR ESGP+RSRPYERYT +TIP
Subjt:  TGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVIN
        ISEILTNIEESGMEKLLKRPEKLRGDLEK +K+KYCRFHRDH H+TT CWELKRQIE+LIQDGYFKKFVGKPR NSVEKKEERKRSRTPPRR+ RP VIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVIN

Query:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEG-------------------------
        TIFGGP+GGQSGNKRKELAREARREVCIIRE KPTCSITFG+A+LEGVHLPHNDALVIA LIDH LVRRVL+D G                         
Subjt:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEG-------------------------

Query:  ---ASANILSLP---TYLALGWTRSQLKRSPTP-LVGF-SGESVSPEGCI--NLPGSAVCALEEQTNRGKLQESEADLPKESKRQFSPPIEELELVPLLS
           A   I   P   ++ A+  T  Q+ +  TP  VG   GE  +   C    L GSAVCALEEQTNRGKLQESEADLPKE KRQF PP EELELVPLLS
Subjt:  ---ASANILSLP---TYLALGWTRSQLKRSPTP-LVGF-SGESVSPEGCI--NLPGSAVCALEEQTNRGKLQESEADLPKESKRQFSPPIEELELVPLLS

Query:  PEKQVS
        PE+Q +
Subjt:  PEKQVS

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088135.9e-23180.12Show/hide
Query:  LKAQSKYKPLAPEAVVTREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTAT
        +KA+S   P  P  V+TREEFD ++ + D QVEALKA+CE+K+   +DGDLGESPFT+D+LEAPIPPKFK   +KPYDGSKDPKDYVEVFE LMDFQ A+
Subjt:  LKAQSKYKPLAPEAVVTREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTAT

Query:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADET
        DAIKCRAF+IALTGSARLWYRRLPA SISTYSQLR+EF++ FSSRHYD+KTATHLATIRQ+EGETLREYVTRFQEEQLKV+HCSD SA+CYFLTGLADE 
Subjt:  DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADET

Query:  LTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTN
        LTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++    D KSKDKG  S S R EYRR E+GPTRSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPS
        IEESGMEKLLKRPEKLRG  E+ SKDKYCRFHR+H H+T+  WELKRQIE LIQDGYFKKFVGKPR +S EKKEERKRSRTPPRR  RP VINTIFGGPS
Subjt:  IEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPS

Query:  GGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFS
        GGQSG KRKELAR ARREVCIIREQ+PTC ITF  A+LE VHLPHNDALVIAPLIDHV+V RVLVD G SANILSLPTYLALGWTRSQLK+SPTPLVGFS
Subjt:  GGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFS

Query:  GESVSPEGCINLP
        GESV PEG I+LP
Subjt:  GESVSPEGCINLP

A0A6J1D7S8 uncharacterized protein LOC1110178073.0e-20387.82Show/hide
Query:  KDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQ ATDAIKCRAFQIALTG ARLWYRRLPARSISTYSQLRKEFISQF SRHYDRKTATHLATIRQ+E ETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKV

Query:  SHCSDYSAICYFLTGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTR
         HCSD SA+CYFLTGLADETLTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKL QE R+ D KS+DKG SS +SR E+RR ESGP+R
Subjt:  SHCSDYSAICYFLTGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTR

Query:  SRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSR
        SRPYERYTPTTI ISEILTNIEESGMEKLLK PEKLRGD EK SKDK CRFHRDHDH+TT CWELKRQIE+LIQDGYFKKFVGKPR NSVEKKEERKRSR
Subjt:  SRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSR

Query:  TPPRRDGRPVVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYL
        TPPRRD RP VINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITF +A+LEGVHLPHNDALVIAPLIDHVLV  +LVD GASANILSLPTYL
Subjt:  TPPRRDGRPVVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYL

Query:  ALGWTRSQLKRSPTPLVGFSGESVSPE
        ALGWTR QLK+SPT  +  S E+ SP+
Subjt:  ALGWTRSQLKRSPTPLVGFSGESVSPE

A0A6J1D7S8 uncharacterized protein LOC1110178074.3e-0897.06Show/hide
Query:  MVHPANSANTTEQRGVNADNGPQRDLGARVVEDQ
        MVHPANSANTTEQRGVNADNGPQRDLGAR+VEDQ
Subjt:  MVHPANSANTTEQRGVNADNGPQRDLGARVVEDQ

A0A6J1DHB3 uncharacterized protein LOC1110204791.1e-22463.07Show/hide
Query:  MVHPANSANTTEQRGVNADNGPQRDLGARVVEDQVRAGQEDDLPRRSARHANQELPPAHPKPSKANRGRGRTSRKTSQRASQGADPEALSTLQRELDDMR
        MV PANS NT ++R + A++G QR++GA VVE Q       +   RSAR     LPPAHPKPS                                     
Subjt:  MVHPANSANTTEQRGVNADNGPQRDLGARVVEDQVRAGQEDDLPRRSARHANQELPPAHPKPSKANRGRGRTSRKTSQRASQGADPEALSTLQRELDDMR

Query:  HRLRTMEEMYAEAKRANRTASPSMAPGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASWEPEDSSSYSREFSNSNLKAQSKYKPLA
                                                                                                  KA+S Y P+ 
Subjt:  HRLRTMEEMYAEAKRANRTASPSMAPGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASWEPEDSSSYSREFSNSNLKAQSKYKPLA

Query:  PEAVVTREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIA
        P  V+TREEFD +K KFD QVEALKARCEKK+ SFDDGDLGE  F++DILEA IPPKFKT  MKPYDGSKDPKDYVEVFE LMDFQ ATDAIKC AFQIA
Subjt:  PEAVVTREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADETLTVKLGEEAPT
        LTGSARLWYRRLPAR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQ+EGETLREYVTRF EEQLKV+HCSD SA+CYFLTGLADETLTVKL EEAP 
Subjt:  LTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADETLTVKLGEEAPT

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTNIEESGMEKLLK
        TFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  ++  + D KS+DKGPSS SSR +YRR+ S   +SRPYE YTPTTIPI EILTNIEE+GMEKLLK
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTNIEESGMEKLLK

Query:  RPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPSGGQSGNKRKEL
        RPEKLRGD EK + DKYCRFHRDH H+T+  WELKRQIE+LIQDGYFKKFVGKPR NSVEKKEERKR RTPPRRD RP VI             NK+KEL
Subjt:  RPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPSGGQSGNKRKEL

Query:  AREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVSPEGCIN
        AREARREVCIIREQ+PT SI F +A+LEGVHLPHNDALVIAPLID VLVRR+LVD GASANILSL TYLALGWTRSQLK+SPTPLVGFSGES+S EGCI+
Subjt:  AREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVSPEGCIN

Query:  LPGS
        LP S
Subjt:  LPGS

A0A6J1DPC9 uncharacterized protein LOC1110222801.6e-22574.18Show/hide
Query:  PGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASWEPEDSSSYSREFSNSNLKAQSKYKPLAPEAVVTREEFDLMKHKFDEQVEALK
        PGAPGEKGAPSIQPG+REPIPND GVDYSLRDNDLRKHLT+KKK+ASWEPEDS SYSREFSNSNLKAQSKYKPL PEAV+ REEFDLMKH+FDEQVEALK
Subjt:  PGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASWEPEDSSSYSREFSNSNLKAQSKYKPLAPEAVVTREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRK
        ARCEKK+  FDD DLGESPFT+DI+EAPIPPKFKT  MKPYDGSKDPKDYVEVFEGLMDFQ ATDAIKC AFQIALTGSARLW RRLPARSISTYSQLRK
Subjt:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRK

Query:  EFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGR
        EFI QFS RHYDRKTATHLATIRQ+E                                   DETLTVKLGEEAP TFAEVLQ AKKVIDGQELLRTKT R
Subjt:  EFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFLTGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGR

Query:  PEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHD
        PEKQIDQK+L+Q+ R+ D KSKDKG SS  SRTEYRR+ESGP+RSRPYER                                                  
Subjt:  PEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHD

Query:  HDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNA
             CWELKRQIE+LIQD YFKKFVGKPR NSVEKKEERKRSRTPPRR+ RP VINTIFGGPSGGQ  NKRKELA EARR+V IIREQKPTCSITF + 
Subjt:  HDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNA

Query:  NLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVSPEGCINLP
        +LEGVHLPHNDALVIAPLIDHVLVRRVLVD GASANILSLPTYLAL  TRSQLK+SPTPLVGFS ESVSPEGCI+LP
Subjt:  NLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVSPEGCINLP

A0A6J1DZB9 uncharacterized protein LOC1110249044.2e-20577.67Show/hide
Query:  MDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFL
        MDFQ ATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSS HYDRKTATHLATIRQ+E ETLREYVTRFQEEQLKV+HCSD SA+CYFL
Subjt:  MDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYFL

Query:  TGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIP
        T LADETLTVKLGEEAPTTF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKL+QE R+ D KS+DKG SS +SRTEYRR ESGP+RSRPYERYT +TIP
Subjt:  TGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVIN
        ISEILTNIEESGMEKLLKRPEKLRGDLEK +K+KYCRFHRDH H+TT CWELKRQIE+LIQDGYFKKFVGKPR NSVEKKEERKRSRTPPRR+ RP VIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVIN

Query:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEG-------------------------
        TIFGGP+GGQSGNKRKELAREARREVCIIRE KPTCSITFG+A+LEGVHLPHNDALVIA LIDH LVRRVL+D G                         
Subjt:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEG-------------------------

Query:  ---ASANILSLP---TYLALGWTRSQLKRSPTP-LVGF-SGESVSPEGCI--NLPGSAVCALEEQTNRGKLQESEADLPKESKRQFSPPIEELELVPLLS
           A   I   P   ++ A+  T  Q+ +  TP  VG   GE  +   C    L GSAVCALEEQTNRGKLQESEADLPKE KRQF PP EELELVPLLS
Subjt:  ---ASANILSLP---TYLALGWTRSQLKRSPTP-LVGF-SGESVSPEGCI--NLPGSAVCALEEQTNRGKLQESEADLPKESKRQFSPPIEELELVPLLS

Query:  PEKQVS
        PE+Q +
Subjt:  PEKQVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCATGCCCAACTCGAACCGCCGAAATTACCACCACTAGCATTGACAGGTGCTAAGTATAAAGGGCCGAGGTGCGACCTGGTCAAGGTCCGATCTTTCGGGAAGCT
CGGCAGGTCCGCCCAAGTGTTCAGATCGGACATCCGAAGTCATCCCGACCTAAAAACATACGTAAAGATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAGGG
GTGTGAACGCTGACAATGGCCCTCAGCGAGACCTTGGTGCGAGAGTAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGATGATCTGCCGCGCAGATCTGCCCGTCATGCG
AATCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAGGCCAACAGAGGCCGAGGAAGGACGTCGAGAAAAACCTCCCAAAGGGCCAGTCAGGGAGCAGACCCCGAAGC
TCTGTCTACTCTCCAGCGCGAGTTGGATGATATGCGCCATCGATTGCGCACAATGGAAGAAATGTACGCCGAGGCAAAGCGTGCTAACCGAACTGCGTCTCCCTCTATGG
CCCCGGGCGCACCCGGAGAAAAGGGAGCTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCCAACGATGGAGGAGTGGATTACAGCTTGCGGGATAACGATCTAAGA
AAGCATCTCACTGAAAAGAAGAAGAGAGCATCTTGGGAGCCGGAAGACTCTTCTTCCTACTCCCGAGAATTCTCCAATTCGAACCTAAAGGCTCAATCAAAATACAAGCC
TCTTGCACCAGAAGCTGTGGTCACTAGGGAAGAGTTCGACCTGATGAAGCACAAGTTTGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGT
TCGACGATGGCGACTTGGGAGAATCGCCATTCACCGCGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCTCGCCATGAAGCCCTATGACGGGTCTAAGGAC
CCTAAAGACTATGTTGAGGTCTTCGAGGGCCTCATGGACTTTCAAACGGCGACAGATGCGATCAAATGTCGCGCCTTCCAGATCGCGCTTACCGGCAGCGCGCGCTTGTG
GTACCGAAGACTGCCGGCCAGGTCGATCTCGACCTACTCCCAGCTAAGGAAGGAGTTCATCAGTCAGTTCTCCTCTCGGCATTATGACAGAAAGACAGCGACTCACCTTG
CCACCATCAGGCAGAGGGAAGGAGAGACGCTGAGAGAATATGTCACACGGTTCCAGGAGGAGCAACTTAAGGTCTCGCACTGCTCCGATTATTCGGCCATATGCTACTTC
CTCACCGGCCTGGCCGATGAGACCTTGACAGTAAAACTTGGAGAGGAGGCTCCAACCACCTTCGCCGAAGTATTGCAGAAGGCGAAAAAAGTCATTGATGGACAGGAGCT
ACTCCGAACCAAAACTGGCCGACCTGAGAAGCAGATCGACCAAAAGAAGTTGAACCAGGAGACGAGGAGGCCTGATGTCAAGTCCAAGGATAAGGGTCCATCCTCCTTCA
GTAGCAGAACAGAGTACCGTAGGACGGAGAGCGGCCCTACCCGGAGCCGACCTTATGAACGGTACACCCCAACCACCATCCCCATCTCCGAGATACTCACGAACATCGAA
GAGAGCGGGATGGAAAAACTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCTAGAAAAACTTAGCAAAGACAAGTACTGCCGCTTTCATCGAGATCACGACCACGATAC
GACAGGTTGCTGGGAATTGAAGCGCCAGATTGAAGAACTCATTCAGGACGGCTACTTCAAAAAATTTGTGGGCAAACCGAGAGTGAACTCGGTCGAAAAGAAAGAGGAAA
GGAAGCGTTCAAGAACGCCGCCTCGCCGGGATGGCCGACCTGTGGTCATCAATACTATTTTTGGGGGTCCGAGCGGGGGCCAGTCTGGAAACAAGAGGAAAGAGCTAGCT
CGCGAGGCCAGACGCGAGGTATGCATCATCAGAGAGCAGAAGCCCACTTGCTCCATCACTTTTGGCAACGCCAACCTGGAGGGGGTCCACTTGCCTCACAACGATGCATT
GGTGATCGCCCCTCTGATTGATCACGTCCTGGTCCGAAGAGTGTTGGTCGATGAAGGCGCATCTGCCAACATCTTGTCCCTCCCAACATATCTGGCATTAGGATGGACCA
GGTCACAATTGAAAAGGAGTCCAACACCCTTGGTCGGATTCTCTGGAGAATCGGTCTCCCCCGAAGGGTGCATCAACTTGCCGGGGTCGGCCGTATGCGCCCTGGAAGAA
CAAACGAATCGTGGCAAGCTGCAGGAGTCAGAGGCCGACCTGCCAAAGGAAAGCAAAAGGCAGTTCTCCCCGCCAATAGAGGAGCTCGAGCTCGTTCCTTTACTTAGTCC
CGAAAAACAAGTAAGTATAGGAACCAAGCTGGGGGCCACTGACAGCGAAGAACTGATCAACTTCCTCAGGTCTAACTCGGACGTCTTCACGTGGTCTCACGAGGATATGC
CTGCATCAGCATACGAGACCGACCTGGCCACGGCGGTTCCTGTTGAGATCTTGGATAATCCCTCGATCTTGGAGCCAGATCTGATGGAGATTGGCGTTCCAGAGCCCTCA
TGGATGGACCCGATTATGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGAAGGCAGCTCGTCAAGGGAATAGTCTGACCTGGGA
CATACATATTGGCCGATCTGAAAGGAGACGTCTTGGAGCACCCATGGAACGCGGAGCACCTGAAGCGTTATTACCCGTGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCATGCCCAACTCGAACCGCCGAAATTACCACCACTAGCATTGACAGGTGCTAAGTATAAAGGGCCGAGGTGCGACCTGGTCAAGGTCCGATCTTTCGGGAAGCT
CGGCAGGTCCGCCCAAGTGTTCAGATCGGACATCCGAAGTCATCCCGACCTAAAAACATACGTAAAGATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAGGG
GTGTGAACGCTGACAATGGCCCTCAGCGAGACCTTGGTGCGAGAGTAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGATGATCTGCCGCGCAGATCTGCCCGTCATGCG
AATCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAGGCCAACAGAGGCCGAGGAAGGACGTCGAGAAAAACCTCCCAAAGGGCCAGTCAGGGAGCAGACCCCGAAGC
TCTGTCTACTCTCCAGCGCGAGTTGGATGATATGCGCCATCGATTGCGCACAATGGAAGAAATGTACGCCGAGGCAAAGCGTGCTAACCGAACTGCGTCTCCCTCTATGG
CCCCGGGCGCACCCGGAGAAAAGGGAGCTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCCAACGATGGAGGAGTGGATTACAGCTTGCGGGATAACGATCTAAGA
AAGCATCTCACTGAAAAGAAGAAGAGAGCATCTTGGGAGCCGGAAGACTCTTCTTCCTACTCCCGAGAATTCTCCAATTCGAACCTAAAGGCTCAATCAAAATACAAGCC
TCTTGCACCAGAAGCTGTGGTCACTAGGGAAGAGTTCGACCTGATGAAGCACAAGTTTGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGT
TCGACGATGGCGACTTGGGAGAATCGCCATTCACCGCGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCTCGCCATGAAGCCCTATGACGGGTCTAAGGAC
CCTAAAGACTATGTTGAGGTCTTCGAGGGCCTCATGGACTTTCAAACGGCGACAGATGCGATCAAATGTCGCGCCTTCCAGATCGCGCTTACCGGCAGCGCGCGCTTGTG
GTACCGAAGACTGCCGGCCAGGTCGATCTCGACCTACTCCCAGCTAAGGAAGGAGTTCATCAGTCAGTTCTCCTCTCGGCATTATGACAGAAAGACAGCGACTCACCTTG
CCACCATCAGGCAGAGGGAAGGAGAGACGCTGAGAGAATATGTCACACGGTTCCAGGAGGAGCAACTTAAGGTCTCGCACTGCTCCGATTATTCGGCCATATGCTACTTC
CTCACCGGCCTGGCCGATGAGACCTTGACAGTAAAACTTGGAGAGGAGGCTCCAACCACCTTCGCCGAAGTATTGCAGAAGGCGAAAAAAGTCATTGATGGACAGGAGCT
ACTCCGAACCAAAACTGGCCGACCTGAGAAGCAGATCGACCAAAAGAAGTTGAACCAGGAGACGAGGAGGCCTGATGTCAAGTCCAAGGATAAGGGTCCATCCTCCTTCA
GTAGCAGAACAGAGTACCGTAGGACGGAGAGCGGCCCTACCCGGAGCCGACCTTATGAACGGTACACCCCAACCACCATCCCCATCTCCGAGATACTCACGAACATCGAA
GAGAGCGGGATGGAAAAACTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCTAGAAAAACTTAGCAAAGACAAGTACTGCCGCTTTCATCGAGATCACGACCACGATAC
GACAGGTTGCTGGGAATTGAAGCGCCAGATTGAAGAACTCATTCAGGACGGCTACTTCAAAAAATTTGTGGGCAAACCGAGAGTGAACTCGGTCGAAAAGAAAGAGGAAA
GGAAGCGTTCAAGAACGCCGCCTCGCCGGGATGGCCGACCTGTGGTCATCAATACTATTTTTGGGGGTCCGAGCGGGGGCCAGTCTGGAAACAAGAGGAAAGAGCTAGCT
CGCGAGGCCAGACGCGAGGTATGCATCATCAGAGAGCAGAAGCCCACTTGCTCCATCACTTTTGGCAACGCCAACCTGGAGGGGGTCCACTTGCCTCACAACGATGCATT
GGTGATCGCCCCTCTGATTGATCACGTCCTGGTCCGAAGAGTGTTGGTCGATGAAGGCGCATCTGCCAACATCTTGTCCCTCCCAACATATCTGGCATTAGGATGGACCA
GGTCACAATTGAAAAGGAGTCCAACACCCTTGGTCGGATTCTCTGGAGAATCGGTCTCCCCCGAAGGGTGCATCAACTTGCCGGGGTCGGCCGTATGCGCCCTGGAAGAA
CAAACGAATCGTGGCAAGCTGCAGGAGTCAGAGGCCGACCTGCCAAAGGAAAGCAAAAGGCAGTTCTCCCCGCCAATAGAGGAGCTCGAGCTCGTTCCTTTACTTAGTCC
CGAAAAACAAGTAAGTATAGGAACCAAGCTGGGGGCCACTGACAGCGAAGAACTGATCAACTTCCTCAGGTCTAACTCGGACGTCTTCACGTGGTCTCACGAGGATATGC
CTGCATCAGCATACGAGACCGACCTGGCCACGGCGGTTCCTGTTGAGATCTTGGATAATCCCTCGATCTTGGAGCCAGATCTGATGGAGATTGGCGTTCCAGAGCCCTCA
TGGATGGACCCGATTATGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGAAGGCAGCTCGTCAAGGGAATAGTCTGACCTGGGA
CATACATATTGGCCGATCTGAAAGGAGACGTCTTGGAGCACCCATGGAACGCGGAGCACCTGAAGCGTTATTACCCGTGAAATGA
Protein sequenceShow/hide protein sequence
MSHAQLEPPKLPPLALTGAKYKGPRCDLVKVRSFGKLGRSAQVFRSDIRSHPDLKTYVKMVHPANSANTTEQRGVNADNGPQRDLGARVVEDQVRAGQEDDLPRRSARHA
NQELPPAHPKPSKANRGRGRTSRKTSQRASQGADPEALSTLQRELDDMRHRLRTMEEMYAEAKRANRTASPSMAPGAPGEKGAPSIQPGDREPIPNDGGVDYSLRDNDLR
KHLTEKKKRASWEPEDSSSYSREFSNSNLKAQSKYKPLAPEAVVTREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTLAMKPYDGSKD
PKDYVEVFEGLMDFQTATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQREGETLREYVTRFQEEQLKVSHCSDYSAICYF
LTGLADETLTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLNQETRRPDVKSKDKGPSSFSSRTEYRRTESGPTRSRPYERYTPTTIPISEILTNIE
ESGMEKLLKRPEKLRGDLEKLSKDKYCRFHRDHDHDTTGCWELKRQIEELIQDGYFKKFVGKPRVNSVEKKEERKRSRTPPRRDGRPVVINTIFGGPSGGQSGNKRKELA
REARREVCIIREQKPTCSITFGNANLEGVHLPHNDALVIAPLIDHVLVRRVLVDEGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVSPEGCINLPGSAVCALEE
QTNRGKLQESEADLPKESKRQFSPPIEELELVPLLSPEKQVSIGTKLGATDSEELINFLRSNSDVFTWSHEDMPASAYETDLATAVPVEILDNPSILEPDLMEIGVPEPS
WMDPIMDFIRGNSPQDPKERRKLARKAARQGNSLTWDIHIGRSERRRLGAPMERGAPEALLPVK