; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g35880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g35880
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:26888951..26895134
RNA-Seq ExpressionMoc04g35880
SyntenyMoc04g35880
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]3.4e-21477.27Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALD
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKEGPLNDG+LGESPFTSDVLEAPIPPKFKAPTVKPYDGS DPKDYVEVFE LMDFQAA D
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSSGRAEYRR-----------------------------
        TVKL EEAPATF EVLQKAKKVIDGQELLRTK G PE+KIGRGR GKDI+ ADPKSKD+GSFSSGRAEYRR                             
Subjt:  TVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSSGRAEYRR-----------------------------

Query:  --------------------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGG
                                                                      PRTSS EKKEERKRSRTPPRRTDRPA+INTIFGGPSGG
Subjt:  --------------------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGG

Query:  QSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARA RREVCIIREQRPTCPIT DGADLE VHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.4e-21468.3Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKEGPLNDG+LGESPFTSDVLE        APTVK YDGS DPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQ

Query:  AALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AA DAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSSGRAEYRR-------------------------
        DEALTVKL +EAPATF EVLQKAKKVIDGQELLRTK G PE+ I RGR GKD +KAD KSKD+GSFSSGRAE+RR                         
Subjt:  DEALTVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSSGRAEYRR-------------------------

Query:  ------------------------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGG
                                                                          PRTSS EKKEERK SRTP RR DRPA+INTIFGG
Subjt:  ------------------------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGG

Query:  PSGGQSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSGGQSGHKRKELARA RREVCIIREQRPTCPIT D ADLE VHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ ASR+CYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETL

Query:  ASRDGTLEFEADLPRREFAAPTEELELVPLL
         SRDGTLEF+A+LPRREFAAPTEELELVPLL
Subjt:  ASRDGTLEFEADLPRREFAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.1e-21757.27Show/hide
Query:  MVQPANSTNTADRRTLATSDAHQREVGAAAVEGQGHDGLATEPLRRSTRITVPVLPPAHPRTSKATRGRGGTSKKGAQGPAPAPTNENFDALQREMEAMR
        MVQPANSTNTADRR LA +  HQREVGA  VEGQGH+ L TEPL RS RIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLATSDAHQREVGAAAVEGQGHDGLATEPLRRSTRITVPVLPPAHPRTSKATRGRGGTSKKGAQGPAPAPTNENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGLVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGLVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCRAFQIA
        P GVITR EFDQL+ K DAQVEALKA+CE+KE   +DG+LGE  F+SD+LEA IPPKFK PT+KPYDGS DPKDYVEVFE LMDFQAA DAIKC AFQIA
Subjt:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKLREEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPA

Query:  TFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRG-SFSSGRAEYRR--------------------------------------
        TF EVLQK KKVIDGQELLRTK G PE+ I +GR GKD  KAD KS+D+G S SS R +YRR                                      
Subjt:  TFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRG-SFSSGRAEYRR--------------------------------------

Query:  -----------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGGQSGHKRKEL
                                                             PR++S EKKEERKR RTPPRR DRPA+IN             K+KEL
Subjt:  -----------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGGQSGHKRKEL

Query:  ARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID
        AR  RREVCIIREQRPT  I  + ADLEGVHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCID
Subjt:  ARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID

Query:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETLASRD
        LPV++ QD TQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SR+CYAS  K SSVCALE    RD
Subjt:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETLASRD

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]3.0e-19464.89Show/hide
Query:  DNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFT
        ++E   ++ R  DLR+HL  K+  +  + +   S SR   +SN +A+S + P  P  VI R EFD ++ + D QVEALKA+CE+KE P +D +LGESPFT
Subjt:  DNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFT

Query:  SDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLAT
        SD++EAPIPPKFK PT+KPYDGS DPKDYVEVFEGLMDFQAA DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ QFS RHYD+KTATHLAT
Subjt:  SDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLAT

Query:  IRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKS
        IRQKE                                   DE LTVKL EEAPATF EVLQ AKKVIDGQELLRTK   PE++I + R+ +  +K D KS
Subjt:  IRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKS

Query:  KDRGSFSSG-RAEYRR------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGGQSGHKRKELA
        KD+GS SSG R EYRR                                    PR++S EKKEERKRSRTPPRR DRPA+INTIFGGPSGGQ  +KRKELA
Subjt:  KDRGSFSSG-RAEYRR------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGGQSGHKRKELA

Query:  RADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDL
           RR+V IIREQ+PTC IT    DLEGVHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESV PEGCIDL
Subjt:  RADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETLASRDGTLEFEADLP
        PVT+GQD TQVTQMAEFVVIDGR AYN IF RPIIHSF+A+PS LHQVLKYSTPNGVGTVRGEQ  SR+CYASALK SSVCALE   S+D       DLP
Subjt:  PVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETLASRDGTLEFEADLP

Query:  R
        R
Subjt:  R

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]1.5e-19372.12Show/hide
Query:  MEAMRTQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGLVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS
        MEAMRTQMR+MEEMYN+M+  AG  SRS +++   D+ EQ   H   V+EEH             GDLR+HLNRKR SS R  ++ +  H++SNQQAESS
Subjt:  MEAMRTQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGLVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS

Query:  HNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCR
        +NP  P GVITR EF+QL+ K DAQVEALK +CE+KE   +DG+LGESPFTSD+LEA IPPKFK PT+K YDGS DPKDYVEVFEGLMDFQAA DAIKCR
Subjt:  HNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCR

Query:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLR
        AFQIALTGSARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KT THLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADE  TVKL 
Subjt:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLR

Query:  EEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSS-GRAEYRRPRTSSTEKKEERKRSRTPPRRTDRPAIINTIFG
        EEA ATF EVLQ  KK IDGQELLRTK   PE++I + +  +D +KAD KSKD+GS SS  R +Y   R++S EKKEERKRSRTPPR  DRPA+INTIFG
Subjt:  EEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSS-GRAEYRRPRTSSTEKKEERKRSRTPPRRTDRPAIINTIFG

Query:  GPSGGQSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLV
        GPSGGQSG+KRKELAR   REVCIIREQRPTC +T D +DLEGVHLP+NDALVIAPLIDHV+VRRVLVDGGASANILS    LALGWTRSQLKKSPTPLV
Subjt:  GPSGGQSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLV

Query:  GFSGESVIPEGCI
        GFS ESV  +G +
Subjt:  GFSGESVIPEGCI

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.6e-21477.27Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALD
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKEGPLNDG+LGESPFTSDVLEAPIPPKFKAPTVKPYDGS DPKDYVEVFE LMDFQAA D
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSSGRAEYRR-----------------------------
        TVKL EEAPATF EVLQKAKKVIDGQELLRTK G PE+KIGRGR GKDI+ ADPKSKD+GSFSSGRAEYRR                             
Subjt:  TVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSSGRAEYRR-----------------------------

Query:  --------------------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGG
                                                                      PRTSS EKKEERKRSRTPPRRTDRPA+INTIFGGPSGG
Subjt:  --------------------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGG

Query:  QSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
        QSG KRKELARA RREVCIIREQRPTCPIT DGADLE VHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.6e-21468.3Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKEGPLNDG+LGESPFTSDVLE        APTVK YDGS DPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQ

Query:  AALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AA DAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSSGRAEYRR-------------------------
        DEALTVKL +EAPATF EVLQKAKKVIDGQELLRTK G PE+ I RGR GKD +KAD KSKD+GSFSSGRAE+RR                         
Subjt:  DEALTVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSSGRAEYRR-------------------------

Query:  ------------------------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGG
                                                                          PRTSS EKKEERK SRTP RR DRPA+INTIFGG
Subjt:  ------------------------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGG

Query:  PSGGQSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSGGQSGHKRKELARA RREVCIIREQRPTCPIT D ADLE VHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ ASR+CYASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETL

Query:  ASRDGTLEFEADLPRREFAAPTEELELVPLL
         SRDGTLEF+A+LPRREFAAPTEELELVPLL
Subjt:  ASRDGTLEFEADLPRREFAAPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204795.4e-21857.27Show/hide
Query:  MVQPANSTNTADRRTLATSDAHQREVGAAAVEGQGHDGLATEPLRRSTRITVPVLPPAHPRTSKATRGRGGTSKKGAQGPAPAPTNENFDALQREMEAMR
        MVQPANSTNTADRR LA +  HQREVGA  VEGQGH+ L TEPL RS RIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLATSDAHQREVGAAAVEGQGHDGLATEPLRRSTRITVPVLPPAHPRTSKATRGRGGTSKKGAQGPAPAPTNENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGLVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGLVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCRAFQIA
        P GVITR EFDQL+ K DAQVEALKA+CE+KE   +DG+LGE  F+SD+LEA IPPKFK PT+KPYDGS DPKDYVEVFE LMDFQAA DAIKC AFQIA
Subjt:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKLREEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPA

Query:  TFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRG-SFSSGRAEYRR--------------------------------------
        TF EVLQK KKVIDGQELLRTK G PE+ I +GR GKD  KAD KS+D+G S SS R +YRR                                      
Subjt:  TFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRG-SFSSGRAEYRR--------------------------------------

Query:  -----------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGGQSGHKRKEL
                                                             PR++S EKKEERKR RTPPRR DRPA+IN             K+KEL
Subjt:  -----------------------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGGQSGHKRKEL

Query:  ARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID
        AR  RREVCIIREQRPT  I  + ADLEGVHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCID
Subjt:  ARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID

Query:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETLASRD
        LPV++ QD TQVTQMAEFVVIDGRSAYN IFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SR+CYAS  K SSVCALE    RD
Subjt:  LPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETLASRD

A0A6J1DPC9 uncharacterized protein LOC1110222801.4e-19464.89Show/hide
Query:  DNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFT
        ++E   ++ R  DLR+HL  K+  +  + +   S SR   +SN +A+S + P  P  VI R EFD ++ + D QVEALKA+CE+KE P +D +LGESPFT
Subjt:  DNESEGHTRRRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFT

Query:  SDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLAT
        SD++EAPIPPKFK PT+KPYDGS DPKDYVEVFEGLMDFQAA DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ QFS RHYD+KTATHLAT
Subjt:  SDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLAT

Query:  IRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKS
        IRQKE                                   DE LTVKL EEAPATF EVLQ AKKVIDGQELLRTK   PE++I + R+ +  +K D KS
Subjt:  IRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKS

Query:  KDRGSFSSG-RAEYRR------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGGQSGHKRKELA
        KD+GS SSG R EYRR                                    PR++S EKKEERKRSRTPPRR DRPA+INTIFGGPSGGQ  +KRKELA
Subjt:  KDRGSFSSG-RAEYRR------------------------------------PRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGGQSGHKRKELA

Query:  RADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDL
           RR+V IIREQ+PTC IT    DLEGVHLPHNDALVIAPLIDHV+VRRVLVDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESV PEGCIDL
Subjt:  RADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDL

Query:  PVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETLASRDGTLEFEADLP
        PVT+GQD TQVTQMAEFVVIDGR AYN IF RPIIHSF+A+PS LHQVLKYSTPNGVGTVRGEQ  SR+CYASALK SSVCALE   S+D       DLP
Subjt:  PVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKYSTPNGVGTVRGEQTASRKCYASALKGSSVCALETLASRDGTLEFEADLP

Query:  R
        R
Subjt:  R

A0A6J1DPN4 uncharacterized protein LOC1110230607.2e-19472.12Show/hide
Query:  MEAMRTQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGLVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS
        MEAMRTQMR+MEEMYN+M+  AG  SRS +++   D+ EQ   H   V+EEH             GDLR+HLNRKR SS R  ++ +  H++SNQQAESS
Subjt:  MEAMRTQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGLVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESS

Query:  HNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCR
        +NP  P GVITR EF+QL+ K DAQVEALK +CE+KE   +DG+LGESPFTSD+LEA IPPKFK PT+K YDGS DPKDYVEVFEGLMDFQAA DAIKCR
Subjt:  HNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCR

Query:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLR
        AFQIALTGSARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KT THLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADE  TVKL 
Subjt:  AFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLR

Query:  EEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSS-GRAEYRRPRTSSTEKKEERKRSRTPPRRTDRPAIINTIFG
        EEA ATF EVLQ  KK IDGQELLRTK   PE++I + +  +D +KAD KSKD+GS SS  R +Y   R++S EKKEERKRSRTPPR  DRPA+INTIFG
Subjt:  EEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQKADPKSKDRGSFSS-GRAEYRRPRTSSTEKKEERKRSRTPPRRTDRPAIINTIFG

Query:  GPSGGQSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLV
        GPSGGQSG+KRKELAR   REVCIIREQRPTC +T D +DLEGVHLP+NDALVIAPLIDHV+VRRVLVDGGASANILS    LALGWTRSQLKKSPTPLV
Subjt:  GPSGGQSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLV

Query:  GFSGESVIPEGCI
        GFS ESV  +G +
Subjt:  GFSGESVIPEGCI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTACCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGACACGAATCACCGTGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCAGGGTCCAGCCCCGGCTCCAACAAATGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGTAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCTAGTCGAGGAGGAACA
TCCCGAAGACAACGAAAGCGAGGGACACACTCGCCGGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGAGTTGGGGGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAACGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATTAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACCGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAATT
GAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTTTAGCCGACGAAGCCCTCACGGTGAAACTTAGAGAGGAGGCCCCGGCCACCTTCACCG
AAGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACTAAAATCGGTCCACCAGAACAAAAGATCGGCCGGGGCAGAATTGGGAAAGATATACAA
AAGGCAGATCCCAAGTCCAAGGACAGGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGCCCAGGACCAGCTCGACAGAGAAAAAAGAAGAGCGAAAGCGTTCGAG
GACGCCGCCCCGGCGCACTGACCGACCTGCGATCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGACAGGC
GCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCGTCGACGGAGCAGACTTGGAGGGAGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCC
TTGATTGATCACGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAA
GAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGTATCGACTTGCCGGTCACGCTTGGGCAGGACCAGACTCAGGTCACCCAAATGG
CCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACACCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTAT
TCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGAAGTGCTATGCCTCCGCACTTAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAG
TAGGGATGGGACGCTTGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCAT
CAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGACGCTCCAGAGTCCTCATGGATG
GACCCGATCGAGGAATTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGTAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCG
ACGCGGCTTTTCCTTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTTCAAGGGAATAGTCCGACCTGGGACGTACGTGTTGGCCGATCTGAAAGGAGACGTCCTC
GCGCACCCATGGAACGTGGAGCACCTGAAGCGGCGCGACCTCAAAAGTACGAGGTGCGAGGTGACATTGGCTTTAGCTTAAAGAAAAAGCAAAGAAAGGAAGGCAACAAA
GTAAAAACAAATGGGAGCTTCTTTATTGAATGGAGAGCAGAGCCAAGGCTTATAGACCTTGCAGCTCTGCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTACCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGACACGAATCACCGTGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCAGGGTCCAGCCCCGGCTCCAACAAATGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGTAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCTAGTCGAGGAGGAACA
TCCCGAAGACAACGAAAGCGAGGGACACACTCGCCGGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGAGTTGGGGGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAACGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATTAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACCGGCAGCGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAATT
GAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTTTAGCCGACGAAGCCCTCACGGTGAAACTTAGAGAGGAGGCCCCGGCCACCTTCACCG
AAGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACTAAAATCGGTCCACCAGAACAAAAGATCGGCCGGGGCAGAATTGGGAAAGATATACAA
AAGGCAGATCCCAAGTCCAAGGACAGGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGCCCAGGACCAGCTCGACAGAGAAAAAAGAAGAGCGAAAGCGTTCGAG
GACGCCGCCCCGGCGCACTGACCGACCTGCGATCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGACAGGC
GCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCGTCGACGGAGCAGACTTGGAGGGAGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCC
TTGATTGATCACGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAA
GAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGTATCGACTTGCCGGTCACGCTTGGGCAGGACCAGACTCAGGTCACCCAAATGG
CCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACACCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTAT
TCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGAAGTGCTATGCCTCCGCACTTAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAG
TAGGGATGGGACGCTTGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCAT
CAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGACGCTCCAGAGTCCTCATGGATG
GACCCGATCGAGGAATTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGTAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCG
ACGCGGCTTTTCCTTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTTCAAGGGAATAGTCCGACCTGGGACGTACGTGTTGGCCGATCTGAAAGGAGACGTCCTC
GCGCACCCATGGAACGTGGAGCACCTGAAGCGGCGCGACCTCAAAAGTACGAGGTGCGAGGTGACATTGGCTTTAGCTTAAAGAAAAAGCAAAGAAAGGAAGGCAACAAA
GTAAAAACAAATGGGAGCTTCTTTATTGAATGGAGAGCAGAGCCAAGGCTTATAGACCTTGCAGCTCTGCCCTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLATSDAHQREVGAAAVEGQGHDGLATEPLRRSTRITVPVLPPAHPRTSKATRGRGGTSKKGAQGPAPAPTNENFDALQREMEAMRTQMRSMEEMY
NEMILAAGVGSRSENRMTRIDIREQRGSHLGLVEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQ
VEALKAKCEQKEGPLNDGELGESPFTSDVLEAPIPPKFKAPTVKPYDGSNDPKDYVEVFEGLMDFQAALDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATFTEVLQKAKKVIDGQELLRTKIGPPEQKIGRGRIGKDIQ
KADPKSKDRGSFSSGRAEYRRPRTSSTEKKEERKRSRTPPRRTDRPAIINTIFGGPSGGQSGHKRKELARADRREVCIIREQRPTCPITVDGADLEGVHLPHNDALVIAP
LIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNTIFGRPIIHSFRALPSTLHQVLKY
STPNGVGTVRGEQTASRKCYASALKGSSVCALETLASRDGTLEFEADLPRREFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMEIDAPESSWM
DPIEEFIRGNSPQDPKERRKLARRAARFVVRGGALYRRGFSLPLLRCLTPEEGLQGNSPTWDVRVGRSERRRPRAPMERGAPEAARPQKYEVRGDIGFSLKKKQRKEGNK
VKTNGSFFIEWRAEPRLIDLAALP