; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g18200 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g18200
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:13602528..13608110
RNA-Seq ExpressionMoc02g18200
SyntenyMoc02g18200
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]5.1e-27292.61Show/hide
Query:  QAEYSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AE S N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  L+DGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAEYSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E A+PKSKDKGSFSSG  EYRR E+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGFSKE
        QSG KRKELAR ARREVCIIREQ PTCPITFDGADLEE HLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL WTRSQLK+SPTPLVGFS E
Subjt:  QSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGFSKE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]1.4e-22194.31Show/hide
Query:  KDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSL+DGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDC
        GRGRSGKD ERA+PKSKDKGSFSSG  EYRR ESGPT+SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSDC
Subjt:  GRGRSGKD-ERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR ARREVCIIREQGPTCPITFDGAD EE H
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.9e-27577.86Show/hide
Query:  SSNQQAEYSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAE SHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  L+DGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAEYSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+A+ KSKDKGSFSSG  E+RR  +GPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGF
        SGGQSGHKRKELAR ARREVCIIREQ PTCPITFD ADLEE HLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLAL WTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGF

Query:  SKESVIPEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-
        S+ESVIPEGCIDLPVTLG DQTQVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SKESVIPEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSISEPDLMEIGTPE
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +   ++     ++   + +  + D+   G PE
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSISEPDLMEIGTPE

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.8e-26965.95Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKDTRGRGGTSKKGARGPPPDPTSENLDALKREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKDTRGRGGTSKKGARGPPPDPTSENLDALKREMEAMR

Query:  TQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHVNRKRGSSLRKGQSPSRSHRSSNQQAEYSHNP--
                                                                                                   AE S+NP  
Subjt:  TQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHVNRKRGSSLRKGQSPSRSHRSSNQQAEYSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S  DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RANPKSKDKG-SFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +A+ KS+DKG S SS   +YRR  S   +SRPYE +TPTTIPI EILTNIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RANPKSKDKG-SFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+RN DKYCRFHR+H HNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGFSKESVIPEGCIDL
        R ARREVCIIREQ PT  I F+ ADLE  HLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLAL WTRSQLK+SPTPLVGFS ES+  EGCIDL
Subjt:  RTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGFSKESVIPEGCIDL

Query:  PVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD
        PV++ QD TQVTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYA+  K SSVCALE  T+RD
Subjt:  PVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.8e-22175.82Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ANPKSKDKGSFSSGG-PEYRRVESGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  +++R A+ KS+DKGS SS    EYRR+ESGP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ANPKSKDKGSFSSGG-PEYRRVESGPTRSRPYERFTPTTIP

Query:  ISEILTNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIE+SGMEKLLKRPEKLRG  E+RNK+KYCRFHR+H HNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE  PTC ITF  ADLE  HLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSP

Query:  TPLVGFSKESVIPEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYA+ALKGS+VC
Subjt:  TPLVGFSKESVIPEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVC

Query:  ALE--TLRDGTLEFEADLP---RKEFAAPTEELELVPLLSPEKQTD
        ALE  T R    E EADLP   +++F  PTEELELVPLLSPE+Q +
Subjt:  ALE--TLRDGTLEFEADLP---RKEFAAPTEELELVPLLSPEKQTD

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.5e-27292.61Show/hide
Query:  QAEYSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AE S N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  L+DGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAEYSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E A+PKSKDKGSFSSG  EYRR E+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGFSKE
        QSG KRKELAR ARREVCIIREQ PTCPITFDGADLEE HLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL WTRSQLK+SPTPLVGFS E
Subjt:  QSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGFSKE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.4e-27577.86Show/hide
Query:  SSNQQAEYSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAE SHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  L+DGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAEYSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+A+ KSKDKGSFSSG  E+RR  +GPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGF
        SGGQSGHKRKELAR ARREVCIIREQ PTCPITFD ADLEE HLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLAL WTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGF

Query:  SKESVIPEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-
        S+ESVIPEGCIDLPVTLG DQTQVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCALETL 
Subjt:  SKESVIPEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSISEPDLMEIGTPE
         RDGTLEF+A+LPR+EFAAPTEELELVPLL  +   ++     ++   + +  + D+   G PE
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSISEPDLMEIGTPE

A0A6J1D9W7 uncharacterized protein LOC1110187086.8e-22294.31Show/hide
Query:  KDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSL+DGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKD-ERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDC
        GRGRSGKD ERA+PKSKDKGSFSSG  EYRR ESGPT+SRPYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSDC
Subjt:  GRGRSGKD-ERANPKSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDH
        WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR ARREVCIIREQGPTCPITFDGAD EE H
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204798.7e-27065.95Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKDTRGRGGTSKKGARGPPPDPTSENLDALKREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKDTRGRGGTSKKGARGPPPDPTSENLDALKREMEAMR

Query:  TQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHVNRKRGSSLRKGQSPSRSHRSSNQQAEYSHNP--
                                                                                                   AE S+NP  
Subjt:  TQMHSMEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHVNRKRGSSLRKGQSPSRSHRSSNQQAEYSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S  DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RANPKSKDKG-SFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +A+ KS+DKG S SS   +YRR  S   +SRPYE +TPTTIPI EILTNIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RANPKSKDKG-SFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+RN DKYCRFHR+H HNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGFSKESVIPEGCIDL
        R ARREVCIIREQ PT  I F+ ADLE  HLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLAL WTRSQLK+SPTPLVGFS ES+  EGCIDL
Subjt:  RTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSPTPLVGFSKESVIPEGCIDL

Query:  PVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD
        PV++ QD TQVTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYA+  K SSVCALE  T+RD
Subjt:  PVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALE--TLRD

A0A6J1DZB9 uncharacterized protein LOC1110249048.9e-22275.82Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ANPKSKDKGSFSSGG-PEYRRVESGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  +++R A+ KS+DKGS SS    EYRR+ESGP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ANPKSKDKGSFSSGG-PEYRRVESGPTRSRPYERFTPTTIP

Query:  ISEILTNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIE+SGMEKLLKRPEKLRG  E+RNK+KYCRFHR+H HNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE  PTC ITF  ADLE  HLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALCWTRSQLKRSP

Query:  TPLVGFSKESVIPEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYA+ALKGS+VC
Subjt:  TPLVGFSKESVIPEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVC

Query:  ALE--TLRDGTLEFEADLP---RKEFAAPTEELELVPLLSPEKQTD
        ALE  T R    E EADLP   +++F  PTEELELVPLLSPE+Q +
Subjt:  ALE--TLRDGTLEFEADLP---RKEFAAPTEELELVPLLSPEKQTD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGTAACGGAACCCCTCCGCAGGTCAGCACGGATCACCGCACCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGACACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCACCCCCGGATCCAACAAGCGAGAACCTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCACTCCATGGAGGAAATGTAT
AACGAAATGATGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAAGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTATACTCGTCAGAGGGGAGACCTCCGTGAGCATGTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CACACAGGAGTTCCAACCAGCAGGCTGAATACTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAGCGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTGTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCAGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATTGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCT
CGGCACTACGACAAAAAGACAGCGACTCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTACTCC
AGAAGGCGAAGAAAGTCATCGACGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGCGGAAAAGATGAAAGGGCAAATCCC
AAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCGGACCTGAGTATCGAAGGGTGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCC
AATTTCCGAGATCCTAACAAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAAAGGCGCAACAAGGACAAGTATTGCC
GCTTCCATCGGGAGCACGACCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTCGTGGGAAAGCCCAGG
ACTAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCACCCAGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCA
ATCCGGACATAAGAGAAAGGAGTTAGCTCGTACAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGACCGACCTGCCCAATCACCTTCGACGGTGCAGATTTGGAGG
AGGACCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATCGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTA
CCGACCTACCTCGCCTTGTGCTGGACGAGGTCGCAATTGAAGAGAAGTCCGACGCCGCTGGTTGGGTTCTCTAAAGAGTCGGTCATCCCAGAGGGTTGCATCGACTTGCC
GGTCACGCTGGGGCAGGACCAAACTCAGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGCAGATCGGCTTATAACGCCATCTTTGGGAGACCCATCATCCACTCAT
TTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCA
CTCAAAGGCTCGTCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAATTTGCCGCACCCACTGAGGAGCTCGAACT
TGTTCCTCTGCTTAGTCCCGAGAAGCAGACCGACCTCGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCTTCGATCTCGGAGCCAGATCTGATGGAGATCGGCACTC
CAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAAGGGCAACTCACCACAGGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAACATTAT
GAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGTCAGACA
TTACAATGCCTGTGTCCGACCTCGAGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGACCCGGCCTGGGAGGGCCCGTTTGAGA
TCAAGGGCATAGTCCGACCTCGGACGTACATGTTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGTAACGGAACCCCTCCGCAGGTCAGCACGGATCACCGCACCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGACACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCACCCCCGGATCCAACAAGCGAGAACCTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCACTCCATGGAGGAAATGTAT
AACGAAATGATGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAAGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGGTATACTCGTCAGAGGGGAGACCTCCGTGAGCATGTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCT
CACACAGGAGTTCCAACCAGCAGGCTGAATACTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCC
TTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAGCGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTGTTGGAAGCACCAATCCCTCCGAAGTTCAAAGC
TCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCAGCATCAGACGCAATCAAATGCCGCGCCT
TTCAGATTGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCT
CGGCACTACGACAAAAAGACAGCGACTCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGC
ACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTACTCC
AGAAGGCGAAGAAAGTCATCGACGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGCGGAAAAGATGAAAGGGCAAATCCC
AAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCGGACCTGAGTATCGAAGGGTGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCC
AATTTCCGAGATCCTAACAAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAAAGGCGCAACAAGGACAAGTATTGCC
GCTTCCATCGGGAGCACGACCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTCGTGGGAAAGCCCAGG
ACTAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCACCCAGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCA
ATCCGGACATAAGAGAAAGGAGTTAGCTCGTACAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGACCGACCTGCCCAATCACCTTCGACGGTGCAGATTTGGAGG
AGGACCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATCGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTA
CCGACCTACCTCGCCTTGTGCTGGACGAGGTCGCAATTGAAGAGAAGTCCGACGCCGCTGGTTGGGTTCTCTAAAGAGTCGGTCATCCCAGAGGGTTGCATCGACTTGCC
GGTCACGCTGGGGCAGGACCAAACTCAGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGCAGATCGGCTTATAACGCCATCTTTGGGAGACCCATCATCCACTCAT
TTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCA
CTCAAAGGCTCGTCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAATTTGCCGCACCCACTGAGGAGCTCGAACT
TGTTCCTCTGCTTAGTCCCGAGAAGCAGACCGACCTCGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCTTCGATCTCGGAGCCAGATCTGATGGAGATCGGCACTC
CAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAAGGGCAACTCACCACAGGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAACATTAT
GAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGTCAGACA
TTACAATGCCTGTGTCCGACCTCGAGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGACCCGGCCTGGGAGGGCCCGTTTGAGA
TCAAGGGCATAGTCCGACCTCGGACGTACATGTTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLVTEPLRRSARITAPALPPAHPRTSKDTRGRGGTSKKGARGPPPDPTSENLDALKREMEAMRTQMHSMEEMY
NEMMLAAGAGSRSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHVNRKRGSSLRKGQSPSRSHRSSNQQAEYSHNPAGIITREEFDQLRGELDAQVEA
LKAKCEQKDDSLSDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSS
RHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERANP
KSKDKGSFSSGGPEYRRVESGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPR
TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARTARREVCIIREQGPTCPITFDGADLEEDHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSL
PTYLALCWTRSQLKRSPTPLVGFSKESVIPEGCIDLPVTLGQDQTQVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAA
LKGSSVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSISEPDLMEIGTPESSWMDPIADFIKGNSPQDPKERRKLARRAARVEHY
EPTTNEDGLLLNLDLLEERRAMAQLRLAEYQGRMVRHYNACVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPRTYMLADLKGDVLAHPWNAEHLKRYYP