; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g07140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g07140
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:5625050..5630973
RNA-Seq ExpressionMoc09g07140
SyntenyMoc09g07140
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]4.1e-24285.23Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDG+LGESPFTSDVLEAPIPPKFKAPTVKPY+G+KDPKDYVEVFE LMDFQ ASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVAHCS+DSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEAL

Query:  TVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLG+EAPATFAEVLQKAKKVIDGQEL+RTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAE RRAE+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDK                                       TSSA+KKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPL+DHVVV RVLVDGG SANILSLPTYLA+GW RSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTVGQDRTRVTQMAEFV
        SVIPEG IDLPVT+GQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTVGQDRTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.3e-24572.52Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDG+LGESPFTSDVLE        APTVK Y+G+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQ

Query:  TASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLA
         ASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  S+DSAMCYFLTGLA
Subjt:  TASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLA

Query:  DEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILT
        DEALTVKLGKEAPATFAEVLQKAKKVIDGQEL+RTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE RRA +GPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGMEKLLKRPEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIEESGMEKLLKRPEKLRGAPERR+KDK                                       TSSA+KKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEESGMEKLLKRPEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGF
        SGGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPL+DHVVVRRVLVD G SANI+SL TYLA+GW RSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLA
        S ESVIPEGCIDLPVT+G D+T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFR IPSTLHQVLKY TPNGVG +RGEQIASRECYAS LKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLA

Query:  GRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQVTSAYETDLARSVPVEILDN
         RDG LEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+
Subjt:  GRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQVTSAYETDLARSVPVEILDN

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.0e-23760.38Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSAQITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSA+IT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSAQITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRTMEEMYNEMMLAADAGSRSENRVTRVDVCEQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNGKRCSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRTMEEMYNEMMLAADAGSRSENRVTRVDVCEQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNGKRCSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DG+LGE  F+SD+LEA IPPKFK PT+KPY+G+KDPKDYVEVFE LMDFQ A+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGKEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCS+DSAMCYFLTGLADE LTVKL +EAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGKEAPAT

Query:  FAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR
        FAEVLQK KKVIDGQEL+RTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R + RR+ S   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR

Query:  PEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+R+ DK                                       ++S +KKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGESVIPEGCIDL
        R ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPL+D V+VRR+LVDGGASANILSL TYLA+GW RSQLK+SPTPLVGFSGES+  EGCIDL
Subjt:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGESVIPEGCIDL

Query:  PVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLAGRD
        PV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTLHQVLKY T NGVGT+RGE   SRECYAS  K SSVCALE    RD
Subjt:  PVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLAGRD

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]3.1e-19762.62Show/hide
Query:  EQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNGKRCSSLRKGQ---SPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ
        E+    + P   E   ++E   Y+ +  DLR+HL  K+  +  + +   S SR   +SN +A+S +    P  +I REEFD ++   D QVEALKA+CE+
Subjt:  EQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNGKRCSSLRKGQ---SPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        K+   +D +LGESPFTSD++EAPIPPKFK PT+KPY+G+KDPKDYVEVFEGLMDFQ A+DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKI
        FS RHYD+KT THLATIRQKE                                   DE LTVKLG+EAPATFAEVLQ AKKVIDGQEL+RTKT RPE++I
Subjt:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKI

Query:  GRGR-SGKDERADPKSKDKGSFSSG-RAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKTSSAKKKEERKRSR
         + R S K  + D KSKDKGS SSG R E RR+ESGP+RSRPYER       I ++   I++S  +K + +P             +++S +KKEERKRSR
Subjt:  GRGR-SGKDERADPKSKDKGSFSSG-RAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKTSSAKKKEERKRSR

Query:  TPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYL
        TPPRR DRPAVINTIFGGPSGGQ  +KRKELA  ARR+V IIREQ PTC ITF   DLE VHLPHNDALVIAPL+DHV+VRRVLVDGGASANILSLPTYL
Subjt:  TPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYL

Query:  AMGWMRSQLKRSPTPLVGFSGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASR
        A+   RSQLK+SPTPLVGFS ESV PEGCIDLPVT+GQD T+VTQMAEFVVIDGR AYNAIF RPIIHSF+ +PS LHQVLKY TPNGVGT+RGEQ  SR
Subjt:  AMGWMRSQLKRSPTPLVGFSGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASR

Query:  ECYASTLKGSSVCALETLAGRDGALEFEADLPRK
        ECYAS LK SSVCALE    +D       DLPR+
Subjt:  ECYASTLKGSSVCALETLAGRDGALEFEADLPRK

XP_022157676.1 uncharacterized protein LOC111024332 [Momordica charantia]2.0e-20493.22Show/hide
Query:  LADEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEI
        +ADEALTVKLG+EAPATFAEVLQKAKKVIDGQEL+RTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAE RRAE+GPTRSRPYERFTPTTIPISEI
Subjt:  LADEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGAPERRSKDKTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGA
        LTNIE+SGMEKLLKRPEKLRGAPERRSKDKTSSA+KKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR ARREVCIIREQGPTCPITFDGA
Subjt:  LTNIEESGMEKLLKRPEKLRGAPERRSKDKTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGA

Query:  DLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRS
        DLEEVHLPHNDALVIAPL+DHVVVRRVLVDGGASANILSLPTYLA+GW RSQLKRSPTPLVGFSGESVIPEGCIDLPVT+GQD+TRVTQM EFVV+DGRS
Subjt:  DLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRS

Query:  AYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQ
         YNAIFGRPIIHSFR IPSTLHQVLKY TPNGVGT+RGEQ  SRECYA+ LKGSSVCALETL  RDG LE EADLPRKEFAAPTEELELVPLLSPEKQ
Subjt:  AYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.0e-24285.23Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDG+LGESPFTSDVLEAPIPPKFKAPTVKPY+G+KDPKDYVEVFE LMDFQ ASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVAHCS+DSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEAL

Query:  TVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLG+EAPATFAEVLQKAKKVIDGQEL+RTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAE RRAE+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDK                                       TSSA+KKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPL+DHVVV RVLVDGG SANILSLPTYLA+GW RSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTVGQDRTRVTQMAEFV
        SVIPEG IDLPVT+GQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTVGQDRTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.1e-24572.52Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDG+LGESPFTSDVLE        APTVK Y+G+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQ

Query:  TASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLA
         ASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  S+DSAMCYFLTGLA
Subjt:  TASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLA

Query:  DEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILT
        DEALTVKLGKEAPATFAEVLQKAKKVIDGQEL+RTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE RRA +GPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILT

Query:  NIEESGMEKLLKRPEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIEESGMEKLLKRPEKLRGAPERR+KDK                                       TSSA+KKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEESGMEKLLKRPEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGF
        SGGQSGHKRKELARAARREVCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPL+DHVVVRRVLVD G SANI+SL TYLA+GW RSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLA
        S ESVIPEGCIDLPVT+G D+T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFR IPSTLHQVLKY TPNGVG +RGEQIASRECYAS LKGSSVCALETL 
Subjt:  SGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLA

Query:  GRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQVTSAYETDLARSVPVEILDN
         RDG LEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+
Subjt:  GRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQVTSAYETDLARSVPVEILDN

A0A6J1DHB3 uncharacterized protein LOC1110204791.5e-23760.38Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSAQITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSA+IT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSAQITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRTMEEMYNEMMLAADAGSRSENRVTRVDVCEQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNGKRCSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRTMEEMYNEMMLAADAGSRSENRVTRVDVCEQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNGKRCSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DG+LGE  F+SD+LEA IPPKFK PT+KPY+G+KDPKDYVEVFE LMDFQ A+DAIKC AFQIAL
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGKEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCS+DSAMCYFLTGLADE LTVKL +EAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGKEAPAT

Query:  FAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR
        FAEVLQK KKVIDGQEL+RTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R + RR+ S   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKR

Query:  PEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA
        PEKLRG PE+R+ DK                                       ++S +KKEERKR RTPPRR DRPAVIN             K+KELA
Subjt:  PEKLRGAPERRSKDK---------------------------------------TSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA

Query:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGESVIPEGCIDL
        R ARREVCIIREQ PT  I F+ ADLE VHLPHNDALVIAPL+D V+VRR+LVDGGASANILSL TYLA+GW RSQLK+SPTPLVGFSGES+  EGCIDL
Subjt:  RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGESVIPEGCIDL

Query:  PVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLAGRD
        PV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTLHQVLKY T NGVGT+RGE   SRECYAS  K SSVCALE    RD
Subjt:  PVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLAGRD

A0A6J1DPC9 uncharacterized protein LOC1110222801.5e-19762.62Show/hide
Query:  EQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNGKRCSSLRKGQ---SPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ
        E+    + P   E   ++E   Y+ +  DLR+HL  K+  +  + +   S SR   +SN +A+S +    P  +I REEFD ++   D QVEALKA+CE+
Subjt:  EQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNGKRCSSLRKGQ---SPSRSHRSSNQQAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQ

Query:  KDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        K+   +D +LGESPFTSD++EAPIPPKFK PT+KPY+G+KDPKDYVEVFEGLMDFQ A+DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KDDSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKI
        FS RHYD+KT THLATIRQKE                                   DE LTVKLG+EAPATFAEVLQ AKKVIDGQEL+RTKT RPE++I
Subjt:  FSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSNDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKI

Query:  GRGR-SGKDERADPKSKDKGSFSSG-RAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKTSSAKKKEERKRSR
         + R S K  + D KSKDKGS SSG R E RR+ESGP+RSRPYER       I ++   I++S  +K + +P             +++S +KKEERKRSR
Subjt:  GRGR-SGKDERADPKSKDKGSFSSG-RAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKTSSAKKKEERKRSR

Query:  TPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYL
        TPPRR DRPAVINTIFGGPSGGQ  +KRKELA  ARR+V IIREQ PTC ITF   DLE VHLPHNDALVIAPL+DHV+VRRVLVDGGASANILSLPTYL
Subjt:  TPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYL

Query:  AMGWMRSQLKRSPTPLVGFSGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASR
        A+   RSQLK+SPTPLVGFS ESV PEGCIDLPVT+GQD T+VTQMAEFVVIDGR AYNAIF RPIIHSF+ +PS LHQVLKY TPNGVGT+RGEQ  SR
Subjt:  AMGWMRSQLKRSPTPLVGFSGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASR

Query:  ECYASTLKGSSVCALETLAGRDGALEFEADLPRK
        ECYAS LK SSVCALE    +D       DLPR+
Subjt:  ECYASTLKGSSVCALETLAGRDGALEFEADLPRK

A0A6J1DYW5 uncharacterized protein LOC1110243329.6e-20593.22Show/hide
Query:  LADEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEI
        +ADEALTVKLG+EAPATFAEVLQKAKKVIDGQEL+RTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAE RRAE+GPTRSRPYERFTPTTIPISEI
Subjt:  LADEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERFTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGAPERRSKDKTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGA
        LTNIE+SGMEKLLKRPEKLRGAPERRSKDKTSSA+KKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELAR ARREVCIIREQGPTCPITFDGA
Subjt:  LTNIEESGMEKLLKRPEKLRGAPERRSKDKTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGA

Query:  DLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRS
        DLEEVHLPHNDALVIAPL+DHVVVRRVLVDGGASANILSLPTYLA+GW RSQLKRSPTPLVGFSGESVIPEGCIDLPVT+GQD+TRVTQM EFVV+DGRS
Subjt:  DLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRS

Query:  AYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQ
         YNAIFGRPIIHSFR IPSTLHQVLKY TPNGVGT+RGEQ  SRECYA+ LKGSSVCALETL  RDG LE EADLPRKEFAAPTEELELVPLLSPEKQ
Subjt:  AYNAIFGRPIIHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCGAGGTCCGACCTACCGGGGAGCTCGGTAGGGGCCAATGTGAGCAACTGTCAGTCCGCGCAAGTGTTCAGATCGGCCC
GGAAGCCGAGTTCGAGTTGCAATCTGAAATACGTTGTTGTGCATATTCTTGCATAAACATTTGGCGCCGTCTGTGGGAACGACAATCTAAGTCATCCCAATTCTTTCAAA
CCAACACGCGAGCGACCATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCAGTA
GAGGGGCAAGGTCACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACAGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGG
CCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCA
CCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGACGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTATGCGAGCAAAGGGGTTCCCACCTCGGC
CCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACGGAAAGAGATGCTCGTCTCTCCGAAAAGG
GCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCCGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCG
ATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCAACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCAATC
CCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTACAATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAACGGCATCAGACGC
AATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCC
TCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTTCAGGAG
GAGCAGTTGAAGGTTGCACACTGCTCCAATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAAAGGAGGCCCCGGCCAC
CTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCATTCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAG
ATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTGTCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAACGCTTC
ACCCCGACCACGATTCCAATTTCTGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAG
CAAGGACAAGACCAGCTCAGCAAAAAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAA
GCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCTAGACGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCA
GACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGGTTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACAT
CCTGTCCTTACCGACCTACCTCGCCATGGGATGGATGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTAGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCA
TCGACTTACCGGTCACGGTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATC
ATCCACTCATTTCGGGTCATTCCCTCAACACTGCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGATCCGAGGAGAACAGATCGCTTCGAGGGAGTGTTA
TGCCTCCACACTCAAGGGCTCATCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGTTTGAGGCCGACCTGCCAAGGAAGGAGTTTGCCGCACCCA
CTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTAACATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGACAATCCCTCG
ATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCTTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAAGAGCGCAGAAA
GTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGACATTGTACCGACGTGGCTTTTCTCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAGCAGAGC
CAAGGCTTATAGACCTTGCAGCTCTGCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCGAGGTCCGACCTACCGGGGAGCTCGGTAGGGGCCAATGTGAGCAACTGTCAGTCCGCGCAAGTGTTCAGATCGGCCC
GGAAGCCGAGTTCGAGTTGCAATCTGAAATACGTTGTTGTGCATATTCTTGCATAAACATTTGGCGCCGTCTGTGGGAACGACAATCTAAGTCATCCCAATTCTTTCAAA
CCAACACGCGAGCGACCATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCAGTA
GAGGGGCAAGGTCACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACAGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGG
CCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCA
CCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGACGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTATGCGAGCAAAGGGGTTCCCACCTCGGC
CCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACGGAAAGAGATGCTCGTCTCTCCGAAAAGG
GCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCCGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCG
ATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCAACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCAATC
CCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTACAATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAACGGCATCAGACGC
AATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCC
TCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTTCAGGAG
GAGCAGTTGAAGGTTGCACACTGCTCCAATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAAAGGAGGCCCCGGCCAC
CTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCATTCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAG
ATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTGTCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAACGCTTC
ACCCCGACCACGATTCCAATTTCTGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAG
CAAGGACAAGACCAGCTCAGCAAAAAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAA
GCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCTAGACGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCA
GACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGGTTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACAT
CCTGTCCTTACCGACCTACCTCGCCATGGGATGGATGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTAGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCA
TCGACTTACCGGTCACGGTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATC
ATCCACTCATTTCGGGTCATTCCCTCAACACTGCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGATCCGAGGAGAACAGATCGCTTCGAGGGAGTGTTA
TGCCTCCACACTCAAGGGCTCATCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGTTTGAGGCCGACCTGCCAAGGAAGGAGTTTGCCGCACCCA
CTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTAACATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGACAATCCCTCG
ATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCTTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAAGAGCGCAGAAA
GTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGACATTGTACCGACGTGGCTTTTCTCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAGCAGAGC
CAAGGCTTATAGACCTTGCAGCTCTGCCCTGA
Protein sequenceShow/hide protein sequence
MLSMRAEVNLAEVRPTGELGRGQCEQLSVRASVQIGPEAEFELQSEIRCCAYSCINIWRRLWERQSKSSQFFQTNTRATMVQPANSTNTTDRRTLAASDAHQREVGAAAV
EGQGHDGLATEPLRRSAQITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRTMEEMYNEMMLAADAGSRSENRVTRVDVCEQRGSHLG
PAEEERPEDNESEGYTRQRGDLREHLNGKRCSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGNLGESPFTSDVLEAPI
PPKFKAPTVKPYNGTKDPKDYVEVFEGLMDFQTASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQE
EQLKVAHCSNDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELIRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAECRRAESGPTRSRPYERF
TPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKTSSAKKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGA
DLEEVHLPHNDALVIAPLVDHVVVRRVLVDGGASANILSLPTYLAMGWMRSQLKRSPTPLVGFSGESVIPEGCIDLPVTVGQDRTRVTQMAEFVVIDGRSAYNAIFGRPI
IHSFRVIPSTLHQVLKYPTPNGVGTIRGEQIASRECYASTLKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQVTSAYETDLARSVPVEILDNPS
ISEPDLMEIGAPESSWMDPIADFIRGNSPQDPKERRKLARRAARFVVRDGTLYRRGFSLPLLRCLTPEEGLAEPRLIDLAALP