; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g06890 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g06890
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:4829089..4834674
RNA-Seq ExpressionMoc04g06890
SyntenyMoc04g06890
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.2e-23382.77Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASN
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAK                        +PIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAAS+
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASN

Query:  AIKFRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIK RAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKFRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEASATFVE----------------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEA ATF E                             RSGKDIE ADPKSKDKGSFSS RAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEASATFVE----------------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  KSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGG
        +SGMEK LK PEKLRGAP+R  KDKYCRFHRE GHNTSD  ELK QIE+LIQDGYFKKFVGKP+TSS EKKEERKRSR PPRRTDRPAVINTIFGGPSGG
Subjt:  KSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDHVVV  VL+DGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]5.0e-23773.38Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAK---------------SPIPPK-FKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASNAIKF
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAK               SP      +APTVK YDGSKDPKDYVEVFEGLMDFQAAS+AIK 
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAK---------------SPIPPK-FKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASNAIKF

Query:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL
        RAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLADEALTVKL
Subjt:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL

Query:  GEEASATFVE----------------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGM
        G+EA ATF E                             RSGKD EKAD KSKDKGSFSS RAE+RRA NGPTRSRPYERFTPTTIPISEILTNIE+SGM
Subjt:  GEEASATFVE----------------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGM

Query:  EKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGGQSGH
        EK LK PEKLRGAP+R  KDKYCRFHRE  HNTSD  ELK QIEDLIQD YFKKFVGKP+TSS EKKEERK SR P RR DRPAVINTIFGGPSGGQSGH
Subjt:  EKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGGQSGH

Query:  KRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIP
        KRKELARAARREVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDHVVV+ VL+D G SANI+SL TYLALGWTRSQL KS TPLVGFS ESVIP
Subjt:  KRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIP

Query:  EGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALETLAGRDGTLE
        EGCIDLPVTLG DQTQVTQMAEFVVIDGR AYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVG VRGE+ ASRECYASALKGSSVCALETL  RDGTLE
Subjt:  EGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALETLAGRDGTLE

Query:  FEADLPRREFAAPTEELKLVPLLTSAYETDL
        F+A+LPRREFAAPTEEL+LVPLL   Y  ++
Subjt:  FEADLPRREFAAPTEELKLVPLLTSAYETDL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]1.2e-20383.9Show/hide
Query:  MCYFLTGLADEALTVKLGEEASATFVE---------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALTVKL EEA ATF E                      RSGKD+E  DPKSKDKGSFS+ RAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTVKLGEEASATFVE---------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEKSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIF
        ILTNIE+SGMEK LK PEKLRGAP+R  KDKYCRFHRE GHNTSD  ELK QIEDLIQDGYFKKFVGKP+TSS EKKEERKRSR PPRRTDRPAVINTIF
Subjt:  ILTNIEKSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIF

Query:  GGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPL
        GGPSGGQSGHKRK+LARAARREVCIIREQRPTCPITFD  DL EVHLPHNDALVIAPLIDHVVV+ VL+DGGASANILSLPTYLALGWTRSQL KSPTPL
Subjt:  GGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPL

Query:  VGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGR AYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVGTVRGE+TASRECYAS LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALE

Query:  TLAGRDGTLEFEADLPRREFAAPTEELKLVPLLTSAYETDL
        TL  RDGTLEFEADLP REFAAP EEL+LVPLL+   +  L
Subjt:  TLAGRDGTLEFEADLPRREFAAPTEELKLVPLLTSAYETDL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.4e-23460.3Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKNGARGPAPAPPSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKNGARGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEEMCNEMILAAGAGSRSENRMTRIDIREQRGFHLGLVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMCNEMILAAGAGSRSENRMTRIDIREQRGFHLGLVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASNAIKFRAFQIA
        P GVITR EFDQL+ K DAQVEALKA+                        + IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA++AIK  AFQIA
Subjt:  PAGVITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASNAIKFRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEASA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEA A
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEASA

Query:  TFVE----------------------------ARSGKDIEKADPKSKDKG-SFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKQLK
        TF E                             R+GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE++GMEK LK
Subjt:  TFVE----------------------------ARSGKDIEKADPKSKDKG-SFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKQLK

Query:  HPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
         PEKLRG P++   DKYCRFHR+ GHNTS+  ELK QIEDLIQDGYFKKFVGKP+++S EKKEERKR R PPRR DRPAVIN             K+KEL
Subjt:  HPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCID
        AR ARREVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+V+ +L+DGGASANILSL TYLALGWTRSQL KSPTPLVGFSGES+  EGCID
Subjt:  ARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCID

Query:  LPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALETLAGRD
        LPV++ QD TQVTQMAEFVVIDGR AYNAIFGRPIIHSFRA+PSTLHQ+LKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  LPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]2.0e-19369.63Show/hide
Query:  MDFQAASNAIKFRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA++AIK RAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASNAIKFRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEASATFVE-------------------ARSGKDIE---------KADPKSKDKGSFSS-DRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEA  TFVE                    R  K I+         KAD KS+DKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEASATFVE-------------------ARSGKDIE---------KADPKSKDKGSFSS-DRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEKSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVIN
        ISEILTNIE+SGMEK LK PEKLRG  ++  K+KYCRFHR+ GHNT+ C ELK QIEDLIQDGYFKKFVGKP+++S EKKEERKRSR PPRR DRPAVIN
Subjt:  ISEILTNIEKSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF   DLE VHLPHNDALVIA LIDH +V+ VLIDG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEFVVIDGR AYNAIFGRPIIHSFRA+PSTLHQ+LKYSTPN VG VRGE+  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFEADLP---RREFAAPTEELKLVPLLT
        ALE    R    E EADLP   +R+F  PTEEL+LVPLL+
Subjt:  ALETLAGRDGTLEFEADLP---RREFAAPTEELKLVPLLT

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088135.6e-23482.77Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASN
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAK                        +PIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAAS+
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASN

Query:  AIKFRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIK RAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKFRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEASATFVE----------------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEA ATF E                             RSGKDIE ADPKSKDKGSFSS RAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEASATFVE----------------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  KSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGG
        +SGMEK LK PEKLRGAP+R  KDKYCRFHRE GHNTSD  ELK QIE+LIQDGYFKKFVGKP+TSS EKKEERKRSR PPRRTDRPAVINTIFGGPSGG
Subjt:  KSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTCPITFD  DLEEVHLPHNDALVIAPLIDHVVV  VL+DGG SANILSLPTYLALGWTRSQL KSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188232.4e-23773.38Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAK---------------SPIPPK-FKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASNAIKF
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAK               SP      +APTVK YDGSKDPKDYVEVFEGLMDFQAAS+AIK 
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAK---------------SPIPPK-FKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASNAIKF

Query:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL
        RAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLADEALTVKL
Subjt:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL

Query:  GEEASATFVE----------------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGM
        G+EA ATF E                             RSGKD EKAD KSKDKGSFSS RAE+RRA NGPTRSRPYERFTPTTIPISEILTNIE+SGM
Subjt:  GEEASATFVE----------------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGM

Query:  EKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGGQSGH
        EK LK PEKLRGAP+R  KDKYCRFHRE  HNTSD  ELK QIEDLIQD YFKKFVGKP+TSS EKKEERK SR P RR DRPAVINTIFGGPSGGQSGH
Subjt:  EKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGGQSGH

Query:  KRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIP
        KRKELARAARREVCIIREQRPTCPITFDS DLEEVHLPHNDALVIAPLIDHVVV+ VL+D G SANI+SL TYLALGWTRSQL KS TPLVGFS ESVIP
Subjt:  KRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIP

Query:  EGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALETLAGRDGTLE
        EGCIDLPVTLG DQTQVTQMAEFVVIDGR AYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVG VRGE+ ASRECYASALKGSSVCALETL  RDGTLE
Subjt:  EGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALETLAGRDGTLE

Query:  FEADLPRREFAAPTEELKLVPLLTSAYETDL
        F+A+LPRREFAAPTEEL+LVPLL   Y  ++
Subjt:  FEADLPRREFAAPTEELKLVPLLTSAYETDL

A0A6J1DD03 uncharacterized protein LOC1110198996.0e-20483.9Show/hide
Query:  MCYFLTGLADEALTVKLGEEASATFVE---------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALTVKL EEA ATF E                      RSGKD+E  DPKSKDKGSFS+ RAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTVKLGEEASATFVE---------------------ARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEKSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIF
        ILTNIE+SGMEK LK PEKLRGAP+R  KDKYCRFHRE GHNTSD  ELK QIEDLIQDGYFKKFVGKP+TSS EKKEERKRSR PPRRTDRPAVINTIF
Subjt:  ILTNIEKSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIF

Query:  GGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPL
        GGPSGGQSGHKRK+LARAARREVCIIREQRPTCPITFD  DL EVHLPHNDALVIAPLIDHVVV+ VL+DGGASANILSLPTYLALGWTRSQL KSPTPL
Subjt:  GGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPL

Query:  VGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGR AYNAIFGRPIIHSFRAIPSTLHQ+LKYSTPNGVGTVRGE+TASRECYAS LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALE

Query:  TLAGRDGTLEFEADLPRREFAAPTEELKLVPLLTSAYETDL
        TL  RDGTLEFEADLP REFAAP EEL+LVPLL+   +  L
Subjt:  TLAGRDGTLEFEADLPRREFAAPTEELKLVPLLTSAYETDL

A0A6J1DHB3 uncharacterized protein LOC1110204796.6e-23560.3Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKNGARGPAPAPPSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKNGARGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEEMCNEMILAAGAGSRSENRMTRIDIREQRGFHLGLVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
                                                                                                   AESS+NP T
Subjt:  TQMRSMEEMCNEMILAAGAGSRSENRMTRIDIREQRGFHLGLVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASNAIKFRAFQIA
        P GVITR EFDQL+ K DAQVEALKA+                        + IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA++AIK  AFQIA
Subjt:  PAGVITRAEFDQLRGKLDAQVEALKAK------------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASNAIKFRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEASA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEA A
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEASA

Query:  TFVE----------------------------ARSGKDIEKADPKSKDKG-SFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKQLK
        TF E                             R+GKD  KAD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE++GMEK LK
Subjt:  TFVE----------------------------ARSGKDIEKADPKSKDKG-SFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKSGMEKQLK

Query:  HPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
         PEKLRG P++   DKYCRFHR+ GHNTS+  ELK QIEDLIQDGYFKKFVGKP+++S EKKEERKR R PPRR DRPAVIN             K+KEL
Subjt:  HPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCID
        AR ARREVCIIREQRPT  I F+  DLE VHLPHNDALVIAPLID V+V+ +L+DGGASANILSL TYLALGWTRSQL KSPTPLVGFSGES+  EGCID
Subjt:  ARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCID

Query:  LPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALETLAGRD
        LPV++ QD TQVTQMAEFVVIDGR AYNAIFGRPIIHSFRA+PSTLHQ+LKYST NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  LPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALETLAGRD

A0A6J1DZB9 uncharacterized protein LOC1110249049.6e-19469.63Show/hide
Query:  MDFQAASNAIKFRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA++AIK RAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASNAIKFRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEASATFVE-------------------ARSGKDIE---------KADPKSKDKGSFSS-DRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEA  TFVE                    R  K I+         KAD KS+DKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEASATFVE-------------------ARSGKDIE---------KADPKSKDKGSFSS-DRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEKSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVIN
        ISEILTNIE+SGMEK LK PEKLRG  ++  K+KYCRFHR+ GHNT+ C ELK QIEDLIQDGYFKKFVGKP+++S EKKEERKRSR PPRR DRPAVIN
Subjt:  ISEILTNIEKSGMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF   DLE VHLPHNDALVIA LIDH +V+ VLIDG                          
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVC
                      GCIDLPVT+GQD TQVTQMAEFVVIDGR AYNAIFGRPIIHSFRA+PSTLHQ+LKYSTPN VG VRGE+  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFEADLP---RREFAAPTEELKLVPLLT
        ALE    R    E EADLP   +R+F  PTEEL+LVPLL+
Subjt:  ALETLAGRDGTLEFEADLP---RREFAAPTEELKLVPLLT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAACGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTGT
AACGAGATGATACTAGCTGCAGGTGCAGGGTCTCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTTCCACCTCGGCCTAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGCGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCGGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAAGTCTTTGAAGG
CCTCATGGATTTCCAAGCGGCATCAAACGCAATCAAATTCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCT
CGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACG
CTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCAC
GGTGAAACTTGGAGAGGAGGCCTCGGCCACCTTCGTCGAGGCCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAAGACAAGGGATCCTTTTCCAGCGACC
GAGCTGAGTATCGAAGGGCAGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACAATTCCAATTTCCGAGATCCTAACGAACATCGAGAAGTCT
GGAATGGAAAAACAACTCAAGCATCCTGAGAAGCTTCGGGGAGCTCCGAAAAGGCACGTCAAGGACAAGTATTGCCGCTTCCATCGGGAGGACGGCCATAACACGTCGGA
CTGCTCGGAGTTGAAGTGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAAGACCAGCTCGACAGAGAAAAAGGAAGAGCGGAAGC
GTTCGAGGATGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGCTAGCTCGTGCA
GCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCCATCACCTTCGACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGACGCACTTGTGAT
CGCTCCCTTGATTGATCATGTGGTGGTCAAGTGGGTGCTGATAGACGGAGGCGCATCTGCTAACATCCTATCCTTACCGACCTACCTCGCCCTGGGTTGGACGAGGTCGC
AATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACC
CAAATGGCCGAGTTCGTGGTAATTGACGGTAGATTGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAATTTT
GAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACGGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCTGTCTGCGCCCTCGAAACTC
TCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTACCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCAAGCTGGTTCCTCTGCTTACATCGGCGTACGAG
ACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCAGCGCTCCAGAGTCCTCATGGTTGGACCCGATTGC
GGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCACGGTTCGTGGTCCGAGGTGGAGCATTGTATCGACGCGGCTTTT
CCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAATGTCCAGACATTACCCCGCCCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTAAGGAGG
GTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGTTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGACGTCCT
CGCGTACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAACGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTGT
AACGAGATGATACTAGCTGCAGGTGCAGGGTCTCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTTCCACCTCGGCCTAGTCGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGCGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCGGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAAGTCTTTGAAGG
CCTCATGGATTTCCAAGCGGCATCAAACGCAATCAAATTCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCT
CGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACG
CTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCAC
GGTGAAACTTGGAGAGGAGGCCTCGGCCACCTTCGTCGAGGCCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAAGACAAGGGATCCTTTTCCAGCGACC
GAGCTGAGTATCGAAGGGCAGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACAATTCCAATTTCCGAGATCCTAACGAACATCGAGAAGTCT
GGAATGGAAAAACAACTCAAGCATCCTGAGAAGCTTCGGGGAGCTCCGAAAAGGCACGTCAAGGACAAGTATTGCCGCTTCCATCGGGAGGACGGCCATAACACGTCGGA
CTGCTCGGAGTTGAAGTGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAAGACCAGCTCGACAGAGAAAAAGGAAGAGCGGAAGC
GTTCGAGGATGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGCTAGCTCGTGCA
GCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCCATCACCTTCGACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGACGCACTTGTGAT
CGCTCCCTTGATTGATCATGTGGTGGTCAAGTGGGTGCTGATAGACGGAGGCGCATCTGCTAACATCCTATCCTTACCGACCTACCTCGCCCTGGGTTGGACGAGGTCGC
AATTGACGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACC
CAAATGGCCGAGTTCGTGGTAATTGACGGTAGATTGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAATTTT
GAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACGGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCTGTCTGCGCCCTCGAAACTC
TCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTACCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCAAGCTGGTTCCTCTGCTTACATCGGCGTACGAG
ACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCAGCGCTCCAGAGTCCTCATGGTTGGACCCGATTGC
GGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCACGGTTCGTGGTCCGAGGTGGAGCATTGTATCGACGCGGCTTTT
CCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAATGTCCAGACATTACCCCGCCCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTAAGGAGG
GTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGTTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGACGTCCT
CGCGTACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKNGARGPAPAPPSENFDALQREMEAMRTQMRSMEEMC
NEMILAAGAGSRSENRMTRIDIREQRGFHLGLVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQ
VEALKAKSPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASNAIKFRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGET
LREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEASATFVEARSGKDIEKADPKSKDKGSFSSDRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEKS
GMEKQLKHPEKLRGAPKRHVKDKYCRFHREDGHNTSDCSELKCQIEDLIQDGYFKKFVGKPKTSSTEKKEERKRSRMPPRRTDRPAVINTIFGGPSGGQSGHKRKELARA
ARREVCIIREQRPTCPITFDSTDLEEVHLPHNDALVIAPLIDHVVVKWVLIDGGASANILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVT
QMAEFVVIDGRLAYNAIFGRPIIHSFRAIPSTLHQILKYSTPNGVGTVRGERTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRREFAAPTEELKLVPLLTSAYE
TDLARSVPVEILDNPSISEPDLMEISAPESSWLDPIADFIRGNSPQDPKERRKLARRAARFVVRGGALYRRGFSLPLLRCLTPEEGLMSRHYPARVRPRTFQVGHLVLRR
VQTHVGALDPTWEGPFEFKGIVRPGTYVLADLKGDVLAYPWNAEHLKRYYP