; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g18340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g18340
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:13878245..13882780
RNA-Seq ExpressionMoc08g18340
SyntenyMoc08g18340
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]4.8e-26389.77Show/hide
Query:  QAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNP TPAG+ITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGR GKD+E  DPKSKDKGSFSSGRAEYRRAEN PTRSRPYERFTPT IPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEILTNIE

Query:  ESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGME                        +  HGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTC ITFDGADLEEVHLP NDALVIAPLIDHVVV RVLVDGG SANI+SLPTYLALGWTR+QLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGE

Query:  LVVPEGCIDLPVTLGQDQTRVTQMAEFV
         V+PEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  LVVPEGCIDLPVTLGQDQTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]6.5e-26078.29Show/hide
Query:  SSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NP TP G+ITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGR GKD EK D KSKDKGSFSSGRAE+RRA N PTRSRPYERFTPT IPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEIL

Query:  TNIEESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG
        TNIEESGME                        +  H HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQRPTC ITFD ADLEEVHLP NDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTR+QLKKS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVG

Query:  FSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETP
        FS E V+PEGCIDLPVTLG DQT+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALET 
Subjt:  FSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETP

Query:  TSRDGTVGFEADLPRREFAAPTEELELVPLL
         SRDGT+ F+A+LPRREFAAPTEELELVPLL
Subjt:  TSRDGTVGFEADLPRREFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]8.9e-20986.38Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GR GKDME TDPKSKDKGSFS+GRAEYRRAEN PTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTP

Query:  TAIPISEILTNIEESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        T IPISEILTNIEESGME                        +  HGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TAIPISEILTNIEESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQRPTC ITFD ADL EVHLP NDALVIAPLIDHVVVRRVLVDGGASANI+SLPTYLALGWTR+QL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQL

Query:  KKSPTPLVGFSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGE VVPEGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETPTSRDGTVGFEADLPRREFAAPTEELELVPLLSPEKQGQL
        +SVCALET TSRDGT+ FEADLP REFAAP EELELVPLLS EKQ QL
Subjt:  SSVCALETPTSRDGTVGFEADLPRREFAAPTEELELVPLLSPEKQGQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]4.1e-26264.48Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATKPLRRSARITAPVLPPAHPRTSKATRGRGGTSRKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L T+PL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATKPLRRSARITAPVLPPAHPRTSKATRGRGGTSRKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENQVTSVDVREQRGSHLGPVEEERPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPVT
                                                                                                   AESS NP+T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENQVTSVDVREQRGSHLGPVEEERPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPVT

Query:  PAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
        P G+ITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEALIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Subjt:  PAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKG-SFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEILTNIEESGMEN---
        TFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR GKD  K D KS+DKG S SS R +YRR+ +   +SRPYE +TPT IPI EILTNIEE+GME    
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKG-SFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEILTNIEESGMEN---

Query:  --------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
                            + +HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KEL
Subjt:  --------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGELVVPEGCID
        AR ARREVCIIREQRPT SI F+ ADLE VHLP NDALVIAPLID V+VRR+LVDGGASANI+SL TYLALGWTR+QLKKSPTPLVGFSGE +  EGCID
Subjt:  ARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGELVVPEGCID

Query:  LPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETPTSRD
        LPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE  T RD
Subjt:  LPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETPTSRD

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]4.7e-21866.26Show/hide
Query:  EQRGSHLGPVEEERPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQ
        E+    + P   E   ++E   ++ +  DLR+HL  K+  +  + +   S SR   +SN +A+S   P+ P  +I REEFD ++ + D QVEALKA+CE+
Subjt:  EQRGSHLGPVEEERPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQ

Query:  KEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KE P +D DLGESPFTSD++EA IPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FS RHYD+KTATHLATIRQKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRRGKDMEKTDPKSKDKGSFSSG-RAEYRRAENEPTRSRPYERFTPTAIPISEILTNIEESGMENYSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGK
         + R  +   K D KSKDKGS SSG R EYRR+E+ P+RSRPYER                                CWELKRQIEDLIQD YFKKFVGK
Subjt:  GRGRRGKDMEKTDPKSKDKGSFSSG-RAEYRRAENEPTRSRPYERFTPTAIPISEILTNIEESGMENYSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGK

Query:  PRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVL
        PR++S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQ  +KRKELA  ARR+V IIREQ+PTCSITF   DLE VHLP NDALVIAPLIDHV+VRRVL
Subjt:  PRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVL

Query:  VDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYS
        VDGGASANI+SLPTYLAL  TR+QLKKSPTPLVGFS E V PEGCIDLPVT+GQD T+VTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQVLKYS
Subjt:  VDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYS

Query:  TPNGVGTVRGEQTASRECYASALKGSSVCALETPTSRDGTVGFEADLPR
        TPNGVGTVRGEQ  SRECYASALK SSVCALE  TS+D       DLPR
Subjt:  TPNGVGTVRGEQTASRECYASALKGSSVCALETPTSRDGTVGFEADLPR

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.3e-26389.77Show/hide
Query:  QAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNP TPAG+ITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGR GKD+E  DPKSKDKGSFSSGRAEYRRAEN PTRSRPYERFTPT IPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEILTNIE

Query:  ESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGME                        +  HGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTC ITFDGADLEEVHLP NDALVIAPLIDHVVV RVLVDGG SANI+SLPTYLALGWTR+QLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGE

Query:  LVVPEGCIDLPVTLGQDQTRVTQMAEFV
         V+PEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  LVVPEGCIDLPVTLGQDQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188233.1e-26078.29Show/hide
Query:  SSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NP TP G+ITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGR GKD EK D KSKDKGSFSSGRAE+RRA N PTRSRPYERFTPT IPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEIL

Query:  TNIEESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG
        TNIEESGME                        +  H HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVG
        PSGGQSGHKRKELARAARREVCIIREQRPTC ITFD ADLEEVHLP NDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTR+QLKKS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVG

Query:  FSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETP
        FS E V+PEGCIDLPVTLG DQT+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALET 
Subjt:  FSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETP

Query:  TSRDGTVGFEADLPRREFAAPTEELELVPLL
         SRDGT+ F+A+LPRREFAAPTEELELVPLL
Subjt:  TSRDGTVGFEADLPRREFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198994.3e-20986.38Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GR GKDME TDPKSKDKGSFS+GRAEYRRAEN PTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTP

Query:  TAIPISEILTNIEESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
        T IPISEILTNIEESGME                        +  HGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP
Subjt:  TAIPISEILTNIEESGMEN-----------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQRPTC ITFD ADL EVHLP NDALVIAPLIDHVVVRRVLVDGGASANI+SLPTYLALGWTR+QL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQL

Query:  KKSPTPLVGFSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGE VVPEGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETPTSRDGTVGFEADLPRREFAAPTEELELVPLLSPEKQGQL
        +SVCALET TSRDGT+ FEADLP REFAAP EELELVPLLS EKQ QL
Subjt:  SSVCALETPTSRDGTVGFEADLPRREFAAPTEELELVPLLSPEKQGQL

A0A6J1DHB3 uncharacterized protein LOC1110204792.0e-26264.48Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATKPLRRSARITAPVLPPAHPRTSKATRGRGGTSRKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L T+PL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATKPLRRSARITAPVLPPAHPRTSKATRGRGGTSRKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENQVTSVDVREQRGSHLGPVEEERPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPVT
                                                                                                   AESS NP+T
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENQVTSVDVREQRGSHLGPVEEERPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPVT

Query:  PAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
        P G+ITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEALIPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Subjt:  PAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKG-SFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEILTNIEESGMEN---
        TFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR GKD  K D KS+DKG S SS R +YRR+ +   +SRPYE +TPT IPI EILTNIEE+GME    
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDMEKTDPKSKDKG-SFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEILTNIEESGMEN---

Query:  --------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
                            + +HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN             K+KEL
Subjt:  --------------------YSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGELVVPEGCID
        AR ARREVCIIREQRPT SI F+ ADLE VHLP NDALVIAPLID V+VRR+LVDGGASANI+SL TYLALGWTR+QLKKSPTPLVGFSGE +  EGCID
Subjt:  ARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGELVVPEGCID

Query:  LPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETPTSRD
        LPV++ QD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE  T RD
Subjt:  LPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETPTSRD

A0A6J1DPC9 uncharacterized protein LOC1110222802.3e-21866.26Show/hide
Query:  EQRGSHLGPVEEERPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQ
        E+    + P   E   ++E   ++ +  DLR+HL  K+  +  + +   S SR   +SN +A+S   P+ P  +I REEFD ++ + D QVEALKA+CE+
Subjt:  EQRGSHLGPVEEERPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQ

Query:  KEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
        KE P +D DLGESPFTSD++EA IPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLPARSISTYSQLR+EF+ Q
Subjt:  KEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FS RHYD+KTATHLATIRQKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT RPE++I
Subjt:  FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRRGKDMEKTDPKSKDKGSFSSG-RAEYRRAENEPTRSRPYERFTPTAIPISEILTNIEESGMENYSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGK
         + R  +   K D KSKDKGS SSG R EYRR+E+ P+RSRPYER                                CWELKRQIEDLIQD YFKKFVGK
Subjt:  GRGRRGKDMEKTDPKSKDKGSFSSG-RAEYRRAENEPTRSRPYERFTPTAIPISEILTNIEESGMENYSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGK

Query:  PRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVL
        PR++S EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQ  +KRKELA  ARR+V IIREQ+PTCSITF   DLE VHLP NDALVIAPLIDHV+VRRVL
Subjt:  PRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVL

Query:  VDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYS
        VDGGASANI+SLPTYLAL  TR+QLKKSPTPLVGFS E V PEGCIDLPVT+GQD T+VTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQVLKYS
Subjt:  VDGGASANIMSLPTYLALGWTRTQLKKSPTPLVGFSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYS

Query:  TPNGVGTVRGEQTASRECYASALKGSSVCALETPTSRDGTVGFEADLPR
        TPNGVGTVRGEQ  SRECYASALK SSVCALE  TS+D       DLPR
Subjt:  TPNGVGTVRGEQTASRECYASALKGSSVCALETPTSRDGTVGFEADLPR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAAAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCGCCTGCGCACCCGAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
GGAAGGGCGCCCGGGGTCCAGCCCCAGCTCCAACGAGTGAGAACTTTGATGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCAAGTGACGAGCGTTGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAAAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGTAACTCCTGCAGGAATGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGTGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACTAATCCCTCCGAA
GTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTT
GAAGGTCGCACACTGCTCCGACGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCG
AGGTGCTTCAGAAAGCGAAGAAAGTCATCGATGGACAAGAGCTCCTTCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGGAGGGGAAAAGATATGGAA
AAGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGGAGGGCGGAGAACGAACCTACCAGGAGCCGACCTTACGAACGCTTCACCCC
GACCGCGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAACTACTCAAACCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAA
TTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCTAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGACGC
ACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGTATCAT
CAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTTCACCTGCCCCGCAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGG
TGGTCAGGAGGGTGCTGGTAGACGGAGGTGCATCTGCTAACATCATGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGACGCAATTGAAGAAAAGCCCGACACCG
CTGGTTGGGTTCTCCGGAGAATTAGTCGTCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCGGGTCACCCAAATGGCCGAGTTCGTGGTGAT
TGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCG
TGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCCCACCAGTAGGGATGGGACGGTC
GGGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGGGCAACTCACCACAAGACCCCA
AGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCGTTGTACCGACGCGGCTTTTCCCTGCCTTTATTGAGATGCCTAACCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAAAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCGCCTGCGCACCCGAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
GGAAGGGCGCCCGGGGTCCAGCCCCAGCTCCAACGAGTGAGAACTTTGATGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCAAGTGACGAGCGTTGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAAAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGTAACTCCTGCAGGAATGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGTGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACTAATCCCTCCGAA
GTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTT
GAAGGTCGCACACTGCTCCGACGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCG
AGGTGCTTCAGAAAGCGAAGAAAGTCATCGATGGACAAGAGCTCCTTCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGGAGGGGAAAAGATATGGAA
AAGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGGAGGGCGGAGAACGAACCTACCAGGAGCCGACCTTACGAACGCTTCACCCC
GACCGCGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAACTACTCAAACCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAA
TTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCTAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGACGC
ACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGTATCAT
CAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTTCACCTGCCCCGCAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGG
TGGTCAGGAGGGTGCTGGTAGACGGAGGTGCATCTGCTAACATCATGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGACGCAATTGAAGAAAAGCCCGACACCG
CTGGTTGGGTTCTCCGGAGAATTAGTCGTCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCGGGTCACCCAAATGGCCGAGTTCGTGGTGAT
TGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCG
TGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCCCACCAGTAGGGATGGGACGGTC
GGGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGGGCAACTCACCACAAGACCCCA
AGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCGTTGTACCGACGCGGCTTTTCCCTGCCTTTATTGAGATGCCTAACCCCTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATKPLRRSARITAPVLPPAHPRTSKATRGRGGTSRKGARGPAPAPTSENFDALQREMEAMRTQMRSMEEMY
NEMILAAGAGSRSENQVTSVDVREQRGSHLGPVEEERPEDNESEGHTRQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQ
VEALKAKCEQKEGPLNDGDLGESPFTSDVLEALIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQ
FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRRGKDME
KTDPKSKDKGSFSSGRAEYRRAENEPTRSRPYERFTPTAIPISEILTNIEESGMENYSNHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRR
TDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPRNDALVIAPLIDHVVVRRVLVDGGASANIMSLPTYLALGWTRTQLKKSPTP
LVGFSGELVVPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETPTSRDGTV
GFEADLPRREFAAPTEELELVPLLSPEKQGQLTTRPQGAQKVGKASSSVRGPRWSVVPTRLFPAFIEMPNP