; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g09450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g09450
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:7269325..7274925
RNA-Seq ExpressionMoc07g09450
SyntenyMoc07g09450
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]7.6e-24686.8Show/hide
Query:  VITREEFDQLKGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQASSDAIKCRAFQIALTG
        VITREEFDQL+GQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVK YDG KDPKDYVEVFE LMDFQA+SDAIKCRAF+IALTG
Subjt:  VITREEFDQLKGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQASSDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLADEALTVKLGEEAPTTFA
        SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADEALTVKLGEEAP TFA
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLADEALTVKLGEEAPTTFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK
        EVLQKAKKVIDGQELLRTKTGRPERKIG+GRSGKDIE ADPKS+DKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK

Query:  LRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK-------------------------SGSVRGR-RPDALTDL
        LRG PERR+KDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSS EK++                         SG   GR R +     
Subjt:  LRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK-------------------------SGSVRGR-RPDALTDL

Query:  RREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPQGCINLPVT
        RREVCIIREQRPTCPITF+  DLEEVHLPHNDALVIAPLI+HVVV RVLVDGG S NILSLPTYLALGWTRSQL KSPTPLVGFSGESVIP+G I+LPVT
Subjt:  RREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPQGCINLPVT

Query:  LGQDQTQVTRMAEFV
        LGQDQTQVT+MAEFV
Subjt:  LGQDQTQVTRMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]3.2e-24471.8Show/hide
Query:  SVNTSTEREAHLSEKDRVITREEFDQLKGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQ
        S N   E   + +  D VITREEFDQL+G+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDG KDPKDYVEVFEGLMDFQ
Subjt:  SVNTSTEREAHLSEKDRVITREEFDQLKGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQ

Query:  ASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLA
        A+SDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLT LA
Subjt:  ASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLA

Query:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAP TFAEVLQKAKKVIDGQELLRTKTGRPER I +GRSGKD EKAD KS+DKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK------------------------
        TNIEESGMEKLLKRPEKLRG PERRNKDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EK++                        
Subjt:  TNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK------------------------

Query:  -SGSVRG-RRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVG
         SG   G +R +     RREVCIIREQRPTCPITF+S DLEEVHLPHNDALVIAPLI+HVVVRRVLVD G S NI+SL TYLALGWTRSQL KS TPLVG
Subjt:  -SGSVRG-RRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVG

Query:  FSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVCALETL
        FS ESVIP+GCI+LPVTLG DQTQVT+MAEFVVIDGRSAYNAIFGRP+IHSFRAIPSTLHQ+LKYS PNGVG VRGEQ AS ECYASALKGSSVCALETL
Subjt:  FSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVCALETL

Query:  AGRDGTLEFEVDLPRREFAAPTEELELVPLLSPKKQLASAYETDLARSVTVEILDN
          RDGTLEF+ +LPRREFAAPTEELELVPLL  K      +E +L    ++  +D+
Subjt:  AGRDGTLEFEVDLPRREFAAPTEELELVPLLSPKKQLASAYETDLARSVTVEILDN

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]6.1e-19580.76Show/hide
Query:  MCYFLTCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLT LADEALTVKL EEAP TFAEVLQKAKKVIDGQELLRT       KIGQGRSGKD+E  DPKS+DKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRKSGSVRGRRPDALTD-
        TTIPISEILTNIEESGMEKLLKRPEKLRG PERR+KDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS EK++    R R P   TD 
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRKSGSVRGRRPDALTD-

Query:  --------------------------LRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQ
                                   RREVCIIREQRPTCPITF+  DL EVHLPHNDALVIAPLI+HVVVRRVLVDGGAS NILSLPTYLALGWTRSQ
Subjt:  --------------------------LRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQ

Query:  LTKSPTPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALK
        L KSPTPLVGFSGESV+P+GCI+LPVTLGQDQT+VT+MAEFVV+DGRSAYNAIFGRP+IHSFRAIPSTLHQ+LKYS PNGVGTVRGEQTAS ECYAS LK
Subjt:  LTKSPTPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALK

Query:  GSSVCALETLAGRDGTLEFEVDLPRREFAAPTEELELVPLLSPKKQL
        G+SVCALETL  RDGTLEFE DLP REFAAP EELELVPLLS +KQ+
Subjt:  GSSVCALETLAGRDGTLEFEVDLPRREFAAPTEELELVPLLSPKKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.2e-25966.05Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITASVLPPAHPRTSKATRGRGGTSKRGARGPAPAPPSENFNALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT  VLPPAHP+ SKA                                   
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITASVLPPAHPRTSKATRGRGGTSKRGARGPAPAPPSENFNALQREMEAMR

Query:  RQMRSMEEMYNEMILAAGAGSRSENLMTRIDIREQRGSHLGPVEEEHPEDNVARDTLAREETSVNTSTEREAHLSEKDRVITREEFDQLKGQLDAQVEAL
                                                                    E+S N  T           VITREEFDQLK + DAQVEAL
Subjt:  RQMRSMEEMYNEMILAAGAGSRSENLMTRIDIREQRGSHLGPVEEEHPEDNVARDTLAREETSVNTSTEREAHLSEKDRVITREEFDQLKGQLDAQVEAL

Query:  KAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR
        KA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+K YDG KDPKDYVEVFE LMDFQA++DAIKC AFQIALTGSARLWYRRLPAR ISTYSQLR
Subjt:  KAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR

Query:  REFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTG
        +EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLT LADE LTVKL EEAP TFAEVLQK KKVIDGQELLRTKTG
Subjt:  REFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTG

Query:  RPERKIGQGRSGKDIEKADPKSRDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREH
        RPE+ I QGR+GKD  KAD KSRDKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+RN DKYCRFHR+H
Subjt:  RPERKIGQGRSGKDIEKADPKSRDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREH

Query:  GHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRKS-------------GSVRGRRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDA
        GHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++SVEK++               +V  ++ +   + RREVCIIREQRPT  I FN  DLE VHLPHNDA
Subjt:  GHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRKS-------------GSVRGRRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDA

Query:  LVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIH
        LVIAPLI+ V+VRR+LVDGGAS NILSL TYLALGWTRSQL KSPTPLVGFSGES+  +GCI+LPV++ QD TQVT+MAEFVVIDGRSAYNAIFGRP+IH
Subjt:  LVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIH

Query:  SFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVCALETLAGRD
        SFRA+PSTLHQ+LKYS  NGVGTVRGE   S ECYAS  K SSVCALE    RD
Subjt:  SFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVCALETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]2.0e-19869.12Show/hide
Query:  MDFQASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQA++DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAPTTF EVLQKAKKVIDGQELLRTKTGRPE++I Q +  ++  KAD KSRDKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK-------------------
        ISEILTNIEESGMEKLLKRPEKLRG  E+RNK+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++SVEK++                   
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK-------------------

Query:  -------SGSVRGRRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSP
                G    +R +   + RREVCIIRE +PTC ITF   DLE VHLPHNDALVIA LI+H +VRRVL+DG                          
Subjt:  -------SGSVRGRRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVC
                      GCI+LPVT+GQD TQVT+MAEFVVIDGRSAYNAIFGRP+IHSFRA+PSTLHQ+LKYS PN VG VRGEQ  S ECYASALKGS+VC
Subjt:  TPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVC

Query:  ALETLAGRDGTLEFEVDLP---RREFAAPTEELELVPLLSPKKQ
        ALE    R    E E DLP   +R+F  PTEELELVPLLSP++Q
Subjt:  ALETLAGRDGTLEFEVDLP---RREFAAPTEELELVPLLSPKKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.7e-24686.8Show/hide
Query:  VITREEFDQLKGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQASSDAIKCRAFQIALTG
        VITREEFDQL+GQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVK YDG KDPKDYVEVFE LMDFQA+SDAIKCRAF+IALTG
Subjt:  VITREEFDQLKGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQASSDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLADEALTVKLGEEAPTTFA
        SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADEALTVKLGEEAP TFA
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLADEALTVKLGEEAPTTFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK
        EVLQKAKKVIDGQELLRTKTGRPERKIG+GRSGKDIE ADPKS+DKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK

Query:  LRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK-------------------------SGSVRGR-RPDALTDL
        LRG PERR+KDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSS EK++                         SG   GR R +     
Subjt:  LRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK-------------------------SGSVRGR-RPDALTDL

Query:  RREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPQGCINLPVT
        RREVCIIREQRPTCPITF+  DLEEVHLPHNDALVIAPLI+HVVV RVLVDGG S NILSLPTYLALGWTRSQL KSPTPLVGFSGESVIP+G I+LPVT
Subjt:  RREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPQGCINLPVT

Query:  LGQDQTQVTRMAEFV
        LGQDQTQVT+MAEFV
Subjt:  LGQDQTQVTRMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.5e-24471.8Show/hide
Query:  SVNTSTEREAHLSEKDRVITREEFDQLKGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQ
        S N   E   + +  D VITREEFDQL+G+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDG KDPKDYVEVFEGLMDFQ
Subjt:  SVNTSTEREAHLSEKDRVITREEFDQLKGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQ

Query:  ASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLA
        A+SDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLT LA
Subjt:  ASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLA

Query:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAP TFAEVLQKAKKVIDGQELLRTKTGRPER I +GRSGKD EKAD KS+DKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK------------------------
        TNIEESGMEKLLKRPEKLRG PERRNKDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSS EK++                        
Subjt:  TNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK------------------------

Query:  -SGSVRG-RRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVG
         SG   G +R +     RREVCIIREQRPTCPITF+S DLEEVHLPHNDALVIAPLI+HVVVRRVLVD G S NI+SL TYLALGWTRSQL KS TPLVG
Subjt:  -SGSVRG-RRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVG

Query:  FSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVCALETL
        FS ESVIP+GCI+LPVTLG DQTQVT+MAEFVVIDGRSAYNAIFGRP+IHSFRAIPSTLHQ+LKYS PNGVG VRGEQ AS ECYASALKGSSVCALETL
Subjt:  FSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVCALETL

Query:  AGRDGTLEFEVDLPRREFAAPTEELELVPLLSPKKQLASAYETDLARSVTVEILDN
          RDGTLEF+ +LPRREFAAPTEELELVPLL  K      +E +L    ++  +D+
Subjt:  AGRDGTLEFEVDLPRREFAAPTEELELVPLLSPKKQLASAYETDLARSVTVEILDN

A0A6J1DD03 uncharacterized protein LOC1110198992.9e-19580.76Show/hide
Query:  MCYFLTCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLT LADEALTVKL EEAP TFAEVLQKAKKVIDGQELLRT       KIGQGRSGKD+E  DPKS+DKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRKSGSVRGRRPDALTD-
        TTIPISEILTNIEESGMEKLLKRPEKLRG PERR+KDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS EK++    R R P   TD 
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRKSGSVRGRRPDALTD-

Query:  --------------------------LRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQ
                                   RREVCIIREQRPTCPITF+  DL EVHLPHNDALVIAPLI+HVVVRRVLVDGGAS NILSLPTYLALGWTRSQ
Subjt:  --------------------------LRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQ

Query:  LTKSPTPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALK
        L KSPTPLVGFSGESV+P+GCI+LPVTLGQDQT+VT+MAEFVV+DGRSAYNAIFGRP+IHSFRAIPSTLHQ+LKYS PNGVGTVRGEQTAS ECYAS LK
Subjt:  LTKSPTPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALK

Query:  GSSVCALETLAGRDGTLEFEVDLPRREFAAPTEELELVPLLSPKKQL
        G+SVCALETL  RDGTLEFE DLP REFAAP EELELVPLLS +KQ+
Subjt:  GSSVCALETLAGRDGTLEFEVDLPRREFAAPTEELELVPLLSPKKQL

A0A6J1DHB3 uncharacterized protein LOC1110204795.8e-26066.05Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITASVLPPAHPRTSKATRGRGGTSKRGARGPAPAPPSENFNALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT  VLPPAHP+ SKA                                   
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITASVLPPAHPRTSKATRGRGGTSKRGARGPAPAPPSENFNALQREMEAMR

Query:  RQMRSMEEMYNEMILAAGAGSRSENLMTRIDIREQRGSHLGPVEEEHPEDNVARDTLAREETSVNTSTEREAHLSEKDRVITREEFDQLKGQLDAQVEAL
                                                                    E+S N  T           VITREEFDQLK + DAQVEAL
Subjt:  RQMRSMEEMYNEMILAAGAGSRSENLMTRIDIREQRGSHLGPVEEEHPEDNVARDTLAREETSVNTSTEREAHLSEKDRVITREEFDQLKGQLDAQVEAL

Query:  KAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR
        KA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+K YDG KDPKDYVEVFE LMDFQA++DAIKC AFQIALTGSARLWYRRLPAR ISTYSQLR
Subjt:  KAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR

Query:  REFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTG
        +EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLT LADE LTVKL EEAP TFAEVLQK KKVIDGQELLRTKTG
Subjt:  REFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTG

Query:  RPERKIGQGRSGKDIEKADPKSRDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREH
        RPE+ I QGR+GKD  KAD KSRDKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+RN DKYCRFHR+H
Subjt:  RPERKIGQGRSGKDIEKADPKSRDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREH

Query:  GHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRKS-------------GSVRGRRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDA
        GHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++SVEK++               +V  ++ +   + RREVCIIREQRPT  I FN  DLE VHLPHNDA
Subjt:  GHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRKS-------------GSVRGRRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDA

Query:  LVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIH
        LVIAPLI+ V+VRR+LVDGGAS NILSL TYLALGWTRSQL KSPTPLVGFSGES+  +GCI+LPV++ QD TQVT+MAEFVVIDGRSAYNAIFGRP+IH
Subjt:  LVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIH

Query:  SFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVCALETLAGRD
        SFRA+PSTLHQ+LKYS  NGVGTVRGE   S ECYAS  K SSVCALE    RD
Subjt:  SFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVCALETLAGRD

A0A6J1DZB9 uncharacterized protein LOC1110249049.8e-19969.12Show/hide
Query:  MDFQASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQA++DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAPTTF EVLQKAKKVIDGQELLRTKTGRPE++I Q +  ++  KAD KSRDKGS SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK-------------------
        ISEILTNIEESGMEKLLKRPEKLRG  E+RNK+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++SVEK++                   
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRK-------------------

Query:  -------SGSVRGRRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSP
                G    +R +   + RREVCIIRE +PTC ITF   DLE VHLPHNDALVIA LI+H +VRRVL+DG                          
Subjt:  -------SGSVRGRRPDALTDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSP

Query:  TPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVC
                      GCI+LPVT+GQD TQVT+MAEFVVIDGRSAYNAIFGRP+IHSFRA+PSTLHQ+LKYS PN VG VRGEQ  S ECYASALKGS+VC
Subjt:  TPLVGFSGESVIPQGCINLPVTLGQDQTQVTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVC

Query:  ALETLAGRDGTLEFEVDLP---RREFAAPTEELELVPLLSPKKQ
        ALE    R    E E DLP   +R+F  PTEELELVPLLSP++Q
Subjt:  ALETLAGRDGTLEFEVDLP---RREFAAPTEELELVPLLSPKKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGTCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAGGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAAAACTTTAACGCGCTCCAGAGAGAAATGGAGGCAATGCGCAGACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCTAATGACACGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGTAGCGAGGGACACACTTGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGAGTGATTACAAGGG
AGGAGTTCGACCAGCTGAAGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCC
TTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCGTTACGATGGGTTGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGG
CCTCATGGATTTCCAAGCGTCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCT
CGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACG
CTGCGAGAATATGTCACCAGATTCCAAGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCTGTCTAGCCGACGAAGCCCTCAC
GGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGCCGAAGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAAC
GAAAGATCGGCCAGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAGGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAAC
GGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCC
TGAGAAGCTTCGGGGAACCCCGGAAAGGCGCAACAAAGACAAGTATTGCCGCTTCCATCGGGAGCATGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTG
AGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGTAGAGAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGACGCACTG
ACCGACCTGCGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCAACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACT
TGTGATCGCTCCCCTGATTAATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTACTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGA
GGTCGCAATTGACGAAAAGCCCGACACCGCTAGTTGGGTTCTCTGGAGAATCGGTCATCCCACAGGGTTGCATCAACTTGCCGGTCACACTTGGGCAGGACCAAACTCAG
GTCACCCGAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCGTCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCA
AATTTTGAAGTATTCCATCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGGGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCG
AAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGTCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTACTTAGTCCCAAG
AAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCACCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATAGAGATCGGCACTCCTGA
GTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTG
GAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGATGGTCCAAACCCATGTGGGTGCCCTTGATCCGACTTGGGAGGGC
CCGTTTGAGGTCAAGGGCATACTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCACACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCC
TTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGTCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAGGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAAAACTTTAACGCGCTCCAGAGAGAAATGGAGGCAATGCGCAGACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCTAATGACACGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCCCGAAGACAACGTAGCGAGGGACACACTTGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGAGTGATTACAAGGG
AGGAGTTCGACCAGCTGAAGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCC
TTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCGTTACGATGGGTTGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGG
CCTCATGGATTTCCAAGCGTCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCT
CGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACG
CTGCGAGAATATGTCACCAGATTCCAAGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCTGTCTAGCCGACGAAGCCCTCAC
GGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGCCGAAGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAAC
GAAAGATCGGCCAGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAGGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAAC
GGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCC
TGAGAAGCTTCGGGGAACCCCGGAAAGGCGCAACAAAGACAAGTATTGCCGCTTCCATCGGGAGCATGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTG
AGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGTAGAGAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGACGCACTG
ACCGACCTGCGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCAACAGTACAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGCACT
TGTGATCGCTCCCCTGATTAATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTACTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGA
GGTCGCAATTGACGAAAAGCCCGACACCGCTAGTTGGGTTCTCTGGAGAATCGGTCATCCCACAGGGTTGCATCAACTTGCCGGTCACACTTGGGCAGGACCAAACTCAG
GTCACCCGAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCGTCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCA
AATTTTGAAGTATTCCATCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGGGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCG
AAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGTCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTACTTAGTCCCAAG
AAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCACCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATAGAGATCGGCACTCCTGA
GTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTG
GAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGATGGTCCAAACCCATGTGGGTGCCCTTGATCCGACTTGGGAGGGC
CCGTTTGAGGTCAAGGGCATACTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCACACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCC
TTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITASVLPPAHPRTSKATRGRGGTSKRGARGPAPAPPSENFNALQREMEAMRRQMRSMEEMY
NEMILAAGAGSRSENLMTRIDIREQRGSHLGPVEEEHPEDNVARDTLAREETSVNTSTEREAHLSEKDRVITREEFDQLKGQLDAQVEALKAKCEQKEGPLNDGDLGESP
FTSDVLEAPIPPKFKAPTVKRYDGLKDPKDYVEVFEGLMDFQASSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGET
LREYVTRFQEEQLKVAHCSDDSAMCYFLTCLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSRDKGSFSSGRAEYRRAEN
GPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGTPERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSVEKRKSGSVRGRRPDAL
TDLRREVCIIREQRPTCPITFNSTDLEEVHLPHNDALVIAPLINHVVVRRVLVDGGASTNILSLPTYLALGWTRSQLTKSPTPLVGFSGESVIPQGCINLPVTLGQDQTQ
VTRMAEFVVIDGRSAYNAIFGRPVIHSFRAIPSTLHQILKYSIPNGVGTVRGEQTASGECYASALKGSSVCALETLAGRDGTLEFEVDLPRREFAAPTEELELVPLLSPK
KQLASAYETDLARSVTVEILDNPSISEPDLIEIGTPESSWMDPIADFIRGNSPQDPKERRKLARRAARFVVRGGALYRRGFSLPLLRCLTPEEGLMVQTHVGALDPTWEG
PFEVKGILRPGTYILADLKGDVLAHPWNAEHLKRYYP