CuGenDBv2

Moc07g03830 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g03830
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr7:3349831..3355424
RNA-Seq Expression: Moc07g03830
Synteny: Moc07g03830
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain; IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits (e value, %identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 8.1e-237; %identity: 83.71)
Query:  QAESSHN---PAGIITREEFDQLREELDAQVEALKAKY--------------------VLEAPISPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLR +LDAQVEALKAK                     VLEAPI PKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLREELDAQVEALKAKY--------------------VLEAPISPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E ADPKSKDKGSFSSG+AEYRRAE GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSG-
        ESGME+LLKRPEKL GAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP+RTDRPAVINTIFGGPSG 
Subjt:  ESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSG-

Query:  ----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGE
                                          DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGW RSQLK+SPTPLVGFSGE
Subjt:  ----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value: 4.7e-237; %identity: 70.85)
Query:  SSNQQAESSHNPA---GIITREEFDQLREELDAQVEALKAKY-----------VLEAPI-SPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKC
        SSNQQAESSHNPA   G+ITREEFDQLR +L+AQVEALKAK            + E+P  S   +APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKC
Subjt:  SSNQQAESSHNPA---GIITREEFDQLREELDAQVEALKAKY-----------VLEAPI-SPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKC

Query:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL
        RAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLADEALTVKL
Subjt:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL

Query:  GEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIEESGM
        G+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E+AD KSKDKGSFSSG+AE+RRA  GPTRSRPYERFTPTTIPISEILTNIEESGM
Subjt:  GEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIEESGM

Query:  EELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSG-----
        E+LLKRPEKL GAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP +R DRPAVINTIFGGPSG     
Subjt:  EELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSG-----

Query:  ------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGESVIP
                                      DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGW RSQLK+S TPLVGFS ESVIP
Subjt:  ------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGESVIP

Query:  EGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLE
        EGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL  RDGTLE
Subjt:  EGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLE

Query:  FEADLPRKEFAAPTEELELIPLLSPEKQLESAYETDLARSVPVEILDNPSILE--PDLMEVG
        F+A+LPR+EFAAPTEELEL+PLL  +      +E +L     +  +D+   +E  P+ + VG
Subjt:  FEADLPRKEFAAPTEELELIPLLSPEKQLESAYETDLARSVPVEILDNPSILE--PDLMEVG

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia] (e value: 4.5e-203; %identity: 83.67)
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+G+AEYRRAE GPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRP
        TTIPISEILTNIEESGME+LLKRPEKL GAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP+RTDRP
Subjt:  TTIPISEILTNIEESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRP

Query:  AVINTIFGGPSG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQL
        AVINTIFGGPSG                                   DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGW RSQL
Subjt:  AVINTIFGGPSG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVGTVRGEQTASRECYAS LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGTLEFEADLPRKEFAAPTEELELIPLLSPEKQLE
        +SVCALETL  RDGTLEFEADLP +EFAAP EELEL+PLLS EKQ++
Subjt:  SSVCALETLAGRDGTLEFEADLPRKEFAAPTEELELIPLLSPEKQLE

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 1.6e-245; %identity: 62.29)
Query:  MIQPANSTNTTDRRSLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARDPAPAPTSENFDALKREMEAIR
        M+QPANSTNT DRR+LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MIQPANSTNTTDRRSLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARDPAPAPTSENFDALKREMEAIR

Query:  TQMHFMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNEREGYTRQGGDLLEHLNRKRDSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMHFMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNEREGYTRQGGDLLEHLNRKRDSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLREELDAQVEALKAKY--------------------VLEAPISPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+                     +LEA I PKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLREELDAQVEALKAKY--------------------VLEAPISPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIEESGMEELLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  +AD KS+DKG S SS + +YRR+     +SRPYE +TPTTIPI EILTNIEE+GME+LLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIEESGMEELLKR

Query:  PEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN-------------------
        PEKL G PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPP+R DRPAVIN                   
Subjt:  PEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN-------------------

Query:  ---TIFGGPSGDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ
           +       DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGW RSQLK+SPTPLVGFSGES+  EGCIDLPV++ QD T+VTQ
Subjt:  ---TIFGGPSGDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ

Query:  MAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        MAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  MAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia] (e value: 4.2e-201; %identity: 70.22)
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSS-GQAEYRRAEGGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  ++  +AD KS+DKGS SS  + EYRR E GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSS-GQAEYRRAEGGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN
        ISEILTNIEESGME+LLKRPEKL G  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPP+R DRPAVIN
Subjt:  ISEILTNIEESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN

Query:  TIFGGPSG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSP
        TIFGGP+G                                   DLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFEADLP---RKEFAAPTEELELIPLLSPEKQ
        ALE    R    E EADLP   +++F  PTEELEL+PLLSPE+Q
Subjt:  ALETLAGRDGTLEFEADLP---RKEFAAPTEELELIPLLSPEKQ

TrEMBL top hits (e value, %identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 3.9e-237; %identity: 83.71)
Query:  QAESSHN---PAGIITREEFDQLREELDAQVEALKAKY--------------------VLEAPISPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLR +LDAQVEALKAK                     VLEAPI PKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLREELDAQVEALKAKY--------------------VLEAPISPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E ADPKSKDKGSFSSG+AEYRRAE GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSG-
        ESGME+LLKRPEKL GAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP+RTDRPAVINTIFGGPSG 
Subjt:  ESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSG-

Query:  ----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGE
                                          DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGW RSQLK+SPTPLVGFSGE
Subjt:  ----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV
        SVIPEG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDRTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value: 2.3e-237; %identity: 70.85)
Query:  SSNQQAESSHNPA---GIITREEFDQLREELDAQVEALKAKY-----------VLEAPI-SPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKC
        SSNQQAESSHNPA   G+ITREEFDQLR +L+AQVEALKAK            + E+P  S   +APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKC
Subjt:  SSNQQAESSHNPA---GIITREEFDQLREELDAQVEALKAKY-----------VLEAPI-SPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKC

Query:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL
        RAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLADEALTVKL
Subjt:  RAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL

Query:  GEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIEESGM
        G+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E+AD KSKDKGSFSSG+AE+RRA  GPTRSRPYERFTPTTIPISEILTNIEESGM
Subjt:  GEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIEESGM

Query:  EELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSG-----
        E+LLKRPEKL GAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP +R DRPAVINTIFGGPSG     
Subjt:  EELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSG-----

Query:  ------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGESVIP
                                      DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGW RSQLK+S TPLVGFS ESVIP
Subjt:  ------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGESVIP

Query:  EGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLE
        EGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL  RDGTLE
Subjt:  EGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLE

Query:  FEADLPRKEFAAPTEELELIPLLSPEKQLESAYETDLARSVPVEILDNPSILE--PDLMEVG
        F+A+LPR+EFAAPTEELEL+PLL  +      +E +L     +  +D+   +E  P+ + VG
Subjt:  FEADLPRKEFAAPTEELELIPLLSPEKQLESAYETDLARSVPVEILDNPSILE--PDLMEVG

A0A6J1DD03 uncharacterized protein LOC111019899 (e value: 2.2e-203; %identity: 83.67)
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+G+AEYRRAE GPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRP
        TTIPISEILTNIEESGME+LLKRPEKL GAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPP+RTDRP
Subjt:  TTIPISEILTNIEESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRP

Query:  AVINTIFGGPSG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQL
        AVINTIFGGPSG                                   DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGW RSQL
Subjt:  AVINTIFGGPSG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVGTVRGEQTASRECYAS LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCALETLAGRDGTLEFEADLPRKEFAAPTEELELIPLLSPEKQLE
        +SVCALETL  RDGTLEFEADLP +EFAAP EELEL+PLLS EKQ++
Subjt:  SSVCALETLAGRDGTLEFEADLPRKEFAAPTEELELIPLLSPEKQLE

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 7.9e-246; %identity: 62.29)
Query:  MIQPANSTNTTDRRSLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARDPAPAPTSENFDALKREMEAIR
        M+QPANSTNT DRR+LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MIQPANSTNTTDRRSLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARDPAPAPTSENFDALKREMEAIR

Query:  TQMHFMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNEREGYTRQGGDLLEHLNRKRDSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMHFMEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNEREGYTRQGGDLLEHLNRKRDSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLREELDAQVEALKAKY--------------------VLEAPISPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL
         G+ITREEFDQL+ + DAQVEALKA+                     +LEA I PKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Subjt:  AGIITREEFDQLREELDAQVEALKAKY--------------------VLEAPISPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL

Query:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
        TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIEESGMEELLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  +AD KS+DKG S SS + +YRR+     +SRPYE +TPTTIPI EILTNIEE+GME+LLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKG-SFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIEESGMEELLKR

Query:  PEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN-------------------
        PEKL G PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPP+R DRPAVIN                   
Subjt:  PEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN-------------------

Query:  ---TIFGGPSGDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ
           +       DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGW RSQLK+SPTPLVGFSGES+  EGCIDLPV++ QD T+VTQ
Subjt:  ---TIFGGPSGDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQ

Query:  MAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        MAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  MAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DZB9 uncharacterized protein LOC111024904 (e value: 2.0e-201; %identity: 70.22)
Query:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSS-GQAEYRRAEGGPTRSRPYERFTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPE++I + +  ++  +AD KS+DKGS SS  + EYRR E GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSS-GQAEYRRAEGGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN
        ISEILTNIEESGME+LLKRPEKL G  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPP+R DRPAVIN
Subjt:  ISEILTNIEESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPQRTDRPAVIN

Query:  TIFGGPSG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSP
        TIFGGP+G                                   DLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDGTLEFEADLP---RKEFAAPTEELELIPLLSPEKQ
        ALE    R    E EADLP   +++F  PTEELEL+PLLSPE+Q
Subjt:  ALETLAGRDGTLEFEADLP---RKEFAAPTEELELIPLLSPEKQ
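
As a rough illustration of how a %identity figure relates to the alignment blocks above, the sketch below counts identical positions on a BLAST-style midline (the unlabeled middle line, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap) and divides by the block length. The function name `percent_identity` is hypothetical and not part of CuGenDB. The value for a single displayed block need not equal the whole-alignment figure reported for a hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block, from its Query and midline strings.

    Hypothetical helper for illustration; gap columns count toward the length.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment block")
    # Pad or trim the midline so that stripped trailing spaces do not shift the count.
    midline = midline.ljust(length)[:length]
    # Only letters on the midline mark identical residues ('+' and ' ' do not).
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / length


# Last displayed block of the first hit (LOC111008813).
query = "SVIPEGCIDLPVTLGQDRTRVTQMAEFV"
midline = "SVIPEG IDLPVTLGQD+T+VTQMAEFV"
print(round(percent_identity(query, midline), 2))  # 25 identities over 28 columns
```

Counting only letters treats conservative substitutions as non-identical, which is the usual convention for identity as opposed to similarity ("positives").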

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGATTCAACCAGCGAACTCGACCAATACGACAGACCGAAGGTCTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCGCGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCAGGGATCCAGCTCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATACGCACACAAATGCACTTC
ATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGTGAGCAAGGGGGTTCCCACCTC
GGCCCAGCCGAGGAGGAACGTCCCGAAAACAACGAGAGAGAGGGGTACACTCGCCAGGGGGGAGACCTCCTTGAGCATCTCAACAGAAAGAGAGACTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGAGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATACGTTTTGGAAGCACCAATCTCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGG
ACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGC
AGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAACTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAA
AAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTTGCACACTGCTCC
GATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAAGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAG
GCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAGCGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGGGCAGAT
CCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCAAGCTGAGTATCGAAGGGCGGAGGGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACC
ACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAGAACTACTCAAGCGTCCTGAGAAACTTTGGGGAGCCCCGGAGAGGCGCAGCAAG
GACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAG
TTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCACAACGCACCGACCGACCTGCGGTCATCAATACCATT
TTTGGAGGGCCAAGCGGGGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTA
GACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGAGGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCT
GGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGA
TCAGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGC
ACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTC
GAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTATTCCTCTGCTTAGTCCCGAGAAGCAGTTAGAATCGGCGTACGAG
ACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTCGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGGTCGGCGCTTCAGAATCCTCATGGATGGACCCG
ATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGAGGGTCAAAACGCATGTGTGTGCCCTTGAT
CCGACCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACTTACATATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAA
CACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGATTCAACCAGCGAACTCGACCAATACGACAGACCGAAGGTCTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCGCGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCAGGGATCCAGCTCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATACGCACACAAATGCACTTC
ATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACGTACGTGAGCAAGGGGGTTCCCACCTC
GGCCCAGCCGAGGAGGAACGTCCCGAAAACAACGAGAGAGAGGGGTACACTCGCCAGGGGGGAGACCTCCTTGAGCATCTCAACAGAAAGAGAGACTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTG
AGGGAGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATACGTTTTGGAAGCACCAATCTCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGG
ACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGC
AGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAACTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAA
AAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTTGCACACTGCTCC
GATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAAGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAG
GCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAGCGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGGGCAGAT
CCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCAAGCTGAGTATCGAAGGGCGGAGGGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACC
ACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAGAACTACTCAAGCGTCCTGAGAAACTTTGGGGAGCCCCGGAGAGGCGCAGCAAG
GACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAG
TTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCACAACGCACCGACCGACCTGCGGTCATCAATACCATT
TTTGGAGGGCCAAGCGGGGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTA
GACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGAGGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCT
GGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGA
TCAGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGC
ACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTC
GAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTATTCCTCTGCTTAGTCCCGAGAAGCAGTTAGAATCGGCGTACGAG
ACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTCGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGGTCGGCGCTTCAGAATCCTCATGGATGGACCCG
ATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGAGGGTCAAAACGCATGTGTGTGCCCTTGAT
CCGACCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACTTACATATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAA
CACCTGAAGCGTTATTATCCTTGA
Protein sequence
MIQPANSTNTTDRRSLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARDPAPAPTSENFDALKREMEAIRTQMHF
MEEMYNEMMLAAGAGSRSENRVTRVDVREQGGSHLGPAEEERPENNEREGYTRQGGDLLEHLNRKRDSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQL
REELDAQVEALKAKYVLEAPISPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK
KTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERAD
PKSKDKGSFSSGQAEYRRAEGGPTRSRPYERFTPTTIPISEILTNIEESGMEELLKRPEKLWGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKK
FVGKPRTSSAEKKEERKRSRTPPQRTDRPAVINTIFGGPSGDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWRRSQLKRSPTPLVGFS
GESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTL
EFEADLPRKEFAAPTEELELIPLLSPEKQLESAYETDLARSVPVEILDNPSILEPDLMEVGASESSWMDPIADFIRGNSPQDPKERRKLARRAARRVKTHVCALD
PTWEGPFEVKGIVRPGTYILADLKGDVLAHPWNAEHLKRYYP
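
The protein sequence above should correspond to a translation of the CDS sequence. Below is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1). `translate` is a hypothetical helper, and only the first ten and last eight codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon.

    Hypothetical helper: whitespace and line breaks are ignored, and a codon
    containing anything other than A, C, G, T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGATTCAACCAGCGAACTCGACCAATACG"))  # MIQPANSTNT
print(translate("CACCTGAAGCGTTATTATCCTTGA"))        # HLKRYYP*
```

Both fragments agree with the start (MIQPANSTNT) and end (HLKRYYP) of the listed protein sequence. Passing the whole CDS, with line breaks left in, would allow a full comparison against the protein sequence once the trailing `*` is dropped.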