CuGenDBv2

Moc08g36090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g36090
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr8:26626645..26632233
RNA-Seq Expression: Moc08g36090
Synteny: Moc08g36090
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
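The genome location string above can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, not part of the CuGenDB record: the names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'chr8:26626645..26632233'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.groups()
    return GenomeLocation(chrom, int(start), int(end))


loc = parse_location("chr8:26626645..26632233")
print(loc.chrom, loc.start, loc.end, loc.span)  # chr8 26626645 26632233 5589
```

Under the inclusive-coordinate assumption, the gene spans 5589 bp on chr8.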


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 2.0e-248; % identity: 87.28)
Query:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGELPFTSDVLEAPIPPKFKAPT------------------------------------I
        TPA VITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGE PFTSDVLEAPIPPKFKAPT                                    I
Subjt:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGELPFTSDVLEAPIPPKFKAPT------------------------------------I

Query:  ALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAP
        ALTGS R  YRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETL+EYVTRFQEEQLKV+HCSDDSAMCYFLTGLADEALTVKLGEEAP
Subjt:  ALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAP

Query:  TTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLK
         TFAEVLQKAKKVIDGQELLRTKTG+PERKIGRGRSGKDIE ADPKSKDKGSFSSG AEYRRAENGPTRSRPYERF PTTIPISEILTNIEESGMEKLLK
Subjt:  TTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLK

Query:  RPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRGAPERRSKDKYC  HREHGH+TSD WELKRQIE+LIQDGYFKKFVGKPRT SAEKK ERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKEL
Subjt:  RPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID
        ARAARREVCIIREQR TCPITF+ ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG ID
Subjt:  ARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID

Query:  LPITFGQDQTQVTQMAEFV
        LP+T GQDQTQVTQMAEFV
Subjt:  LPITFGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value: 6.5e-247; % identity: 79.55)
Query:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGELPFTSDVLEAPIPPKFKAPTIALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRH
        TP  VITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGE PFTSDVLEAP    +                          S+  ++++  F    
Subjt:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGELPFTSDVLEAPIPPKFKAPTIALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRH

Query:  YDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRS
         D + A+     R  +          FQE+QLKV+  SDDSAMCYFLTGLADEALTVKLG+EAP TFAEVLQKAKKVIDGQELLRTKTG+PER I RGRS
Subjt:  YDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRS

Query:  GKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKR
        GKD EKAD KSKDKGSFSSG AE+RRA NGPTRSRPYERF PTTIPISEILTNIEESGMEKLLKRPEKLRGAPERR+KDKYC  HREH H+TSD WELKR
Subjt:  GKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKR

Query:  QIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHND
        QIEDLIQD YFKKFVGKPRT SAEKK ERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQR TCPITF+SADLEEVHLPHND
Subjt:  QIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHND

Query:  ALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPII
        ALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESVIPEGCIDLP+T G DQTQVTQMAEFVVIDGRSAYNAIFGRPII
Subjt:  ALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPII

Query:  HSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIETLAGGDGTLEFEADLPRREFAAPTEELELVPLL
        HSFRAIPSTLHQVLKYSTPNG+G VRGEQ ASRECYASALKGSSVCA+ETL   DGTLEF+A+LPRREFAAPTEELELVPLL
Subjt:  HSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIETLAGGDGTLEFEADLPRREFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia] (e value: 9.1e-225; % identity: 90.13)
Query:  MCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAP
        MCYFLTGLADEALTVKL EEAP TFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+G AEYRRAENGPTRSRPYERF P
Subjt:  MCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYC  HREHGH+TSD WELK QIEDLIQDGYFKKFVGKPRT SAEKK ERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQR TCPITF+ ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLP+T GQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNG+GTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKG

Query:  SSVCAIETLAGGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQL
        +SVCA+ETL   DGTLEFEADLP REFAAP EELELVPLLS EKQ+
Subjt:  SSVCAIETLAGGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 1.4e-246; % identity: 64.18)
Query:  MVQPANSTNTVDRRTLAANDAHEREVGAAVVEGQGHDDLATEPLRRSARITAPVLPLVHPPRTSKATRGRGGTSKKGVWGPAPVLTSENLDALQREMEAM
        MVQPANSTNT DRR LAAN  H+REVGA VVEGQGH+DL TEPL RSARIT PVLP  H P+ SKA                                  
Subjt:  MVQPANSTNTVDRRTLAANDAHEREVGAAVVEGQGHDDLATEPLRRSARITAPVLPLVHPPRTSKATRGRGGTSKKGVWGPAPVLTSENLDALQREMEAM

Query:  RTQMRSMKEMYNEMILAACAGSRSDNRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLPTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGP
               +  YN                                                    P    VITREEFDQL+ + DAQVEALKA+CE+KE  
Subjt:  RTQMRSMKEMYNEMILAACAGSRSDNRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLPTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGP

Query:  LNDGDLGELPFTSDVLEAPIPPKFKAPT------------------------------------IALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSR
         +DGDLGEL F+SD+LEA IPPKFK PT                                    IALTGS R  YRRLPAR ISTYSQLR+EF++QFSSR
Subjt:  LNDGDLGELPFTSDVLEAPIPPKFKAPT------------------------------------IALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSR

Query:  HYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGR
        HYD+KT THLATIRQKEGETL+EYVTRF EEQLKV+HCSDDSAMCYFLTGLADE LTVKL EEAP TFAEVLQK KKVIDGQELLRTKTG+PE+ I +GR
Subjt:  HYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGR

Query:  SGKDIEKADPKSKDKG-SFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWEL
        +GKD  KAD KS+DKG S SS   +YRR+ +   +SRPYE + PTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYC  HR+HGH+TS+ WEL
Subjt:  SGKDIEKADPKSKDKG-SFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWEL

Query:  KRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPH
        KRQIEDLIQDGYFKKFVGKPR+ S EKK ERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQR T  I FN ADLE VHLPH
Subjt:  KRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPH

Query:  NDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRP
        NDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLP++  QD TQVTQMAEFVVIDGRSAYNAIFGRP
Subjt:  NDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRP

Query:  IIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIE
        IIHSFRA+PSTLHQVLKYST NG+GTVRGE   SRECYAS  K SSVCA+E
Subjt:  IIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIE

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia] (e value: 2.0e-208; % identity: 73.03)
Query:  KFKAPTIALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTV
        K +A  IALTGS R  YRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETL+EYVTRFQEEQLKV+HCSDDSAMCYFLT LADE LTV
Subjt:  KFKAPTIALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTV

Query:  KLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSS-GLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEE
        KLGEEAPTTF EVLQKAKKVIDGQELLRTKTG+PE++I + +  ++  KAD KS+DKGS SS    EYRR E+GP+RSRPYER+  +TIPISEILTNIEE
Subjt:  KLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSS-GLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEE

Query:  SGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQ
        SGMEKLLKRPEKLRG  E+R+K+KYC  HR+HGH+T+ CWELKRQIEDLIQDGYFKKFVGKPR+ S EKK ERKRSRTPPRR DRPAVINTIFGGP+GGQ
Subjt:  SGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQ

Query:  SGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGES
        SG+KRKELAR ARREVCIIRE + TC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                                    
Subjt:  SGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGES

Query:  VIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIETLAGGDG
            GCIDLP+T GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN +G VRGEQ  SRECYASALKGS+VCA+E       
Subjt:  VIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIETLAGGDG

Query:  TLEFEADLP---RREFAAPTEELELVPLLSPEKQ
          E EADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  TLEFEADLP---RREFAAPTEELELVPLLSPEKQ

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 9.8e-249; % identity: 87.28)
Query:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGELPFTSDVLEAPIPPKFKAPT------------------------------------I
        TPA VITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGE PFTSDVLEAPIPPKFKAPT                                    I
Subjt:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGELPFTSDVLEAPIPPKFKAPT------------------------------------I

Query:  ALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAP
        ALTGS R  YRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETL+EYVTRFQEEQLKV+HCSDDSAMCYFLTGLADEALTVKLGEEAP
Subjt:  ALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAP

Query:  TTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLK
         TFAEVLQKAKKVIDGQELLRTKTG+PERKIGRGRSGKDIE ADPKSKDKGSFSSG AEYRRAENGPTRSRPYERF PTTIPISEILTNIEESGMEKLLK
Subjt:  TTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLK

Query:  RPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRGAPERRSKDKYC  HREHGH+TSD WELKRQIE+LIQDGYFKKFVGKPRT SAEKK ERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKEL
Subjt:  RPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID
        ARAARREVCIIREQR TCPITF+ ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG ID
Subjt:  ARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCID

Query:  LPITFGQDQTQVTQMAEFV
        LP+T GQDQTQVTQMAEFV
Subjt:  LPITFGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value: 3.1e-247; % identity: 79.55)
Query:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGELPFTSDVLEAPIPPKFKAPTIALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRH
        TP  VITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGE PFTSDVLEAP    +                          S+  ++++  F    
Subjt:  TPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGELPFTSDVLEAPIPPKFKAPTIALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRH

Query:  YDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRS
         D + A+     R  +          FQE+QLKV+  SDDSAMCYFLTGLADEALTVKLG+EAP TFAEVLQKAKKVIDGQELLRTKTG+PER I RGRS
Subjt:  YDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRS

Query:  GKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKR
        GKD EKAD KSKDKGSFSSG AE+RRA NGPTRSRPYERF PTTIPISEILTNIEESGMEKLLKRPEKLRGAPERR+KDKYC  HREH H+TSD WELKR
Subjt:  GKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKR

Query:  QIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHND
        QIEDLIQD YFKKFVGKPRT SAEKK ERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQR TCPITF+SADLEEVHLPHND
Subjt:  QIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHND

Query:  ALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPII
        ALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESVIPEGCIDLP+T G DQTQVTQMAEFVVIDGRSAYNAIFGRPII
Subjt:  ALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPII

Query:  HSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIETLAGGDGTLEFEADLPRREFAAPTEELELVPLL
        HSFRAIPSTLHQVLKYSTPNG+G VRGEQ ASRECYASALKGSSVCA+ETL   DGTLEF+A+LPRREFAAPTEELELVPLL
Subjt:  HSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIETLAGGDGTLEFEADLPRREFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC111019899 (e value: 4.4e-225; % identity: 90.13)
Query:  MCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAP
        MCYFLTGLADEALTVKL EEAP TFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+G AEYRRAENGPTRSRPYERF P
Subjt:  MCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAP

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYC  HREHGH+TSD WELK QIEDLIQDGYFKKFVGKPRT SAEKK ERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQR TCPITF+ ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLP+T GQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNG+GTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKG

Query:  SSVCAIETLAGGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQL
        +SVCA+ETL   DGTLEFEADLP REFAAP EELELVPLLS EKQ+
Subjt:  SSVCAIETLAGGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 7.0e-247; % identity: 64.18)
Query:  MVQPANSTNTVDRRTLAANDAHEREVGAAVVEGQGHDDLATEPLRRSARITAPVLPLVHPPRTSKATRGRGGTSKKGVWGPAPVLTSENLDALQREMEAM
        MVQPANSTNT DRR LAAN  H+REVGA VVEGQGH+DL TEPL RSARIT PVLP  H P+ SKA                                  
Subjt:  MVQPANSTNTVDRRTLAANDAHEREVGAAVVEGQGHDDLATEPLRRSARITAPVLPLVHPPRTSKATRGRGGTSKKGVWGPAPVLTSENLDALQREMEAM

Query:  RTQMRSMKEMYNEMILAACAGSRSDNRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLPTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGP
               +  YN                                                    P    VITREEFDQL+ + DAQVEALKA+CE+KE  
Subjt:  RTQMRSMKEMYNEMILAACAGSRSDNRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLPTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGP

Query:  LNDGDLGELPFTSDVLEAPIPPKFKAPT------------------------------------IALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSR
         +DGDLGEL F+SD+LEA IPPKFK PT                                    IALTGS R  YRRLPAR ISTYSQLR+EF++QFSSR
Subjt:  LNDGDLGELPFTSDVLEAPIPPKFKAPT------------------------------------IALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSR

Query:  HYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGR
        HYD+KT THLATIRQKEGETL+EYVTRF EEQLKV+HCSDDSAMCYFLTGLADE LTVKL EEAP TFAEVLQK KKVIDGQELLRTKTG+PE+ I +GR
Subjt:  HYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGR

Query:  SGKDIEKADPKSKDKG-SFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWEL
        +GKD  KAD KS+DKG S SS   +YRR+ +   +SRPYE + PTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYC  HR+HGH+TS+ WEL
Subjt:  SGKDIEKADPKSKDKG-SFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWEL

Query:  KRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPH
        KRQIEDLIQDGYFKKFVGKPR+ S EKK ERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQR T  I FN ADLE VHLPH
Subjt:  KRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPH

Query:  NDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRP
        NDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLP++  QD TQVTQMAEFVVIDGRSAYNAIFGRP
Subjt:  NDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRP

Query:  IIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIE
        IIHSFRA+PSTLHQVLKYST NG+GTVRGE   SRECYAS  K SSVCA+E
Subjt:  IIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIE

A0A6J1DZB9 uncharacterized protein LOC111024904 (e value: 9.9e-209; % identity: 73.03)
Query:  KFKAPTIALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTV
        K +A  IALTGS R  YRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETL+EYVTRFQEEQLKV+HCSDDSAMCYFLT LADE LTV
Subjt:  KFKAPTIALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTV

Query:  KLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSS-GLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEE
        KLGEEAPTTF EVLQKAKKVIDGQELLRTKTG+PE++I + +  ++  KAD KS+DKGS SS    EYRR E+GP+RSRPYER+  +TIPISEILTNIEE
Subjt:  KLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSS-GLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEE

Query:  SGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQ
        SGMEKLLKRPEKLRG  E+R+K+KYC  HR+HGH+T+ CWELKRQIEDLIQDGYFKKFVGKPR+ S EKK ERKRSRTPPRR DRPAVINTIFGGP+GGQ
Subjt:  SGMEKLLKRPEKLRGAPERRSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQ

Query:  SGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGES
        SG+KRKELAR ARREVCIIRE + TC ITF  ADLE VHLPHNDALVIA LIDH +VRRVL+DG                                    
Subjt:  SGHKRKELARAARREVCIIREQRSTCPITFNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGES

Query:  VIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIETLAGGDG
            GCIDLP+T GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN +G VRGEQ  SRECYASALKGS+VCA+E       
Subjt:  VIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIETLAGGDG

Query:  TLEFEADLP---RREFAAPTEELELVPLLSPEKQ
          E EADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  TLEFEADLP---RREFAAPTEELELVPLLSPEKQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
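In the alignment blocks above, the unlabeled middle line marks identical residues with a letter, conservative substitutions with `+`, and mismatches or gaps with a space. The following is a minimal sketch, not taken from CuGenDB, of how a percent identity could be derived from one such segment. The function name `percent_identity` is hypothetical, and the result for a single segment is not expected to equal the whole-alignment figure reported for a hit.

```python
def percent_identity(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identical, length, percent) for one alignment segment.

    Assumption: letters on the midline mark identical positions, and the
    query line (gaps included) defines the segment length.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment segment")
    # Only consider midline columns that fall under the query line;
    # trailing spaces on the midline may have been stripped.
    identical = sum(ch.isalpha() for ch in midline[:length])
    return identical, length, 100.0 * identical / length


# Final segment of the first GenBank hit, copied from the page.
query = "LPITFGQDQTQVTQMAEFV"
midline = "LP+T GQDQTQVTQMAEFV"
identical, length, pct = percent_identity(query, midline)
print(identical, length, round(pct, 2))  # 17 19 89.47
```

Summing the identical counts and lengths over every segment of a hit before dividing would give a whole-alignment estimate.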

Sequences
CDS sequence
ATGGTTCAACCCGCGAACTCGACCAATACGGTAGATCGAAGGACCCTAGCTGCCAACGATGCACACGAGAGGGAGGTTGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGACCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACTTGTGCACCCCCCAAGGACATCCAAAGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGTCTGGGGTCCAGCCCCGGTCCTGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGCAAATGCGGTCCATGAAAGAAATG
TATAACGAAATGATATTAGCTGCATGCGCGGGGTCCCGATCTGATAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCCCGACTCCTGCAAGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCG
ACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAAAAAGAAGGCCCACTGAACGATGGCGACTTGGGAGAATTGCCCTTCACCTCGGACGTTTTGGAAGCACCTATC
CCTCCGAAGTTCAAAGCTCCTACCATCGCGCTTACTGGCAGCACGCGATTTTTGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTT
CCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCAAGAATATGTCACCAGATTCCAGG
AGGAGCAATTGAAGGTCTCACACTGCTCTGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGACC
ACTTTCGCCGAAGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCAACCAGAACGAAAGATCGGCCGGGGCAGGAGTGGAAA
AGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCCTAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAAC
GCTTCGCCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTCCTCAAACGCCCTGAGAAGCTTCGAGGAGCCCCGGAGAGG
CGTAGCAAGGACAAGTATTGCTGCATCCATCGGGAGCACGGCCATGACACGTCAGATTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAA
GAAGTTTGTGGGAAAGCCTAGGACCATCTCGGCAGAGAAAAAGGGAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTT
TCGGAGGGCCAAGCGGGGGCCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGTCGACCTGCCCAATCACC
TTCAACAGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCACGTGGTGGTCAGAAGGGTGCTGGTAGACGGAGGCGC
ATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCC
CAGAGGGTTGCATCGACTTGCCGATCACATTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGATCGGCCTATAACGCCATCTTT
GGGAGACCCATCATCCATTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCATGGGCACGGTCCGAGGAGAACAGACCGCTTC
GAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCATCGAAACTCTCGCCGGTGGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGT
TTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTA
GATAATCCCTCAATCTCGGAGCCAGATCTGATGGAGATCGACGCTCCAGAGCCCTCATGGATGGACCCAATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAA
GGAGCGTAGAAAGTTAGCAAGGCAAGCAGCTCGGAAGGTCCAAACCCATGTGGGTGCCCATGACCCAGCCTGGGAGGGGCCGTTTGAGGTCAAAGGAATAGTCCGACCTG
GGACGTACATGTTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCATTATTATCCTTGA
mRNA sequence
ATGGTTCAACCCGCGAACTCGACCAATACGGTAGATCGAAGGACCCTAGCTGCCAACGATGCACACGAGAGGGAGGTTGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGACCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACTTGTGCACCCCCCAAGGACATCCAAAGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGTCTGGGGTCCAGCCCCGGTCCTGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGCAAATGCGGTCCATGAAAGAAATG
TATAACGAAATGATATTAGCTGCATGCGCGGGGTCCCGATCTGATAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCCCGACTCCTGCAAGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCG
ACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAAAAAGAAGGCCCACTGAACGATGGCGACTTGGGAGAATTGCCCTTCACCTCGGACGTTTTGGAAGCACCTATC
CCTCCGAAGTTCAAAGCTCCTACCATCGCGCTTACTGGCAGCACGCGATTTTTGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTT
CCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCAAGAATATGTCACCAGATTCCAGG
AGGAGCAATTGAAGGTCTCACACTGCTCTGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGACC
ACTTTCGCCGAAGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCAACCAGAACGAAAGATCGGCCGGGGCAGGAGTGGAAA
AGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCCTAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAAC
GCTTCGCCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTCCTCAAACGCCCTGAGAAGCTTCGAGGAGCCCCGGAGAGG
CGTAGCAAGGACAAGTATTGCTGCATCCATCGGGAGCACGGCCATGACACGTCAGATTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAA
GAAGTTTGTGGGAAAGCCTAGGACCATCTCGGCAGAGAAAAAGGGAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTT
TCGGAGGGCCAAGCGGGGGCCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGTCGACCTGCCCAATCACC
TTCAACAGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCACGTGGTGGTCAGAAGGGTGCTGGTAGACGGAGGCGC
ATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCC
CAGAGGGTTGCATCGACTTGCCGATCACATTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGATCGGCCTATAACGCCATCTTT
GGGAGACCCATCATCCATTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCATGGGCACGGTCCGAGGAGAACAGACCGCTTC
GAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCATCGAAACTCTCGCCGGTGGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGT
TTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTA
GATAATCCCTCAATCTCGGAGCCAGATCTGATGGAGATCGACGCTCCAGAGCCCTCATGGATGGACCCAATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAA
GGAGCGTAGAAAGTTAGCAAGGCAAGCAGCTCGGAAGGTCCAAACCCATGTGGGTGCCCATGACCCAGCCTGGGAGGGGCCGTTTGAGGTCAAAGGAATAGTCCGACCTG
GGACGTACATGTTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCATTATTATCCTTGA
Protein sequence
MVQPANSTNTVDRRTLAANDAHEREVGAAVVEGQGHDDLATEPLRRSARITAPVLPLVHPPRTSKATRGRGGTSKKGVWGPAPVLTSENLDALQREMEAMRTQMRSMKEM
YNEMILAACAGSRSDNRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLPTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGELPFTSDVLEAPI
PPKFKAPTIALTGSTRFLYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVSHCSDDSAMCYFLTGLADEALTVKLGEEAPT
TFAEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGLAEYRRAENGPTRSRPYERFAPTTIPISEILTNIEESGMEKLLKRPEKLRGAPER
RSKDKYCCIHREHGHDTSDCWELKRQIEDLIQDGYFKKFVGKPRTISAEKKGERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRSTCPIT
FNSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPITFGQDQTQVTQMAEFVVIDGRSAYNAIF
GRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCAIETLAGGDGTLEFEADLPRREFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEIL
DNPSISEPDLMEIDAPEPSWMDPIVDFIRGNSPQDPKERRKLARQAARKVQTHVGAHDPAWEGPFEVKGIVRPGTYMLADLKGDVLAHPWNAEHLKHYYP
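
The CDS and protein sequences listed above can be checked against each other by translating codons. The following is a minimal sketch under stated assumptions: it uses the standard genetic code (NCBI translation table 1), which the page does not name, and the helper `translate` is hypothetical rather than a CuGenDB tool. Only the first 30 and last 15 bases of the listed CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGTTCAACCCGCGAACTCGACCAATACG"  # first 30 bases of the CDS
cds_end = "CATTATTATCCTTGA"                   # last 15 bases of the CDS
print(translate(cds_start))  # MVQPANSTNT
print(translate(cds_end))    # HYYP
```

Both fragments agree with the start (MVQPANSTNT) and end (HYYP) of the listed protein sequence. Passing the full CDS, with line breaks included, to `translate` would extend the same check to the whole protein.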