CuGenDBv2

Moc08g26460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g26460
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome location: chr8:19029153..19034346
RNA-Seq Expression: Moc08g26460
Synteny: Moc08g26460
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
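The genome location string can be split into chromosome, start, and end for downstream use. The sketch below is illustrative only: `parse_location` is a hypothetical helper, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr8:19029153..19034346")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 5194 bp on chr8.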


Homology
GenBank top hits (hit, e value, % identity, alignment)
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia] (e value: 3.3e-129; % identity: 35.9)
Query:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMVCARTWLNALEPNSINTWAE
        MNRNA+DPPPPQNPPVNGDMAGEGAANR GEI N ILL DNRDV MRNYVT AFHNLNSGINN LPQAA  +LKPVMF M+                   
Subjt:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMVCARTWLNALEPNSINTWAE

Query:  LTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQ-IEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDI
           + + ++  LT                    ++ +  LK  +    +  LP   +   +L  GLDRSSRMMLNTAAN SLLEKSVNEIVDILNKM DI
Subjt:  LTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQ-IEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDI

Query:  NDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTIS---TIP-----------------------------------------
        NDQ E GRSL KKQVSAG+FELDT+A MQAQMAAMNQ LKQ TM KETKT  S    IP                                         
Subjt:  NDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTIS---TIP-----------------------------------------

Query:  --------EPS-------------------PVLQISDISCVYC------------GDNHLYENCPANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFS
                 P                    P+   S ++ V C             DN L    P   V+ + V    +R  N  +  Y+          
Subjt:  --------EPS-------------------PVLQISDISCVYC------------GDNHLYENCPANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFS

Query:  WRNQGVA-----------SSSVQAPAQQYKQNYTPP----GFPTQS---ASQPQQYNQQRGQNTTQQSGSNASLEAMMKESMTRN---DAAIRSLE----
         R  G             +    AP  Q K  +T P     F   S    + P  +  QR           +++E  M +        D  + +LE    
Subjt:  WRNQGVA-----------SSSVQAPAQQYKQNYTPP----GFPTQS---ASQPQQYNQQRGQNTTQQSGSNASLEAMMKESMTRN---DAAIRSLE----

Query:  --MQEGQIANDQKFR---PQGTLPGH--------TENPKQDREGKEHCKVVI--TRSGLSYEG---PSLPDKGTDVVTPVPASTSNP--QQEEKAEP---
           +   + N +K      +GT+ GH         +  K D   K    + +   RS L + G     + D          A  S P  Q  E  +P   
Subjt:  --MQEGQIANDQKFR---PQGTLPGH--------TENPKQDREGKEHCKVVI--TRSGLSYEG---PSLPDKGTDVVTPVPASTSNP--QQEEKAEP---

Query:  ----VSSEEKGKKADKGKQVV-------PNTTPQDQDPTLVKIAPGKSQ--------RISKSLSSLSF---LALLEFIGGIFVVD---------------
            + + E  KKA     ++       P     D     V +  G+ +          SK+L+S      +   E +  +F  D               
Subjt:  ----VSSEEKGKKADKGKQVV-------PNTTPQDQDPTLVKIAPGKSQ--------RISKSLSSLSF---LALLEFIGGIFVVD---------------

Query:  -----------REANSRTMEGSSSSKPHDKE-KEKKRVL---------LPPPTKPDMIPLEPPRISHEKLVFDPREQSRKYEEAIRMNPRRNLSIGGTNS
                   +EA  R +      +  D E K++K            L    + D   ++   +    L          Y   +      N      + 
Subjt:  -----------REANSRTMEGSSSSKPHDKE-KEKKRVL---------LPPPTKPDMIPLEPPRISHEKLVFDPREQSRKYEEAIRMNPRRNLSIGGTNS

Query:  EKINMESHDARVNKEGHNERKLGGVNKVYLRKNQSLEEKGV--VLDEEIARLQERA-EMFSKNNEIRDKENERVYAKIEELNIKWQEF-MENSKKVSEKI
         +I    HD R  K         G + +  R     E  G+  V +  I  + E++  +  K+  I+  +    Y    +  +    + +   K     +
Subjt:  EKINMESHDARVNKEGHNERKLGGVNKVYLRKNQSLEEKGV--VLDEEIARLQERA-EMFSKNNEIRDKENERVYAKIEELNIKWQEF-MENSKKVSEKI

Query:  QLELSSMSIRRRMNLSQNNPVSESL----ELSILPPLSTTVAVHVEGQEQVSGDS---EHDMEPLEHLDSATVEIQCQIAPSAIMDETPPATLQGILSPS
        +LE  +    R++N          L    EL      S   A     Q +   D    + ++EP E+L       + ++ P  +      + +   + P 
Subjt:  QLELSSMSIRRRMNLSQNNPVSESL----ELSILPPLSTTVAVHVEGQEQVSGDS---EHDMEPLEHLDSATVEIQCQIAPSAIMDETPPATLQGILSPS

Query:  FPDPILTNKPLVF----------YDSEQERTTSKIAKILVALNEARGEDPLEDDGNMHGDEFEDDEDNDDISQYEVRVRTPVHESQQVDEEPTTKEQEGT
            + ++K  +F          Y    +R    + +     +E  GE P E    +HGDEFED+EDNDDISQYEV+VRTPVHESQQVDEEP TKEQEGT
Subjt:  FPDPILTNKPLVF----------YDSEQERTTSKIAKILVALNEARGEDPLEDDGNMHGDEFEDDEDNDDISQYEVRVRTPVHESQQVDEEPTTKEQEGT

Query:  SGPVDVLSEAIEESSSFSSQGKTPSLSSLNVSDPNFVSTAETSDEEVSLTKVVKKTQKKKKVPEIGAISRPRTNAAVARLAAQKEAEAGPFKKAKRARVQ
        SGPVDV SEA+EESSS SSQ                                             GA+SRPRT  AVARLAAQKEAEAGP KKAK ARVQ
Subjt:  SGPVDVLSEAIEESSSFSSQGKTPSLSSLNVSDPNFVSTAETSDEEVSLTKVVKKTQKKKKVPEIGAISRPRTNAAVARLAAQKEAEAGPFKKAKRARVQ

Query:  RGAEEPLEKANEEEPNSTEQTPSKVKGVRLEVRRPTFTTRDILLERGFDEAQESVPEYVRKRLVQNGWEVLFAPTTRVSEALVKEFYTAINPNRGDIVRV
        R AEEPLE+ANEEEP+STEQTPS+VK VRLEVRRPTFTTRDILLERGFDEAQE VPEYVRKR+V+NGWE LFAP TRVSEALVKEFYTAINPNRGD VRV
Subjt:  RGAEEPLEKANEEEPNSTEQTPSKVKGVRLEVRRPTFTTRDILLERGFDEAQESVPEYVRKRLVQNGWEVLFAPTTRVSEALVKEFYTAINPNRGDIVRV

Query:  RAP------------------C--------------------INEQATVWMYVVKNRLIPTSHDSSIKRNRAMMVYILMKVIEFNFGELIRNEIRSWSEK
        R                    C                    INEQATVWMYVVKNRLIPTS+DSSIKRNRAM+VYIL+K +EFNFGELIRNEI+S SEK
Subjt:  RAP------------------C--------------------INEQATVWMYVVKNRLIPTSHDSSIKRNRAMMVYILMKVIEFNFGELIRNEIRSWSEK

Query:  M
        +
Subjt:  M

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia] (e value: 1.3e-154; % identity: 70.56)
Query:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMVCARTWLNALEPNSINTWAE
        MN N +DPP P NPPV+GD AGEGAANR GE+ NPILL DNRDV +RNYVTHAFHNLNS + +  P   +    P    +       NA + + ++  A 
Subjt:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMVCARTWLNALEPNSINTWAE

Query:  LTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDIN
                   L  NADLREDIVSFRQKENEAVQE WER KELLRRC SHGLP CVQIEQ YRGLDR SRMMLNTAAN SL EKS++EI+DILNKMTD N
Subjt:  LTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDIN

Query:  DQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTISTIPEPSPVLQISDISCVYCGDNHLYENCPANPVSIFYVGQGAQRNFNP
        DQ EIGRSLPKKQVSA +FELDT+ASMQAQMA +NQ LKQLTM KETKT  S + EPS  LQISDISCVYCGDN LYENCPANP S+FYVGQ AQRNFNP
Subjt:  DQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTISTIPEPSPVLQISDISCVYCGDNHLYENCPANPVSIFYVGQGAQRNFNP

Query:  YSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYKQNYTPPGFPTQSASQPQQYNQQRGQNTTQQSGSNASLEAMMKESMTRN-----------DAAIRS
        YSNTY+  WR+HPNFSW NQGVASSS Q PAQQYKQNYTPP FPTQ ASQPQQYNQQR QNTTQQ GSN SLEAM KE MTR+           D  IR 
Subjt:  YSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYKQNYTPPGFPTQSASQPQQYNQQRGQNTTQQSGSNASLEAMMKESMTRN-----------DAAIRS

Query:  LEMQEGQIANDQKFRPQGTLPGHTENPK
        LEMQ GQIAND+K RPQGTLPG+TENPK
Subjt:  LEMQEGQIANDQKFRPQGTLPGHTENPK

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia] (e value: 3.9e-146; % identity: 72.92)
Query:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMV-------------------
        MNRNA+DPPPPQNPPVNGDMAGE AANRVGEI N ILL DNRDV MRNYVTHAFHNLNSGINNPLPQAA F+LKPVMFQ++                   
Subjt:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMV-------------------

Query:  ------------------------------CARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHG
                                       ARTW+NALEPNSINTWAELT+KFLAKYHTLT+NADLREDIVSFRQKENEAVQEAWER KELLRRCPSHG
Subjt:  ------------------------------CARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHG

Query:  LPACVQIEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDINDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTI
        LP+CVQIEQ YRGLDRSS+MMLNT AN SLLEKSVNEIVD+LNKMTDINDQ E+GRSLPKKQVS G+FELDT+ASMQAQMAAMNQ LKQLTM KETKT  
Subjt:  LPACVQIEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDINDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTI

Query:  STIPEPSPVLQISDISCVYCGDNHLYENCPANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYK
        S IPE SP+LQISDISCVYC                   GQGAQRNFNPYSNTYN GWRHHPNFSW NQGVASSS QAPAQQYK
Subjt:  STIPEPSPVLQISDISCVYCGDNHLYENCPANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYK

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia] (e value: 3.9e-98; % identity: 77.31)
Query:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLL
        A TW+N LE N I TWAELT+KFLAKYHTLTRNADL+EDIVSFRQ+E+EAVQEAWER KELL+RC SHGLP CVQI+Q YRGLD   RMM +TAANCSLL
Subjt:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLL

Query:  EKSVNEIVDILNKMTDINDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTIST-IPEPSPVLQISDISCVYCGDNHLYENCP
        EKSVNEI+DILNKM DINDQ E+GRSLPKKQ SAG+FELDT+ S+QAQ++AM+Q LKQLTM K  K   S  I EPS +LQISDISCVYC DNHLYENC 
Subjt:  EKSVNEIVDILNKMTDINDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTIST-IPEPSPVLQISDISCVYCGDNHLYENCP

Query:  ANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRN
        ANP  IFYVGQG QRNFNPYSNTYN GWR HPNFS  N
Subjt:  ANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRN

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia] (e value: 3.4e-94; % identity: 41.7)
Query:  ISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMV-------------------------------------------------C
        + NPI + D RD  MR+Y      +LNS + N  P  A F+ KP+M QM+                                                  
Subjt:  ISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMV-------------------------------------------------C

Query:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLL
        A  WLNA   ++I TW+++ +KFL KY   TRNAD+RE+I+SFRQKENEAV  AWER K+L+  CP+ G+PACVQIE  +RG D  ++MMLN AAN    
Subjt:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLL

Query:  EKSVNEIVDILNKMTDINDQ--AEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTISTIPEPSPVLQISDISCVYCGDNHLYENC
         KS NEIV+IL+++++ N Q  +E  R+  K+   AG+  LD + SMQ Q+  + Q LK +        +      PSPV QI++ +C YCGD H  ENC
Subjt:  EKSVNEIVDILNKMTDINDQ--AEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTISTIPEPSPVLQISDISCVYCGDNHLYENC

Query:  PANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYKQNYTPPGFPTQSA--SQPQQYNQQRGQ-NTTQQSGSNASL-----
        P+NP S++YVGQ  Q+ FNPYSNTYN GW+ HPNFSW  QG  SS+     QQYK+ YTPPGFP   A    P QYNQQ+      QQ+ SN  +     
Subjt:  PANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYKQNYTPPGFPTQSA--SQPQQYNQQRGQ-NTTQQSGSNASL-----

Query:  ----EAMMKESMT-----------------RNDAAIRSLEMQEGQIANDQKFRPQGTLPGHTENPKQDREGKEHCKVVITRSGLSYEGPSLPDKGTDVVT
            +A MKE MT                 RND  +R LEMQ GQ+ N+ + RPQG+LP  TE P+  R GKEHC  + TRSGL YEGP +PD+      
Subjt:  ----EAMMKESMT-----------------RNDAAIRSLEMQEGQIANDQKFRPQGTLPGHTENPKQDREGKEHCKVVITRSGLSYEGPSLPDKGTDVVT

Query:  PVPASTSNPQQEEKAEPV
            S+ +P +E+  + V
Subjt:  PVPASTSNPQQEEKAEPV
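The unlabeled middle line of each alignment block above appears to follow the usual BLAST match-line convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a minimal sketch under that assumption, the hypothetical helper `midline_stats` counts identities and positives for a single block. The example uses the last block of the XP_022158314.1 alignment. The % identity values in the hit list cover the whole alignment, so a single block will generally give a different figure.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, columns) for one BLAST-style match line.

    Assumes letters mark identities and '+' marks conservative substitutions.
    Trailing spaces must be preserved, or the column count will be too small.
    """
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return identical, positive, len(midline)


# Match line for the query segment LEMQEGQIANDQKFRPQGTLPGHTENPK.
block = "LEMQ GQIAND+K RPQGTLPG+TENPK"
identical, positive, columns = midline_stats(block)
print(f"{identical}/{columns} identical, {positive}/{columns} positive")
```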

TrEMBL top hits (hit, e value, % identity, alignment)
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 (e value: 1.6e-129; % identity: 35.9)
Query:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMVCARTWLNALEPNSINTWAE
        MNRNA+DPPPPQNPPVNGDMAGEGAANR GEI N ILL DNRDV MRNYVT AFHNLNSGINN LPQAA  +LKPVMF M+                   
Subjt:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMVCARTWLNALEPNSINTWAE

Query:  LTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQ-IEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDI
           + + ++  LT                    ++ +  LK  +    +  LP   +   +L  GLDRSSRMMLNTAAN SLLEKSVNEIVDILNKM DI
Subjt:  LTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQ-IEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDI

Query:  NDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTIS---TIP-----------------------------------------
        NDQ E GRSL KKQVSAG+FELDT+A MQAQMAAMNQ LKQ TM KETKT  S    IP                                         
Subjt:  NDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTIS---TIP-----------------------------------------

Query:  --------EPS-------------------PVLQISDISCVYC------------GDNHLYENCPANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFS
                 P                    P+   S ++ V C             DN L    P   V+ + V    +R  N  +  Y+          
Subjt:  --------EPS-------------------PVLQISDISCVYC------------GDNHLYENCPANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFS

Query:  WRNQGVA-----------SSSVQAPAQQYKQNYTPP----GFPTQS---ASQPQQYNQQRGQNTTQQSGSNASLEAMMKESMTRN---DAAIRSLE----
         R  G             +    AP  Q K  +T P     F   S    + P  +  QR           +++E  M +        D  + +LE    
Subjt:  WRNQGVA-----------SSSVQAPAQQYKQNYTPP----GFPTQS---ASQPQQYNQQRGQNTTQQSGSNASLEAMMKESMTRN---DAAIRSLE----

Query:  --MQEGQIANDQKFR---PQGTLPGH--------TENPKQDREGKEHCKVVI--TRSGLSYEG---PSLPDKGTDVVTPVPASTSNP--QQEEKAEP---
           +   + N +K      +GT+ GH         +  K D   K    + +   RS L + G     + D          A  S P  Q  E  +P   
Subjt:  --MQEGQIANDQKFR---PQGTLPGH--------TENPKQDREGKEHCKVVI--TRSGLSYEG---PSLPDKGTDVVTPVPASTSNP--QQEEKAEP---

Query:  ----VSSEEKGKKADKGKQVV-------PNTTPQDQDPTLVKIAPGKSQ--------RISKSLSSLSF---LALLEFIGGIFVVD---------------
            + + E  KKA     ++       P     D     V +  G+ +          SK+L+S      +   E +  +F  D               
Subjt:  ----VSSEEKGKKADKGKQVV-------PNTTPQDQDPTLVKIAPGKSQ--------RISKSLSSLSF---LALLEFIGGIFVVD---------------

Query:  -----------REANSRTMEGSSSSKPHDKE-KEKKRVL---------LPPPTKPDMIPLEPPRISHEKLVFDPREQSRKYEEAIRMNPRRNLSIGGTNS
                   +EA  R +      +  D E K++K            L    + D   ++   +    L          Y   +      N      + 
Subjt:  -----------REANSRTMEGSSSSKPHDKE-KEKKRVL---------LPPPTKPDMIPLEPPRISHEKLVFDPREQSRKYEEAIRMNPRRNLSIGGTNS

Query:  EKINMESHDARVNKEGHNERKLGGVNKVYLRKNQSLEEKGV--VLDEEIARLQERA-EMFSKNNEIRDKENERVYAKIEELNIKWQEF-MENSKKVSEKI
         +I    HD R  K         G + +  R     E  G+  V +  I  + E++  +  K+  I+  +    Y    +  +    + +   K     +
Subjt:  EKINMESHDARVNKEGHNERKLGGVNKVYLRKNQSLEEKGV--VLDEEIARLQERA-EMFSKNNEIRDKENERVYAKIEELNIKWQEF-MENSKKVSEKI

Query:  QLELSSMSIRRRMNLSQNNPVSESL----ELSILPPLSTTVAVHVEGQEQVSGDS---EHDMEPLEHLDSATVEIQCQIAPSAIMDETPPATLQGILSPS
        +LE  +    R++N          L    EL      S   A     Q +   D    + ++EP E+L       + ++ P  +      + +   + P 
Subjt:  QLELSSMSIRRRMNLSQNNPVSESL----ELSILPPLSTTVAVHVEGQEQVSGDS---EHDMEPLEHLDSATVEIQCQIAPSAIMDETPPATLQGILSPS

Query:  FPDPILTNKPLVF----------YDSEQERTTSKIAKILVALNEARGEDPLEDDGNMHGDEFEDDEDNDDISQYEVRVRTPVHESQQVDEEPTTKEQEGT
            + ++K  +F          Y    +R    + +     +E  GE P E    +HGDEFED+EDNDDISQYEV+VRTPVHESQQVDEEP TKEQEGT
Subjt:  FPDPILTNKPLVF----------YDSEQERTTSKIAKILVALNEARGEDPLEDDGNMHGDEFEDDEDNDDISQYEVRVRTPVHESQQVDEEPTTKEQEGT

Query:  SGPVDVLSEAIEESSSFSSQGKTPSLSSLNVSDPNFVSTAETSDEEVSLTKVVKKTQKKKKVPEIGAISRPRTNAAVARLAAQKEAEAGPFKKAKRARVQ
        SGPVDV SEA+EESSS SSQ                                             GA+SRPRT  AVARLAAQKEAEAGP KKAK ARVQ
Subjt:  SGPVDVLSEAIEESSSFSSQGKTPSLSSLNVSDPNFVSTAETSDEEVSLTKVVKKTQKKKKVPEIGAISRPRTNAAVARLAAQKEAEAGPFKKAKRARVQ

Query:  RGAEEPLEKANEEEPNSTEQTPSKVKGVRLEVRRPTFTTRDILLERGFDEAQESVPEYVRKRLVQNGWEVLFAPTTRVSEALVKEFYTAINPNRGDIVRV
        R AEEPLE+ANEEEP+STEQTPS+VK VRLEVRRPTFTTRDILLERGFDEAQE VPEYVRKR+V+NGWE LFAP TRVSEALVKEFYTAINPNRGD VRV
Subjt:  RGAEEPLEKANEEEPNSTEQTPSKVKGVRLEVRRPTFTTRDILLERGFDEAQESVPEYVRKRLVQNGWEVLFAPTTRVSEALVKEFYTAINPNRGDIVRV

Query:  RAP------------------C--------------------INEQATVWMYVVKNRLIPTSHDSSIKRNRAMMVYILMKVIEFNFGELIRNEIRSWSEK
        R                    C                    INEQATVWMYVVKNRLIPTS+DSSIKRNRAM+VYIL+K +EFNFGELIRNEI+S SEK
Subjt:  RAP------------------C--------------------INEQATVWMYVVKNRLIPTSHDSSIKRNRAMMVYILMKVIEFNFGELIRNEIRSWSEK

Query:  M
        +
Subjt:  M

A0A6J1DY39 uncharacterized protein LOC111025653 (e value: 1.7e-94; % identity: 41.7)
Query:  ISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMV-------------------------------------------------C
        + NPI + D RD  MR+Y      +LNS + N  P  A F+ KP+M QM+                                                  
Subjt:  ISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMV-------------------------------------------------C

Query:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLL
        A  WLNA   ++I TW+++ +KFL KY   TRNAD+RE+I+SFRQKENEAV  AWER K+L+  CP+ G+PACVQIE  +RG D  ++MMLN AAN    
Subjt:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLL

Query:  EKSVNEIVDILNKMTDINDQ--AEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTISTIPEPSPVLQISDISCVYCGDNHLYENC
         KS NEIV+IL+++++ N Q  +E  R+  K+   AG+  LD + SMQ Q+  + Q LK +        +      PSPV QI++ +C YCGD H  ENC
Subjt:  EKSVNEIVDILNKMTDINDQ--AEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTISTIPEPSPVLQISDISCVYCGDNHLYENC

Query:  PANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYKQNYTPPGFPTQSA--SQPQQYNQQRGQ-NTTQQSGSNASL-----
        P+NP S++YVGQ  Q+ FNPYSNTYN GW+ HPNFSW  QG  SS+     QQYK+ YTPPGFP   A    P QYNQQ+      QQ+ SN  +     
Subjt:  PANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYKQNYTPPGFPTQSA--SQPQQYNQQRGQ-NTTQQSGSNASL-----

Query:  ----EAMMKESMT-----------------RNDAAIRSLEMQEGQIANDQKFRPQGTLPGHTENPKQDREGKEHCKVVITRSGLSYEGPSLPDKGTDVVT
            +A MKE MT                 RND  +R LEMQ GQ+ N+ + RPQG+LP  TE P+  R GKEHC  + TRSGL YEGP +PD+      
Subjt:  ----EAMMKESMT-----------------RNDAAIRSLEMQEGQIANDQKFRPQGTLPGHTENPKQDREGKEHCKVVITRSGLSYEGPSLPDKGTDVVT

Query:  PVPASTSNPQQEEKAEPV
            S+ +P +E+  + V
Subjt:  PVPASTSNPQQEEKAEPV

A0A6J1DYY9 uncharacterized protein LOC111025557 (e value: 1.5e-98; % identity: 77.73)
Query:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLL
        A TWLN LE N I TWAELT+KFLAKYHTLTRNADL+EDIVSFRQ+E+EAVQEAWER KELL+RC SHGLP CVQI+Q YRGLD   RMM +TAANCSLL
Subjt:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLL

Query:  EKSVNEIVDILNKMTDINDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTIST-IPEPSPVLQISDISCVYCGDNHLYENCP
        EKSVNEI+DILNKM DINDQ E+GRSLPKKQ SAG+FELDT+ S+QAQ++AM+Q LKQLTM K  K   S  I EPS +LQISDISCVYC DNHLYENC 
Subjt:  EKSVNEIVDILNKMTDINDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTIST-IPEPSPVLQISDISCVYCGDNHLYENCP

Query:  ANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRN
        ANP  IFYVGQG QRNFNPYSNTYN GWR HPNFS  N
Subjt:  ANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRN

A0A6J1DZ19 uncharacterized protein LOC111024824 (e value: 6.4e-155; % identity: 70.56)
Query:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMVCARTWLNALEPNSINTWAE
        MN N +DPP P NPPV+GD AGEGAANR GE+ NPILL DNRDV +RNYVTHAFHNLNS + +  P   +    P    +       NA + + ++  A 
Subjt:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMVCARTWLNALEPNSINTWAE

Query:  LTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDIN
                   L  NADLREDIVSFRQKENEAVQE WER KELLRRC SHGLP CVQIEQ YRGLDR SRMMLNTAAN SL EKS++EI+DILNKMTD N
Subjt:  LTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDIN

Query:  DQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTISTIPEPSPVLQISDISCVYCGDNHLYENCPANPVSIFYVGQGAQRNFNP
        DQ EIGRSLPKKQVSA +FELDT+ASMQAQMA +NQ LKQLTM KETKT  S + EPS  LQISDISCVYCGDN LYENCPANP S+FYVGQ AQRNFNP
Subjt:  DQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTISTIPEPSPVLQISDISCVYCGDNHLYENCPANPVSIFYVGQGAQRNFNP

Query:  YSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYKQNYTPPGFPTQSASQPQQYNQQRGQNTTQQSGSNASLEAMMKESMTRN-----------DAAIRS
        YSNTY+  WR+HPNFSW NQGVASSS Q PAQQYKQNYTPP FPTQ ASQPQQYNQQR QNTTQQ GSN SLEAM KE MTR+           D  IR 
Subjt:  YSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYKQNYTPPGFPTQSASQPQQYNQQRGQNTTQQSGSNASLEAMMKESMTRN-----------DAAIRS

Query:  LEMQEGQIANDQKFRPQGTLPGHTENPK
        LEMQ GQIAND+K RPQGTLPG+TENPK
Subjt:  LEMQEGQIANDQKFRPQGTLPGHTENPK

A0A6J1E251 uncharacterized protein LOC111025302 (e value: 1.9e-146; % identity: 72.92)
Query:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMV-------------------
        MNRNA+DPPPPQNPPVNGDMAGE AANRVGEI N ILL DNRDV MRNYVTHAFHNLNSGINNPLPQAA F+LKPVMFQ++                   
Subjt:  MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMV-------------------

Query:  ------------------------------CARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHG
                                       ARTW+NALEPNSINTWAELT+KFLAKYHTLT+NADLREDIVSFRQKENEAVQEAWER KELLRRCPSHG
Subjt:  ------------------------------CARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHG

Query:  LPACVQIEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDINDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTI
        LP+CVQIEQ YRGLDRSS+MMLNT AN SLLEKSVNEIVD+LNKMTDINDQ E+GRSLPKKQVS G+FELDT+ASMQAQMAAMNQ LKQLTM KETKT  
Subjt:  LPACVQIEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDINDQAEIGRSLPKKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTI

Query:  STIPEPSPVLQISDISCVYCGDNHLYENCPANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYK
        S IPE SP+LQISDISCVYC                   GQGAQRNFNPYSNTYN GWRHHPNFSW NQGVASSS QAPAQQYK
Subjt:  STIPEPSPVLQISDISCVYCGDNHLYENCPANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNFSWRNQGVASSSVQAPAQQYK

SwissProt top hits (hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (hit, e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAATAGAAATGCAAAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATATGGCAGGTGAAGGAGCAGCAAACCGAGTAGGAGAAATTTCTAATCCG
ATCCTTCTAATAGATAATCGAGATGTAACCATGCGGAATTATGTCACTCATGCATTCCACAACTTAAATTCAGGGATAAATAATCCTTTACCCCAAGCCGCATCG
TTCAAGCTCAAGCCAGTCATGTTCCAGATGGTTTGTGCAAGGACTTGGCTAAACGCGTTAGAACCAAATTCTATCAACACATGGGCGGAACTGACGGAGAAATTT
TTGGCAAAGTACCATACTTTGACTAGGAACGCAGACCTTCGAGAGGACATTGTGTCTTTTAGACAGAAGGAGAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTG
AAGGAGTTACTTAGAAGATGCCCGAGCCATGGATTGCCCGCATGTGTGCAGATTGAACAACTCTATAGAGGATTGGATCGTTCATCAAGGATGATGTTGAACACC
GCAGCCAATTGCTCGTTGTTAGAGAAGTCGGTAAATGAGATCGTTGATATCTTGAATAAGATGACAGACATTAATGACCAAGCTGAAATAGGAAGGTCATTACCA
AAGAAGCAAGTATCAGCTGGACTCTTTGAGTTAGACACAATAGCTTCAATGCAAGCCCAAATGGCAGCTATGAACCAAACGTTAAAGCAGTTGACAATGGTAAAG
GAAACCAAAACCACAATTTCGACGATACCTGAACCCTCTCCTGTTTTACAAATTTCAGATATATCTTGTGTCTATTGTGGTGATAACCACTTGTATGAGAACTGT
CCAGCTAATCCAGTGTCTATTTTCTATGTAGGTCAAGGTGCCCAACGGAATTTCAACCCATATTCAAACACTTACAACCATGGATGGAGGCACCATCCAAACTTT
TCCTGGAGAAACCAAGGAGTAGCTAGTAGCAGTGTACAAGCACCCGCTCAACAATACAAGCAAAACTACACTCCTCCTGGTTTTCCAACTCAATCGGCGTCGCAG
CCTCAACAATACAATCAGCAAAGAGGTCAAAATACTACTCAGCAAAGTGGTAGCAACGCAAGTTTGGAGGCCATGATGAAAGAGTCCATGACAAGAAATGATGCT
GCGATAAGAAGCTTGGAGATGCAAGAGGGGCAGATTGCAAATGACCAGAAATTTAGACCCCAAGGTACATTGCCTGGACACACAGAAAACCCGAAGCAAGATCGT
GAGGGAAAGGAGCATTGTAAGGTGGTTATCACGAGAAGCGGATTAAGTTATGAAGGACCCTCACTTCCAGACAAAGGAACTGATGTAGTTACACCTGTTCCTGCA
TCCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACCTGTAAGTTCAGAAGAAAAAGGTAAGAAGGCGGATAAAGGTAAGCAAGTAGTGCCCAACACTACTCCA
CAGGATCAAGATCCGACCCTCGTCAAAATTGCTCCAGGCAAATCTCAAAGAATCTCAAAGTCTCTTTCATCTCTCTCTTTTTTGGCACTTTTAGAGTTTATTGGA
GGGATTTTCGTTGTTGATCGTGAAGCAAACTCAAGAACCATGGAAGGTTCATCTTCCTCCAAGCCGCATGACAAAGAGAAAGAAAAGAAGAGAGTATTGTTGCCT
CCACCAACCAAACCGGATATGATTCCTCTTGAACCTCCTAGGATTTCTCATGAAAAGTTAGTTTTTGATCCTAGGGAACAAAGCAGAAAATATGAGGAAGCTATA
AGAATGAACCCTAGGAGAAATCTATCCATAGGTGGTACAAATTCTGAAAAAATCAATATGGAATCTCATGATGCTAGGGTTAATAAAGAAGGTCATAATGAAAGG
AAATTAGGAGGTGTTAATAAAGTCTATCTTCGAAAAAATCAATCTTTAGAGGAAAAAGGTGTTGTTTTAGATGAAGAAATAGCTAGACTTCAAGAGAGGGCGGAG
ATGTTCAGTAAAAATAACGAAATTAGGGACAAAGAGAATGAGAGGGTTTATGCGAAAATTGAGGAATTAAACATAAAATGGCAAGAATTCATGGAAAACTCAAAG
AAAGTGAGTGAGAAGATTCAACTTGAGTTAAGTAGCATGAGTATACGTCGTAGGATGAATCTTTCTCAAAATAACCCCGTTTCCGAGTCTTTAGAACTATCTATC
CTTCCCCCTCTTTCCACTACTGTTGCTGTGCATGTTGAAGGTCAAGAACAGGTTAGTGGAGACTCAGAACACGACATGGAGCCTTTGGAGCATTTAGATTCGGCC
ACGGTCGAAATTCAATGCCAAATTGCGCCTAGCGCAATTATGGATGAGACTCCACCGGCCACTCTACAAGGTATTTTGTCTCCATCTTTTCCAGATCCTATCTTG
ACTAACAAGCCTCTAGTTTTTTATGATTCAGAACAGGAAAGGACAACGTCGAAAATTGCCAAAATTTTGGTGGCCTTGAATGAAGCAAGGGGAGAGGATCCATTG
GAGGATGATGGAAACATGCATGGAGATGAGTTTGAGGACGACGAAGACAATGACGATATCTCTCAATATGAAGTGAGAGTACGAACTCCGGTGCATGAATCTCAG
CAAGTTGATGAGGAGCCCACTACAAAAGAGCAAGAAGGAACATCTGGTCCTGTGGATGTCCTTAGTGAGGCCATAGAGGAATCATCTTCCTTTTCTTCACAAGGT
AAGACCCCTTCTTTGTCGAGTTTGAATGTTTCTGACCCAAACTTTGTTTCTACTGCAGAGACTTCAGATGAGGAGGTGAGTTTGACCAAAGTGGTAAAGAAAACA
CAAAAGAAGAAAAAAGTGCCAGAAATTGGCGCAATTTCTAGGCCTAGGACCAACGCCGCTGTAGCACGTTTAGCTGCCCAAAAAGAAGCCGAGGCTGGTCCATTT
AAAAAAGCCAAGAGGGCTAGGGTGCAAAGAGGGGCAGAAGAGCCACTTGAGAAGGCCAATGAAGAGGAGCCGAATTCTACCGAACAAACACCATCAAAAGTAAAA
GGGGTGAGATTGGAGGTGAGGAGGCCTACCTTCACAACACGTGATATCCTCCTTGAGAGAGGTTTTGATGAAGCCCAAGAGTCGGTGCCGGAATATGTTAGGAAA
AGGCTTGTGCAGAATGGTTGGGAGGTGTTGTTTGCCCCAACTACACGTGTATCAGAGGCCTTGGTTAAAGAGTTTTACACTGCCATCAACCCAAACCGAGGGGAT
ATAGTGAGAGTACGGGCCCCTTGCATTAATGAGCAAGCGACAGTCTGGATGTATGTGGTGAAGAATCGGTTGATCCCCACTTCTCACGATTCCTCCATTAAGCGC
AATAGGGCGATGATGGTGTACATTCTCATGAAGGTCATTGAGTTCAACTTTGGGGAGCTCATAAGGAACGAGATACGGAGTTGGTCCGAGAAAATGGTATGTAAC
GACCCAGAAATTTCCGCACACCTCTAG
mRNA sequence
ATGAATAGAAATGCAAAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATATGGCAGGTGAAGGAGCAGCAAACCGAGTAGGAGAAATTTCTAATCCG
ATCCTTCTAATAGATAATCGAGATGTAACCATGCGGAATTATGTCACTCATGCATTCCACAACTTAAATTCAGGGATAAATAATCCTTTACCCCAAGCCGCATCG
TTCAAGCTCAAGCCAGTCATGTTCCAGATGGTTTGTGCAAGGACTTGGCTAAACGCGTTAGAACCAAATTCTATCAACACATGGGCGGAACTGACGGAGAAATTT
TTGGCAAAGTACCATACTTTGACTAGGAACGCAGACCTTCGAGAGGACATTGTGTCTTTTAGACAGAAGGAGAACGAAGCAGTTCAAGAAGCTTGGGAGCGTTTG
AAGGAGTTACTTAGAAGATGCCCGAGCCATGGATTGCCCGCATGTGTGCAGATTGAACAACTCTATAGAGGATTGGATCGTTCATCAAGGATGATGTTGAACACC
GCAGCCAATTGCTCGTTGTTAGAGAAGTCGGTAAATGAGATCGTTGATATCTTGAATAAGATGACAGACATTAATGACCAAGCTGAAATAGGAAGGTCATTACCA
AAGAAGCAAGTATCAGCTGGACTCTTTGAGTTAGACACAATAGCTTCAATGCAAGCCCAAATGGCAGCTATGAACCAAACGTTAAAGCAGTTGACAATGGTAAAG
GAAACCAAAACCACAATTTCGACGATACCTGAACCCTCTCCTGTTTTACAAATTTCAGATATATCTTGTGTCTATTGTGGTGATAACCACTTGTATGAGAACTGT
CCAGCTAATCCAGTGTCTATTTTCTATGTAGGTCAAGGTGCCCAACGGAATTTCAACCCATATTCAAACACTTACAACCATGGATGGAGGCACCATCCAAACTTT
TCCTGGAGAAACCAAGGAGTAGCTAGTAGCAGTGTACAAGCACCCGCTCAACAATACAAGCAAAACTACACTCCTCCTGGTTTTCCAACTCAATCGGCGTCGCAG
CCTCAACAATACAATCAGCAAAGAGGTCAAAATACTACTCAGCAAAGTGGTAGCAACGCAAGTTTGGAGGCCATGATGAAAGAGTCCATGACAAGAAATGATGCT
GCGATAAGAAGCTTGGAGATGCAAGAGGGGCAGATTGCAAATGACCAGAAATTTAGACCCCAAGGTACATTGCCTGGACACACAGAAAACCCGAAGCAAGATCGT
GAGGGAAAGGAGCATTGTAAGGTGGTTATCACGAGAAGCGGATTAAGTTATGAAGGACCCTCACTTCCAGACAAAGGAACTGATGTAGTTACACCTGTTCCTGCA
TCCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACCTGTAAGTTCAGAAGAAAAAGGTAAGAAGGCGGATAAAGGTAAGCAAGTAGTGCCCAACACTACTCCA
CAGGATCAAGATCCGACCCTCGTCAAAATTGCTCCAGGCAAATCTCAAAGAATCTCAAAGTCTCTTTCATCTCTCTCTTTTTTGGCACTTTTAGAGTTTATTGGA
GGGATTTTCGTTGTTGATCGTGAAGCAAACTCAAGAACCATGGAAGGTTCATCTTCCTCCAAGCCGCATGACAAAGAGAAAGAAAAGAAGAGAGTATTGTTGCCT
CCACCAACCAAACCGGATATGATTCCTCTTGAACCTCCTAGGATTTCTCATGAAAAGTTAGTTTTTGATCCTAGGGAACAAAGCAGAAAATATGAGGAAGCTATA
AGAATGAACCCTAGGAGAAATCTATCCATAGGTGGTACAAATTCTGAAAAAATCAATATGGAATCTCATGATGCTAGGGTTAATAAAGAAGGTCATAATGAAAGG
AAATTAGGAGGTGTTAATAAAGTCTATCTTCGAAAAAATCAATCTTTAGAGGAAAAAGGTGTTGTTTTAGATGAAGAAATAGCTAGACTTCAAGAGAGGGCGGAG
ATGTTCAGTAAAAATAACGAAATTAGGGACAAAGAGAATGAGAGGGTTTATGCGAAAATTGAGGAATTAAACATAAAATGGCAAGAATTCATGGAAAACTCAAAG
AAAGTGAGTGAGAAGATTCAACTTGAGTTAAGTAGCATGAGTATACGTCGTAGGATGAATCTTTCTCAAAATAACCCCGTTTCCGAGTCTTTAGAACTATCTATC
CTTCCCCCTCTTTCCACTACTGTTGCTGTGCATGTTGAAGGTCAAGAACAGGTTAGTGGAGACTCAGAACACGACATGGAGCCTTTGGAGCATTTAGATTCGGCC
ACGGTCGAAATTCAATGCCAAATTGCGCCTAGCGCAATTATGGATGAGACTCCACCGGCCACTCTACAAGGTATTTTGTCTCCATCTTTTCCAGATCCTATCTTG
ACTAACAAGCCTCTAGTTTTTTATGATTCAGAACAGGAAAGGACAACGTCGAAAATTGCCAAAATTTTGGTGGCCTTGAATGAAGCAAGGGGAGAGGATCCATTG
GAGGATGATGGAAACATGCATGGAGATGAGTTTGAGGACGACGAAGACAATGACGATATCTCTCAATATGAAGTGAGAGTACGAACTCCGGTGCATGAATCTCAG
CAAGTTGATGAGGAGCCCACTACAAAAGAGCAAGAAGGAACATCTGGTCCTGTGGATGTCCTTAGTGAGGCCATAGAGGAATCATCTTCCTTTTCTTCACAAGGT
AAGACCCCTTCTTTGTCGAGTTTGAATGTTTCTGACCCAAACTTTGTTTCTACTGCAGAGACTTCAGATGAGGAGGTGAGTTTGACCAAAGTGGTAAAGAAAACA
CAAAAGAAGAAAAAAGTGCCAGAAATTGGCGCAATTTCTAGGCCTAGGACCAACGCCGCTGTAGCACGTTTAGCTGCCCAAAAAGAAGCCGAGGCTGGTCCATTT
AAAAAAGCCAAGAGGGCTAGGGTGCAAAGAGGGGCAGAAGAGCCACTTGAGAAGGCCAATGAAGAGGAGCCGAATTCTACCGAACAAACACCATCAAAAGTAAAA
GGGGTGAGATTGGAGGTGAGGAGGCCTACCTTCACAACACGTGATATCCTCCTTGAGAGAGGTTTTGATGAAGCCCAAGAGTCGGTGCCGGAATATGTTAGGAAA
AGGCTTGTGCAGAATGGTTGGGAGGTGTTGTTTGCCCCAACTACACGTGTATCAGAGGCCTTGGTTAAAGAGTTTTACACTGCCATCAACCCAAACCGAGGGGAT
ATAGTGAGAGTACGGGCCCCTTGCATTAATGAGCAAGCGACAGTCTGGATGTATGTGGTGAAGAATCGGTTGATCCCCACTTCTCACGATTCCTCCATTAAGCGC
AATAGGGCGATGATGGTGTACATTCTCATGAAGGTCATTGAGTTCAACTTTGGGGAGCTCATAAGGAACGAGATACGGAGTTGGTCCGAGAAAATGGTATGTAAC
GACCCAGAAATTTCCGCACACCTCTAG
Protein sequence
MNRNAKDPPPPQNPPVNGDMAGEGAANRVGEISNPILLIDNRDVTMRNYVTHAFHNLNSGINNPLPQAASFKLKPVMFQMVCARTWLNALEPNSINTWAELTEKF
LAKYHTLTRNADLREDIVSFRQKENEAVQEAWERLKELLRRCPSHGLPACVQIEQLYRGLDRSSRMMLNTAANCSLLEKSVNEIVDILNKMTDINDQAEIGRSLP
KKQVSAGLFELDTIASMQAQMAAMNQTLKQLTMVKETKTTISTIPEPSPVLQISDISCVYCGDNHLYENCPANPVSIFYVGQGAQRNFNPYSNTYNHGWRHHPNF
SWRNQGVASSSVQAPAQQYKQNYTPPGFPTQSASQPQQYNQQRGQNTTQQSGSNASLEAMMKESMTRNDAAIRSLEMQEGQIANDQKFRPQGTLPGHTENPKQDR
EGKEHCKVVITRSGLSYEGPSLPDKGTDVVTPVPASTSNPQQEEKAEPVSSEEKGKKADKGKQVVPNTTPQDQDPTLVKIAPGKSQRISKSLSSLSFLALLEFIG
GIFVVDREANSRTMEGSSSSKPHDKEKEKKRVLLPPPTKPDMIPLEPPRISHEKLVFDPREQSRKYEEAIRMNPRRNLSIGGTNSEKINMESHDARVNKEGHNER
KLGGVNKVYLRKNQSLEEKGVVLDEEIARLQERAEMFSKNNEIRDKENERVYAKIEELNIKWQEFMENSKKVSEKIQLELSSMSIRRRMNLSQNNPVSESLELSI
LPPLSTTVAVHVEGQEQVSGDSEHDMEPLEHLDSATVEIQCQIAPSAIMDETPPATLQGILSPSFPDPILTNKPLVFYDSEQERTTSKIAKILVALNEARGEDPL
EDDGNMHGDEFEDDEDNDDISQYEVRVRTPVHESQQVDEEPTTKEQEGTSGPVDVLSEAIEESSSFSSQGKTPSLSSLNVSDPNFVSTAETSDEEVSLTKVVKKT
QKKKKVPEIGAISRPRTNAAVARLAAQKEAEAGPFKKAKRARVQRGAEEPLEKANEEEPNSTEQTPSKVKGVRLEVRRPTFTTRDILLERGFDEAQESVPEYVRK
RLVQNGWEVLFAPTTRVSEALVKEFYTAINPNRGDIVRVRAPCINEQATVWMYVVKNRLIPTSHDSSIKRNRAMMVYILMKVIEFNFGELIRNEIRSWSEKMVCN
DPEISAHL
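One way to sanity-check that the CDS and protein listings agree is to translate the CDS and compare the result with the protein. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. `translate` is a hypothetical helper, and only the first 36 bases and the final line of the CDS are used to keep the example short.

```python
# Standard genetic code (assumed), laid out in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGAATAGAAATGCAAAAGATCCTCCACCGCCACAA"  # first 36 bases of the CDS
cds_end = "GACCCAGAAATTTCCGCACACCTCTAG"             # final line of the CDS

print(translate(cds_start))  # start of the protein
print(translate(cds_end))    # end of the protein plus the stop codon
```

The translated start (`MNRNAKDPPPPQ`) and end (`DPEISAHL` followed by a stop) match the first and last residues of the protein sequence listed above.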