; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007217 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007217
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr10:2681816..2685415
RNA-Seq ExpressionHG10007217
SyntenyHG10007217
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589981.1 hypothetical protein SDJN03_15404, partial [Cucurbita argyrosperma subsp. sororia]3.3e-24481.82Show/hide
Query:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP
        M+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQAR DPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSCSAP
Subjt:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP

Query:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV
        C SSKTIALLSPVV EVYKLI DMLGKDL SKR KKAMREVKSLVE +LGFINLSSCKDSD+NGESLDFNL TPFVDLISIW + NEGLDQFLPLVSSEV
Subjt:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV

Query:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA
        RGEF SGVCD+RRLAGVVIAETFLMKLCLD NSGRSRQDLE DLRIWAVGSITR+KNFYFF               E LVRFLLEATLPV SLLSTEDEA
Subjt:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA

Query:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL
        LLRK+LYDALILVDYSFL  EKAINLPA HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPSNENVN+P GSSPKIFL
Subjt:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL

Query:  EWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAK
        EWLLKAE+ GVRVFD+TISN RAKLVLDTSKSVS     EG+ VDD+LLFYIDKQGENENGS EED  MDESVNAALV+ A TMSTT++GS KK++ +  
Subjt:  EWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAK

Query:  RKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        +K K IKF KYDLVPNSD T+LRSAV++NDTDSEGEVHNPHSDEDSD K+
Subjt:  RKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

KAG7023645.1 hypothetical protein SDJN02_14671, partial [Cucurbita argyrosperma subsp. argyrosperma]3.5e-24681.92Show/hide
Query:  VESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSC
        VESM+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQAR DPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSC
Subjt:  VESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSC

Query:  SAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVS
        SAPC SSKTIALLSPVV EVYKLI DMLGKDL SKR KKAMREVKSLVE +LGFINLSSCKDSD+NGESLDFNL TPFVDLISIW + NEGLDQFLPLVS
Subjt:  SAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVS

Query:  SEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTE
        SEVRGEF SGVCD+RRLAGVVIAETFLMKLCLD NSGRSRQDLE DLRIWAVGSITR+KNFYFF               E LVRFLLEATLPV SLLSTE
Subjt:  SEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTE

Query:  DEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPK
        DEALLRK+LYDALILVDYSFL  EKAINLPA HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPSNEN N+P GSSPK
Subjt:  DEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPK

Query:  IFLEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTMSTTESGSGKKRQG
        IFLEWLLKAE+ GVRVFD+TISN RAKLVLDTSKSVS     EG+ VDD+LLFYIDKQGENENGS EED  MDESVNAALV+ A TMSTT++GSGKK++ 
Subjt:  IFLEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTMSTTESGSGKKRQG

Query:  KAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        +  +K K IKF+KYDLVPNSD T+LRSAV++NDTDSEGEVHNPHSDEDSD K+
Subjt:  KAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

XP_022960841.1 uncharacterized protein LOC111461526 [Cucurbita moschata]2.8e-24381.64Show/hide
Query:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP
        M+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQAR DPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSCSAP
Subjt:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP

Query:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV
        C SSKTIALLSPVV EVYKLI DMLGKDL SKR KKAMREVKSLVE +LGFINLSSCKDSD+NGESLDFNL TPFVDLISIW + NEGLDQFLPLVSSEV
Subjt:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV

Query:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA
        RGEF SGVCD+RRLAGVVIAETFLMKLCLD NSGRSRQDLE DLRIWAVGSITR+KNFYFF               E LVRFLLEATLPV SLLSTEDEA
Subjt:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA

Query:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL
        LLRK+LYDALILVDYSFL  EKAINLPA HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPSNEN N+P GSSPKIFL
Subjt:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL

Query:  EWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAK
        EWLLKAE+ GVRVFD+TISN RAKLVLDTSKSVS     EG+ VDD+LLFYIDKQGENENGS EED  MDESVNAALV+ A TMSTT++GSGKK++ +  
Subjt:  EWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAK

Query:  RKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        +K K IKF KYDLV NSD T+LRSAV++NDTDSEGEVHNPHSDEDSD K+
Subjt:  RKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

XP_023515389.1 uncharacterized protein LOC111779560 [Cucurbita pepo subsp. pepo]3.1e-24281.24Show/hide
Query:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP
        M+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQARADPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSCSAP
Subjt:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP

Query:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV
        C SSKTIALLSPVV EVYKLI DMLGKDL SKR KKAMREVKSLVE ILGFINLSSCKDSD+NGESLDFNL TPFVDLISIWT+ NEGLDQFLPLVS+EV
Subjt:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV

Query:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA
        RGEF SGVCD+RRLAGVVIAETFLMKLCLD NS RSR DLE DLRIWAVGSITR+KNFYFF               E L RFLLEATLPV SLLS EDEA
Subjt:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA

Query:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL
        LLRK+LYDALILVDYSFL  EKAINLP  HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPS+ENVN+P GSSPKIFL
Subjt:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL

Query:  EWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAKR
        EWLLKAE+ GVRVFD+TISN RAKLVLD SKSVS     EG  VDD+LLFYIDKQGENENGSEED  MDESVNAALV+ A TMSTT++GSGKK+  +  +
Subjt:  EWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAKR

Query:  KNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        K K IKF KYDLVPNSDAT+L SAVD+NDTDS+GEVHNPHSDEDSD K+
Subjt:  KNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

XP_038880003.1 uncharacterized protein LOC120071696 [Benincasa hispida]2.1e-25986.28Show/hide
Query:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ
        MALALVESMDSMNPLK+NPFLGENYEFTLAQSIQNV+ EIRKGN  FSQFTEGFYELIQARADPPLESIWFYSALTFRSR  NI GDFLERVAAMK+LFQ
Subjt:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ

Query:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKN-GESLDFNLTTPFVDLISIWTHPNEGLDQ
        LV SCSAPC SSKTI LLSPVVSEVYKLIVDMLGKDL SKR KKAMREVKSLVEAILGFINLSSCKDSDKN  ESLDFNL TPFVDLIS+WTHPNEGLDQ
Subjt:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKN-GESLDFNLTTPFVDLISIWTHPNEGLDQ

Query:  FLPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVT
        FLPLVSSEVRGEF SGVCDVRRLAGVVIAETFLMKLCLDFN+G SRQDLEKDLRIW VGSITR++NFYFF               E LVRFLLEATLPVT
Subjt:  FLPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVT

Query:  SLLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYP
        SLLSTEDEALLRKVLYD+LILV+YSFLK EKAI+LPA+HV+ LAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSS VSSQIIRWVKSQ+PSNENV  P
Subjt:  SLLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYP

Query:  NGSSPKIFLEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQ
        NGSSPKI LEWLL+AE+QGVRVFD TISN  AKLVLDTSKSVSLEGDKVDDDLLFYIDKQGE+ENGS EDT MDESVNAALV+VARTMSTTE+GSGKKRQ
Subjt:  NGSSPKIFLEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQ

Query:  GKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
           KRKN+ IKFVKYDLVP+SD TQ RS  DNNDTDSEG+VHNPHSD+DSD+K+
Subjt:  GKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

TrEMBL top hitse value%identityAlignment
A0A0A0M1W0 Uncharacterized protein2.3e-23572.87Show/hide
Query:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ
        MAL LVESM+S+NPLK+NPFLGENYEFTLAQSIQNVL EIRKGN  FSQFT+ FY+LIQARADPPLESIWFYSAL FRS SFN  GDFLERVAAMK+LFQ
Subjt:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ

Query:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF
        LVCSCSAPC SSKTI LLSPVVSEVYKL++DM GKDL S R KKAMREVKSLVEAILGF+NLSS +DSDKN +SLDF+L TPF+DLISIWT PNEGLDQF
Subjt:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF

Query:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS
        LPLV SEVR EF SG CDVRRLAGVVIAE FLMKLCLDFN GRSRQDLEKDL  WAVGSIT+++NFY F               E LVR LLEATLPVTS
Subjt:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS

Query:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN
        LLST++EALLRKVLYDALILVDYSFLK E AINLPA+HV+FLAVKRLILT+EAIEFYREHGDQ+RAISYLNAFSSSLVSSQIIRW+KSQ+PSNEN+N PN
Subjt:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN

Query:  GSSPKIFLEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKK---
        G SPK+FLEWLLKAE+QGVRVFDNTISN R+KLVLDTSKSVS EGDKVDDDLLFYIDKQG N NGSEEDT MDESVNAAL + A TMSTTE+ S KK   
Subjt:  GSSPKIFLEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKK---

Query:  ----------------------------RQG----------------------------------------KAKRKNKNIKFVKYDLVPNSDATQLRSAV
                                    +QG                                        KAKRKNK  K VKYDLVPN+DATQL+SAV
Subjt:  ----------------------------RQG----------------------------------------KAKRKNKNIKFVKYDLVPNSDATQLRSAV

Query:  DNNDTDSEGEVHNPHSDEDSDMK
        +NNDT SEGEVHNPHSD+DSDMK
Subjt:  DNNDTDSEGEVHNPHSDEDSDMK

A0A5D3CFU4 Pentatricopeptide repeat-containing protein1.1e-22979.33Show/hide
Query:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ
        MAL LVESM+S+NPLK+N FLGENYEFTLAQSIQNVL EIRKGN  FS+FTEGFY+LIQARADPPLESIWFYSALTFRS SFN  GDFLERVAAMK+LFQ
Subjt:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ

Query:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF
        LVCSCSAPC SSKTI LLSPVVSEVYKL++DM GKDL SKR KKAMREVKSLVEAILG  NLSSC+DS+KN +SLDFN  TPFVDLISIWTHPNEGLDQF
Subjt:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF

Query:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS
        LPLV SEVR EF SG CDVRRLAGVVIAETFL+KLCLDFN G SRQ LE+DLR W VGSITR++NFYFF               E LVR LLEATLPVTS
Subjt:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS

Query:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN
        LLST+DEALLRKVL DALILVDYSFLK EKAINLPA+H +FLAVKRLILT+EA EFYR+HGDQ+RAISYLNAFSSSLVSSQIIRWVKSQ+PSNEN+N+ N
Subjt:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN

Query:  GSSPKIFLEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQG
        GSSPK+FLEWLLKAE+QGVRVFDNTISN RAK+VLDTSKSV  EGDKVDDDLLFYIDKQGENENG EED  MD+SVNAALV+VA TMSTTE+ S KKR  
Subjt:  GSSPKIFLEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQG

Query:  KAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSE
        KAK++NK           N+D +QL+SAV+NNDT+ +
Subjt:  KAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSE

A0A6J1D1X1 uncharacterized protein LOC1110165271.5e-23178.14Show/hide
Query:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ
        MALALVESMDSMNP  +NPFLGENYE TL QSI+NVL EIR+GN  F  FTE FY+L+QAR DPP+ESIWFYSAL FRS S +  GDFL+R+AAMK+LFQ
Subjt:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ

Query:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF
        LVCSCSAPC SSKT+A L+PVV EVYKLI DMLGKDL SKR KKAMREVK+LVEAILGFINLSSCK SD+N E LDFNL TPF+DLISIWTHPNEGLDQF
Subjt:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF

Query:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS
        LPLVSSEVRG F SGVCDVR LAGVVIAE FLMKLCLDF+SGRSRQ+LEKDLR+WAVGSIT ++N Y F               E L+RFLL  TLPV S
Subjt:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS

Query:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN
        LLSTEDE LLRKVLYDALILVDYSFL   KAI+L A+HV+FLAVKRLILTH+AIEF+REHGDQSRAISYLNAFSSS V SQ+IRWV+SQIPSNENVN PN
Subjt:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN

Query:  GSSPKIFLEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSG-
        GSSPKI LEWL KAE+QGVRVFDNTIS+ RAKLVLD SKS S    LEG+KVDD LLFY+DKQGE EN SEED AMDESVNAALV VARTMS  E+GSG 
Subjt:  GSSPKIFLEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSG-

Query:  KKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        KKRQ K++RKNK IKFVKYDL PN DA QLRSAVDNND +SEGEVHNPH DEDSDM++
Subjt:  KKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

A0A6J1H8Q6 uncharacterized protein LOC1114615261.3e-24381.64Show/hide
Query:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP
        M+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQAR DPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSCSAP
Subjt:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP

Query:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV
        C SSKTIALLSPVV EVYKLI DMLGKDL SKR KKAMREVKSLVE +LGFINLSSCKDSD+NGESLDFNL TPFVDLISIW + NEGLDQFLPLVSSEV
Subjt:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV

Query:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA
        RGEF SGVCD+RRLAGVVIAETFLMKLCLD NSGRSRQDLE DLRIWAVGSITR+KNFYFF               E LVRFLLEATLPV SLLSTEDEA
Subjt:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA

Query:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL
        LLRK+LYDALILVDYSFL  EKAINLPA HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPSNEN N+P GSSPKIFL
Subjt:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL

Query:  EWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAK
        EWLLKAE+ GVRVFD+TISN RAKLVLDTSKSVS     EG+ VDD+LLFYIDKQGENENGS EED  MDESVNAALV+ A TMSTT++GSGKK++ +  
Subjt:  EWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAK

Query:  RKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        +K K IKF KYDLV NSD T+LRSAV++NDTDSEGEVHNPHSDEDSD K+
Subjt:  RKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

A0A6J1JEZ4 uncharacterized protein LOC1114851551.5e-24281.27Show/hide
Query:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP
        M+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQARADPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSCSAP
Subjt:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP

Query:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV
        C SSKTIALL+PVV EVYKLI DMLGKDL SKR KKAMREVKSLVE ILGFINLSSCKDSD+NGESLDFNL TPFVDLISIWT+ NEGLDQFLPLVSSEV
Subjt:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV

Query:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA
        RGEF SGVCD+RRLAGVVIAETFL+KLCLD NSGRSRQDLE DLRIWAVGSITR+KNFYFF               E LVRFLLEATLPV SLLSTEDEA
Subjt:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA

Query:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL
        LLRK+LYDALILVDYSFL  EKAINLPA HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPS+ENVN+P GSSPKIFL
Subjt:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL

Query:  EWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAK
        EWL KAE+ GVRVFD+TISN RAKLVLDTSKSVS     EG+ VDD+LLFYIDKQGENENGS EED  MDE+VNAALV+ A TMSTT++G  KK++ +  
Subjt:  EWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAK

Query:  RKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        +K K IKF KYDLVPNSDAT+LRSAVD+NDTDS+ EVHNPH DEDSDMK+
Subjt:  RKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G11780.1 unknown protein5.9e-4228.34Show/hide
Query:  LAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARAD-PPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAPCCSSKTIALLSPVVSEVYK
        L  SI+ +L++ R G  +FS F   F  ++    + PPLE +WFYSA+ F S       D    V      FQL+ S S      K ++LLSPVV ++ +
Subjt:  LAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARAD-PPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAPCCSSKTIALLSPVVSEVYK

Query:  LIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWT--------HPNEGLDQFLPLVSSEVRGEFGSGVCDV
        L++        S+R     R+  SL+E I+ +I++    +     + +       F DL  +W            + L+ F+P  S  +R E  S  C V
Subjt:  LIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWT--------HPNEGLDQFLPLVSSEVRGEFGSGVCDV

Query:  RRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLST--EDEALLRKVLYDA
          LAG+V ++ FL+ LC  F+    R +L+KDL+   +  I+   + +FF               + +++ LLE  L +TSL+    EDEA L +++ +A
Subjt:  RRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLST--EDEALLRKVLYDA

Query:  LI-LVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFLEWLLKAEN
        +I  V+  FL      +  + H+  +A+  L L  + +   R + DQ +   Y N FS+SL+   +I WV SQ     + +     +P  F+EWL+  E 
Subjt:  LI-LVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFLEWLLKAEN

Query:  QGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAK
        QG RVF+   S   AK V+  S+         D  +   + KQ E E   + D A +++V++  +    T    E    K+ + K K
Subjt:  QGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTGGCTCTTGTTGAATCGATGGATTCTATGAACCCTTTAAAGAGGAATCCTTTTCTCGGAGAAAATTATGAGTTTACTCTTGCGCAATCAATCCAGAATGTTTT
AGTTGAAATTCGCAAAGGAAATTTTAGTTTTTCTCAATTTACGGAAGGATTCTACGAGTTGATTCAAGCTAGAGCTGACCCACCATTGGAATCGATATGGTTCTACTCCG
CATTAACGTTTCGTAGCCGTAGCTTCAATATTAATGGCGACTTTTTGGAACGAGTGGCAGCCATGAAAATCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTTGT
TCTTCGAAGACCATTGCGTTGCTCTCTCCAGTGGTTTCCGAGGTGTATAAATTGATTGTCGACATGCTTGGAAAGGATTTGGGCTCGAAAAGGGGAAAGAAAGCAATGAG
AGAGGTTAAATCTTTAGTTGAAGCGATTCTTGGCTTTATAAATCTTAGTTCATGTAAGGATTCGGACAAGAATGGTGAATCTCTTGACTTCAATTTGACTACTCCTTTTG
TGGATTTAATTAGTATTTGGACGCACCCAAATGAGGGATTGGATCAGTTCTTACCGCTCGTTAGCAGTGAGGTTCGTGGGGAGTTTGGTTCAGGCGTCTGTGATGTTCGT
CGCTTGGCTGGAGTTGTAATCGCCGAGACATTTCTGATGAAACTGTGCTTGGATTTTAACAGTGGGCGTTCGAGGCAAGATTTGGAGAAAGATTTAAGGATATGGGCTGT
TGGTTCAATAACTCGGATGAAGAACTTCTACTTTTTTGAATTCTGTAACGTGCTCACATCTTGGTCTATCGTTGTATCAACAGAAGCTCTTGTAAGATTCCTGCTGGAGG
CGACTTTACCTGTGACATCTCTGTTGAGTACTGAAGATGAAGCTTTGTTAAGGAAGGTTCTATATGATGCTCTTATACTGGTTGATTATTCATTTTTGAAATCTGAGAAA
GCCATTAACTTACCTGCCAAACATGTGTCGTTTCTCGCTGTTAAGAGATTGATTCTTACTCATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAAAGCAGAGCCAT
CTCTTATCTAAATGCCTTCTCAAGTTCTCTGGTTTCTTCTCAAATTATTAGATGGGTCAAAAGCCAAATTCCTAGCAATGAAAATGTAAATTATCCCAACGGGTCGTCGC
CTAAAATATTTCTTGAGTGGCTTCTCAAGGCTGAAAATCAAGGTGTAAGAGTATTCGACAATACCATTTCCAATCTTCGAGCCAAATTAGTTCTTGATACTTCCAAATCA
GTGTCACTGGAAGGAGATAAAGTAGATGATGATCTTTTGTTTTACATTGACAAGCAAGGGGAAAATGAAAATGGAAGTGAGGAGGACACAGCAATGGATGAATCAGTAAA
TGCAGCTCTTGTAGCTGTGGCTCGTACAATGTCAACGACTGAGAGTGGTTCAGGAAAGAAACGACAGGGAAAGGCAAAAAGAAAGAATAAAAATATCAAGTTTGTTAAGT
ACGATCTCGTTCCGAACTCTGATGCTACCCAATTGAGGTCAGCAGTTGATAATAATGATACAGACAGCGAGGGCGAAGTTCATAATCCACACTCCGACGAAGATTCTGAC
ATGAAAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTGGCTCTTGTTGAATCGATGGATTCTATGAACCCTTTAAAGAGGAATCCTTTTCTCGGAGAAAATTATGAGTTTACTCTTGCGCAATCAATCCAGAATGTTTT
AGTTGAAATTCGCAAAGGAAATTTTAGTTTTTCTCAATTTACGGAAGGATTCTACGAGTTGATTCAAGCTAGAGCTGACCCACCATTGGAATCGATATGGTTCTACTCCG
CATTAACGTTTCGTAGCCGTAGCTTCAATATTAATGGCGACTTTTTGGAACGAGTGGCAGCCATGAAAATCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTTGT
TCTTCGAAGACCATTGCGTTGCTCTCTCCAGTGGTTTCCGAGGTGTATAAATTGATTGTCGACATGCTTGGAAAGGATTTGGGCTCGAAAAGGGGAAAGAAAGCAATGAG
AGAGGTTAAATCTTTAGTTGAAGCGATTCTTGGCTTTATAAATCTTAGTTCATGTAAGGATTCGGACAAGAATGGTGAATCTCTTGACTTCAATTTGACTACTCCTTTTG
TGGATTTAATTAGTATTTGGACGCACCCAAATGAGGGATTGGATCAGTTCTTACCGCTCGTTAGCAGTGAGGTTCGTGGGGAGTTTGGTTCAGGCGTCTGTGATGTTCGT
CGCTTGGCTGGAGTTGTAATCGCCGAGACATTTCTGATGAAACTGTGCTTGGATTTTAACAGTGGGCGTTCGAGGCAAGATTTGGAGAAAGATTTAAGGATATGGGCTGT
TGGTTCAATAACTCGGATGAAGAACTTCTACTTTTTTGAATTCTGTAACGTGCTCACATCTTGGTCTATCGTTGTATCAACAGAAGCTCTTGTAAGATTCCTGCTGGAGG
CGACTTTACCTGTGACATCTCTGTTGAGTACTGAAGATGAAGCTTTGTTAAGGAAGGTTCTATATGATGCTCTTATACTGGTTGATTATTCATTTTTGAAATCTGAGAAA
GCCATTAACTTACCTGCCAAACATGTGTCGTTTCTCGCTGTTAAGAGATTGATTCTTACTCATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAAAGCAGAGCCAT
CTCTTATCTAAATGCCTTCTCAAGTTCTCTGGTTTCTTCTCAAATTATTAGATGGGTCAAAAGCCAAATTCCTAGCAATGAAAATGTAAATTATCCCAACGGGTCGTCGC
CTAAAATATTTCTTGAGTGGCTTCTCAAGGCTGAAAATCAAGGTGTAAGAGTATTCGACAATACCATTTCCAATCTTCGAGCCAAATTAGTTCTTGATACTTCCAAATCA
GTGTCACTGGAAGGAGATAAAGTAGATGATGATCTTTTGTTTTACATTGACAAGCAAGGGGAAAATGAAAATGGAAGTGAGGAGGACACAGCAATGGATGAATCAGTAAA
TGCAGCTCTTGTAGCTGTGGCTCGTACAATGTCAACGACTGAGAGTGGTTCAGGAAAGAAACGACAGGGAAAGGCAAAAAGAAAGAATAAAAATATCAAGTTTGTTAAGT
ACGATCTCGTTCCGAACTCTGATGCTACCCAATTGAGGTCAGCAGTTGATAATAATGATACAGACAGCGAGGGCGAAGTTCATAATCCACACTCCGACGAAGATTCTGAC
ATGAAAGATTAA
Protein sequenceShow/hide protein sequence
MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAPCC
SSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEVRGEFGSGVCDVR
RLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEALLRKVLYDALILVDYSFLKSEK
AINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFLEWLLKAENQGVRVFDNTISNLRAKLVLDTSKS
VSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSD
MKD