; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G002600 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G002600
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr06:2939860..2944743
RNA-Seq ExpressionLsi06G002600
SyntenyLsi06G002600
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589981.1 hypothetical protein SDJN03_15404, partial [Cucurbita argyrosperma subsp. sororia]4.6e-24179.51Show/hide
Query:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP
        M+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQAR DPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSCSAP
Subjt:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP

Query:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV
        C SSKTIALLSPVV EVYKLI DMLGKDL SKR KKAMREVKSLVE +LGFINLSSCKDSD+NGESLDFNL TPFVDLISIW + NEGLDQFLPLVSSEV
Subjt:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV

Query:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA
        RGEF SGVCD+RRLAGVVIAETFLMKLCLD NSGRSRQDLE DLRIWAVGSITR+KNFYFF               E LVRFLLEATLPV SLLSTEDEA
Subjt:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA

Query:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL
        LLRK+LYDALILVDYSFL  EKAINLPA HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPSNENVN+P GSSPKIFL
Subjt:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL

Query:  GTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTM
                        EWLLKAE+ GVRVFD+TISN RAKLVLDTSKSVS     EG+ VDD+LLFYIDKQGENENGS EED  MDESVNAALV+ A TM
Subjt:  GTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTM

Query:  STTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        STT++GS KK++ +  +K K IKF KYDLVPNSD T+LRSAV++NDTDSEGEVHNPHSDEDSD K+
Subjt:  STTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

KAG7023645.1 hypothetical protein SDJN02_14671, partial [Cucurbita argyrosperma subsp. argyrosperma]6.4e-24379.61Show/hide
Query:  VESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSC
        VESM+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQAR DPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSC
Subjt:  VESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSC

Query:  SAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVS
        SAPC SSKTIALLSPVV EVYKLI DMLGKDL SKR KKAMREVKSLVE +LGFINLSSCKDSD+NGESLDFNL TPFVDLISIW + NEGLDQFLPLVS
Subjt:  SAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVS

Query:  SEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTE
        SEVRGEF SGVCD+RRLAGVVIAETFLMKLCLD NSGRSRQDLE DLRIWAVGSITR+KNFYFF               E LVRFLLEATLPV SLLSTE
Subjt:  SEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTE

Query:  DEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPK
        DEALLRK+LYDALILVDYSFL  EKAINLPA HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPSNEN N+P GSSPK
Subjt:  DEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPK

Query:  IFLGTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVA
        IFL                EWLLKAE+ GVRVFD+TISN RAKLVLDTSKSVS     EG+ VDD+LLFYIDKQGENENGS EED  MDESVNAALV+ A
Subjt:  IFLGTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVA

Query:  RTMSTTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
         TMSTT++GSGKK++ +  +K K IKF+KYDLVPNSD T+LRSAV++NDTDSEGEVHNPHSDEDSD K+
Subjt:  RTMSTTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

XP_022960841.1 uncharacterized protein LOC111461526 [Cucurbita moschata]3.9e-24079.33Show/hide
Query:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP
        M+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQAR DPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSCSAP
Subjt:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP

Query:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV
        C SSKTIALLSPVV EVYKLI DMLGKDL SKR KKAMREVKSLVE +LGFINLSSCKDSD+NGESLDFNL TPFVDLISIW + NEGLDQFLPLVSSEV
Subjt:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV

Query:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA
        RGEF SGVCD+RRLAGVVIAETFLMKLCLD NSGRSRQDLE DLRIWAVGSITR+KNFYFF               E LVRFLLEATLPV SLLSTEDEA
Subjt:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA

Query:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL
        LLRK+LYDALILVDYSFL  EKAINLPA HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPSNEN N+P GSSPKIFL
Subjt:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL

Query:  GTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTM
                        EWLLKAE+ GVRVFD+TISN RAKLVLDTSKSVS     EG+ VDD+LLFYIDKQGENENGS EED  MDESVNAALV+ A TM
Subjt:  GTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTM

Query:  STTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        STT++GSGKK++ +  +K K IKF KYDLV NSD T+LRSAV++NDTDSEGEVHNPHSDEDSD K+
Subjt:  STTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

XP_023515389.1 uncharacterized protein LOC111779560 [Cucurbita pepo subsp. pepo]4.3e-23978.94Show/hide
Query:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP
        M+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQARADPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSCSAP
Subjt:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP

Query:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV
        C SSKTIALLSPVV EVYKLI DMLGKDL SKR KKAMREVKSLVE ILGFINLSSCKDSD+NGESLDFNL TPFVDLISIWT+ NEGLDQFLPLVS+EV
Subjt:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV

Query:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA
        RGEF SGVCD+RRLAGVVIAETFLMKLCLD NS RSR DLE DLRIWAVGSITR+KNFYFF               E L RFLLEATLPV SLLS EDEA
Subjt:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA

Query:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL
        LLRK+LYDALILVDYSFL  EKAINLP  HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPS+ENVN+P GSSPKIFL
Subjt:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL

Query:  GTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMS
                        EWLLKAE+ GVRVFD+TISN RAKLVLD SKSVS     EG  VDD+LLFYIDKQGENENGSEED  MDESVNAALV+ A TMS
Subjt:  GTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMS

Query:  TTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        TT++GSGKK+  +  +K K IKF KYDLVPNSDAT+L SAVD+NDTDS+GEVHNPHSDEDSD K+
Subjt:  TTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

XP_038880003.1 uncharacterized protein LOC120071696 [Benincasa hispida]3.8e-25683.86Show/hide
Query:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ
        MALALVESMDSMNPLK+NPFLGENYEFTLAQSIQNV+ EIRKGN  FSQFTEGFYELIQARADPPLESIWFYSALTFRSR  NI GDFLERVAAMK+LFQ
Subjt:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ

Query:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKN-GESLDFNLTTPFVDLISIWTHPNEGLDQ
        LV SCSAPC SSKTI LLSPVVSEVYKLIVDMLGKDL SKR KKAMREVKSLVEAILGFINLSSCKDSDKN  ESLDFNL TPFVDLIS+WTHPNEGLDQ
Subjt:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKN-GESLDFNLTTPFVDLISIWTHPNEGLDQ

Query:  FLPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVT
        FLPLVSSEVRGEF SGVCDVRRLAGVVIAETFLMKLCLDFN+G SRQDLEKDLRIW VGSITR++NFYFF               E LVRFLLEATLPVT
Subjt:  FLPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVT

Query:  SLLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYP
        SLLSTEDEALLRKVLYD+LILV+YSFLK EKAI+LPA+HV+ LAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSS VSSQIIRWVKSQ+PSNENV  P
Subjt:  SLLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYP

Query:  NGSSPKIFLGTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAV
        NGSSPKI L                EWLL+AE+QGVRVFD TISN  AKLVLDTSKSVSLEGDKVDDDLLFYIDKQGE+ENGS EDT MDESVNAALV+V
Subjt:  NGSSPKIFLGTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAV

Query:  ARTMSTTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        ARTMSTTE+GSGKKRQ   KRKN+ IKFVKYDLVP+SD TQ RS  DNNDTDSEG+VHNPHSD+DSD+K+
Subjt:  ARTMSTTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

TrEMBL top hitse value%identityAlignment
A0A0A0M1W0 Uncharacterized protein4.2e-23271.05Show/hide
Query:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ
        MAL LVESM+S+NPLK+NPFLGENYEFTLAQSIQNVL EIRKGN  FSQFT+ FY+LIQARADPPLESIWFYSAL FRS SFN  GDFLERVAAMK+LFQ
Subjt:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ

Query:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF
        LVCSCSAPC SSKTI LLSPVVSEVYKL++DM GKDL S R KKAMREVKSLVEAILGF+NLSS +DSDKN +SLDF+L TPF+DLISIWT PNEGLDQF
Subjt:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF

Query:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS
        LPLV SEVR EF SG CDVRRLAGVVIAE FLMKLCLDFN GRSRQDLEKDL  WAVGSIT+++NFY F               E LVR LLEATLPVTS
Subjt:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS

Query:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN
        LLST++EALLRKVLYDALILVDYSFLK E AINLPA+HV+FLAVKRLILT+EAIEFYREHGDQ+RAISYLNAFSSSLVSSQIIRW+KSQ+PSNEN+N PN
Subjt:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN

Query:  GSSPKIFLGTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVA
        G SPK+FL                EWLLKAE+QGVRVFDNTISN R+KLVLDTSKSVS EGDKVDDDLLFYIDKQG N NGSEEDT MDESVNAAL + A
Subjt:  GSSPKIFLGTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVA

Query:  RTMSTTESGSGKK-------------------------------RQG----------------------------------------KAKRKNKNIKFVK
         TMSTTE+ S KK                               +QG                                        KAKRKNK  K VK
Subjt:  RTMSTTESGSGKK-------------------------------RQG----------------------------------------KAKRKNKNIKFVK

Query:  YDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMK
        YDLVPN+DATQL+SAV+NNDT SEGEVHNPHSD+DSDMK
Subjt:  YDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMK

A0A5D3CFU4 Pentatricopeptide repeat-containing protein2.0e-22677.03Show/hide
Query:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ
        MAL LVESM+S+NPLK+N FLGENYEFTLAQSIQNVL EIRKGN  FS+FTEGFY+LIQARADPPLESIWFYSALTFRS SFN  GDFLERVAAMK+LFQ
Subjt:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ

Query:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF
        LVCSCSAPC SSKTI LLSPVVSEVYKL++DM GKDL SKR KKAMREVKSLVEAILG  NLSSC+DS+KN +SLDFN  TPFVDLISIWTHPNEGLDQF
Subjt:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF

Query:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS
        LPLV SEVR EF SG CDVRRLAGVVIAETFL+KLCLDFN G SRQ LE+DLR W VGSITR++NFYFF               E LVR LLEATLPVTS
Subjt:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS

Query:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN
        LLST+DEALLRKVL DALILVDYSFLK EKAINLPA+H +FLAVKRLILT+EA EFYR+HGDQ+RAISYLNAFSSSLVSSQIIRWVKSQ+PSNEN+N+ N
Subjt:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN

Query:  GSSPKIFLGTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVA
        GSSPK+FL                EWLLKAE+QGVRVFDNTISN RAK+VLDTSKSV  EGDKVDDDLLFYIDKQGENENG EED  MD+SVNAALV+VA
Subjt:  GSSPKIFLGTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVA

Query:  RTMSTTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSE
         TMSTTE+ S KKR  KAK++NK           N+D +QL+SAV+NNDT+ +
Subjt:  RTMSTTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSE

A0A6J1D1X1 uncharacterized protein LOC1110165272.8e-22875.96Show/hide
Query:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ
        MALALVESMDSMNP  +NPFLGENYE TL QSI+NVL EIR+GN  F  FTE FY+L+QAR DPP+ESIWFYSAL FRS S +  GDFL+R+AAMK+LFQ
Subjt:  MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQ

Query:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF
        LVCSCSAPC SSKT+A L+PVV EVYKLI DMLGKDL SKR KKAMREVK+LVEAILGFINLSSCK SD+N E LDFNL TPF+DLISIWTHPNEGLDQF
Subjt:  LVCSCSAPCCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQF

Query:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS
        LPLVSSEVRG F SGVCDVR LAGVVIAE FLMKLCLDF+SGRSRQ+LEKDLR+WAVGSIT ++N Y F               E L+RFLL  TLPV S
Subjt:  LPLVSSEVRGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTS

Query:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN
        LLSTEDE LLRKVLYDALILVDYSFL   KAI+L A+HV+FLAVKRLILTH+AIEF+REHGDQSRAISYLNAFSSS V SQ+IRWV+SQIPSNENVN PN
Subjt:  LLSTEDEALLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPN

Query:  GSSPKIFLGTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAAL
        GSSPKI L                EWL KAE+QGVRVFDNTIS+ RAKLVLD SKS S    LEG+KVDD LLFY+DKQGE EN SEED AMDESVNAAL
Subjt:  GSSPKIFLGTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAAL

Query:  VAVARTMSTTESGSG-KKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        V VARTMS  E+GSG KKRQ K++RKNK IKFVKYDL PN DA QLRSAVDNND +SEGEVHNPH DEDSDM++
Subjt:  VAVARTMSTTESGSG-KKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

A0A6J1H8Q6 uncharacterized protein LOC1114615261.9e-24079.33Show/hide
Query:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP
        M+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQAR DPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSCSAP
Subjt:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP

Query:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV
        C SSKTIALLSPVV EVYKLI DMLGKDL SKR KKAMREVKSLVE +LGFINLSSCKDSD+NGESLDFNL TPFVDLISIW + NEGLDQFLPLVSSEV
Subjt:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV

Query:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA
        RGEF SGVCD+RRLAGVVIAETFLMKLCLD NSGRSRQDLE DLRIWAVGSITR+KNFYFF               E LVRFLLEATLPV SLLSTEDEA
Subjt:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA

Query:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL
        LLRK+LYDALILVDYSFL  EKAINLPA HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPSNEN N+P GSSPKIFL
Subjt:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL

Query:  GTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTM
                        EWLLKAE+ GVRVFD+TISN RAKLVLDTSKSVS     EG+ VDD+LLFYIDKQGENENGS EED  MDESVNAALV+ A TM
Subjt:  GTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTM

Query:  STTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        STT++GSGKK++ +  +K K IKF KYDLV NSD T+LRSAV++NDTDSEGEVHNPHSDEDSD K+
Subjt:  STTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

A0A6J1JEZ4 uncharacterized protein LOC1114851552.1e-23978.98Show/hide
Query:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP
        M+SMN  K++PFLGENYEFTL QSIQNVL EIR+GN  FSQF EGFYELIQARADPPLESIWFYSALTFRSR   +NGDFL+RVA MKILFQ  CSCSAP
Subjt:  MDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAP

Query:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV
        C SSKTIALL+PVV EVYKLI DMLGKDL SKR KKAMREVKSLVE ILGFINLSSCKDSD+NGESLDFNL TPFVDLISIWT+ NEGLDQFLPLVSSEV
Subjt:  CCSSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEV

Query:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA
        RGEF SGVCD+RRLAGVVIAETFL+KLCLD NSGRSRQDLE DLRIWAVGSITR+KNFYFF               E LVRFLLEATLPV SLLSTEDEA
Subjt:  RGEFGSGVCDVRRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEA

Query:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL
        LLRK+LYDALILVDYSFL  EKAINLPA HV+FLAVKRLILTHEAIEFYREHGDQ+RAISYLNAFS+SLVSSQIIRWVKSQIPS+ENVN+P GSSPKIFL
Subjt:  LLRKVLYDALILVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFL

Query:  GTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTM
                        EWL KAE+ GVRVFD+TISN RAKLVLDTSKSVS     EG+ VDD+LLFYIDKQGENENGS EED  MDE+VNAALV+ A TM
Subjt:  GTLLVIIFTLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVS----LEGDKVDDDLLFYIDKQGENENGS-EEDTAMDESVNAALVAVARTM

Query:  STTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD
        STT++G  KK++ +  +K K IKF KYDLVPNSDAT+LRSAVD+NDTDS+ EVHNPH DEDSDMK+
Subjt:  STTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDTDSEGEVHNPHSDEDSDMKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G11780.1 unknown protein1.1e-3827.44Show/hide
Query:  LAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARAD-PPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAPCCSSKTIALLSPVVSEVYK
        L  SI+ +L++ R G  +FS F   F  ++    + PPLE +WFYSA+ F S       D    V      FQL+ S S      K ++LLSPVV ++ +
Subjt:  LAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARAD-PPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAPCCSSKTIALLSPVVSEVYK

Query:  LIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWT--------HPNEGLDQFLPLVSSEVRGEFGSGVCDV
        L++        S+R     R+  SL+E I+ +I++    +     + +       F DL  +W            + L+ F+P  S  +R E  S  C V
Subjt:  LIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWT--------HPNEGLDQFLPLVSSEVRGEFGSGVCDV

Query:  RRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLST--EDEALLRKVLYDA
          LAG+V ++ FL+ LC  F+    R +L+KDL+   +  I+   + +FF               + +++ LLE  L +TSL+    EDEA L +++ +A
Subjt:  RRLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLST--EDEALLRKVLYDA

Query:  LI-LVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFLGTLLVIIF
        +I  V+  FL      +  + H+  +A+  L L  + +   R + DQ +   Y N FS+SL+   +I WV SQ     + +     +P  F+        
Subjt:  LI-LVDYSFLKSEKAINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFLGTLLVIIF

Query:  TLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQG
                EWL+  E QG RVF+   S   AK V+  S+         D  +   + KQ E E   + D A +++V++  +    T    E    K+ + 
Subjt:  TLHSIHIDEWLLKAENQGVRVFDNTISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQG

Query:  KAK
        K K
Subjt:  KAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTGGCTCTTGTTGAATCGATGGATTCTATGAACCCTTTAAAGAGGAATCCTTTTCTCGGAGAAAATTATGAGTTTACTCTTGCGCAATCAATCCAGAATGTTTT
AGTTGAAATTCGCAAAGGAAATTTTAGTTTTTCTCAATTTACGGAAGGATTCTACGAGTTGATTCAAGCTAGAGCTGACCCACCATTGGAATCGATATGGTTCTACTCCG
CATTAACGTTTCGTAGCCGTAGCTTCAATATTAATGGCGACTTTTTGGAACGAGTGGCAGCCATGAAAATCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTTGT
TCTTCGAAGACCATTGCGTTGCTCTCTCCAGTGGTTTCCGAGGTGTATAAATTGATTGTCGACATGCTTGGAAAGGATTTGGGCTCGAAAAGGGGAAAGAAAGCAATGAG
AGAGGTTAAATCTTTAGTTGAAGCGATTCTTGGCTTTATAAATCTTAGTTCATGTAAGGATTCGGACAAGAATGGTGAATCTCTTGACTTCAATTTGACTACTCCTTTTG
TGGATTTAATTAGTATTTGGACGCACCCAAATGAGGGATTGGATCAGTTCTTACCGCTCGTTAGCAGTGAGGTTCGTGGGGAGTTTGGTTCAGGCGTCTGTGATGTTCGT
CGCTTGGCTGGAGTTGTAATCGCCGAGACATTTCTGATGAAACTGTGCTTGGATTTTAACAGTGGGCGTTCGAGGCAAGATTTGGAGAAAGATTTAAGGATATGGGCTGT
TGGTTCAATAACTCGGATGAAGAACTTCTACTTTTTTGAATTCTGTAACGTGCTCACATCTTGGTCTATCGTTGTATCAACAGAAGCTCTTGTAAGATTCCTGCTGGAGG
CGACTTTACCTGTGACATCTCTGTTGAGTACTGAAGATGAAGCTTTGTTAAGGAAGGTTCTATATGATGCTCTTATACTGGTTGATTATTCATTTTTGAAATCTGAGAAA
GCCATTAACTTACCTGCCAAACATGTGTCGTTTCTCGCTGTTAAGAGATTGATTCTTACTCATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAAAGCAGAGCCAT
CTCTTATCTAAATGCCTTCTCAAGTTCTCTGGTTTCTTCTCAAATTATTAGATGGGTCAAAAGCCAAATTCCTAGCAATGAAAATGTAAATTATCCTAACGGGTCGTCGC
CTAAAATATTTCTTGGTACGTTGTTGGTAATAATTTTTACATTACACAGTATACACATTGACGAGTGGCTTCTCAAGGCTGAAAATCAAGGTGTAAGAGTATTCGACAAT
ACCATTTCCAATCTTCGAGCCAAATTAGTTCTTGATACTTCCAAATCAGTGTCACTGGAAGGAGATAAAGTAGATGATGATCTTTTGTTTTACATTGACAAGCAAGGGGA
AAATGAAAATGGAAGTGAGGAGGACACAGCAATGGATGAATCAGTAAATGCAGCTCTTGTAGCTGTGGCTCGTACAATGTCAACGACTGAGAGTGGTTCAGGAAAGAAAC
GACAGGGAAAGGCAAAAAGAAAGAATAAAAATATCAAGTTTGTTAAGTACGATCTCGTTCCGAACTCTGATGCTACCCAATTGAGGTCAGCAGTTGATAATAATGATACA
GACAGCGAGGGCGAAGTTCATAATCCACACTCCGACGAAGATTCTGACATGAAAGATTAA
mRNA sequenceShow/hide mRNA sequence
GTCTTTCAATTCTTTCTCCATTGTAATCGATTCAAATTTCATAAGTTTCAACTAGTTTCATGGCTTTGGCTCTTGTTGAATCGATGGATTCTATGAACCCTTTAAAGAGG
AATCCTTTTCTCGGAGAAAATTATGAGTTTACTCTTGCGCAATCAATCCAGAATGTTTTAGTTGAAATTCGCAAAGGAAATTTTAGTTTTTCTCAATTTACGGAAGGATT
CTACGAGTTGATTCAAGCTAGAGCTGACCCACCATTGGAATCGATATGGTTCTACTCCGCATTAACGTTTCGTAGCCGTAGCTTCAATATTAATGGCGACTTTTTGGAAC
GAGTGGCAGCCATGAAAATCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTTGTTCTTCGAAGACCATTGCGTTGCTCTCTCCAGTGGTTTCCGAGGTGTATAAA
TTGATTGTCGACATGCTTGGAAAGGATTTGGGCTCGAAAAGGGGAAAGAAAGCAATGAGAGAGGTTAAATCTTTAGTTGAAGCGATTCTTGGCTTTATAAATCTTAGTTC
ATGTAAGGATTCGGACAAGAATGGTGAATCTCTTGACTTCAATTTGACTACTCCTTTTGTGGATTTAATTAGTATTTGGACGCACCCAAATGAGGGATTGGATCAGTTCT
TACCGCTCGTTAGCAGTGAGGTTCGTGGGGAGTTTGGTTCAGGCGTCTGTGATGTTCGTCGCTTGGCTGGAGTTGTAATCGCCGAGACATTTCTGATGAAACTGTGCTTG
GATTTTAACAGTGGGCGTTCGAGGCAAGATTTGGAGAAAGATTTAAGGATATGGGCTGTTGGTTCAATAACTCGGATGAAGAACTTCTACTTTTTTGAATTCTGTAACGT
GCTCACATCTTGGTCTATCGTTGTATCAACAGAAGCTCTTGTAAGATTCCTGCTGGAGGCGACTTTACCTGTGACATCTCTGTTGAGTACTGAAGATGAAGCTTTGTTAA
GGAAGGTTCTATATGATGCTCTTATACTGGTTGATTATTCATTTTTGAAATCTGAGAAAGCCATTAACTTACCTGCCAAACATGTGTCGTTTCTCGCTGTTAAGAGATTG
ATTCTTACTCATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAAAGCAGAGCCATCTCTTATCTAAATGCCTTCTCAAGTTCTCTGGTTTCTTCTCAAATTATTAG
ATGGGTCAAAAGCCAAATTCCTAGCAATGAAAATGTAAATTATCCTAACGGGTCGTCGCCTAAAATATTTCTTGGTACGTTGTTGGTAATAATTTTTACATTACACAGTA
TACACATTGACGAGTGGCTTCTCAAGGCTGAAAATCAAGGTGTAAGAGTATTCGACAATACCATTTCCAATCTTCGAGCCAAATTAGTTCTTGATACTTCCAAATCAGTG
TCACTGGAAGGAGATAAAGTAGATGATGATCTTTTGTTTTACATTGACAAGCAAGGGGAAAATGAAAATGGAAGTGAGGAGGACACAGCAATGGATGAATCAGTAAATGC
AGCTCTTGTAGCTGTGGCTCGTACAATGTCAACGACTGAGAGTGGTTCAGGAAAGAAACGACAGGGAAAGGCAAAAAGAAAGAATAAAAATATCAAGTTTGTTAAGTACG
ATCTCGTTCCGAACTCTGATGCTACCCAATTGAGGTCAGCAGTTGATAATAATGATACAGACAGCGAGGGCGAAGTTCATAATCCACACTCCGACGAAGATTCTGACATG
AAAGATTAATGTGATTTCAACCAGAAAGGTCAGTATTTGGTTGCCACAAAACAAGTTTGTTTTCTCCCTGTTCTTTTGTTGAAGATCTTCCTGGGGTAGCTGGCCGAGTT
ACAAGGTTGATGGTCGATACTCGATACTCGATACCGACGCTGAGAATGGTGGATTTTCAACTCCAGGAGTGCATTTCAACCAAGGATTGTAAATGTAAAGGGATTCATGA
TTTCATCAAGGAAAGGAAGAAAATGGAAAGACATTTTTGGTCATTATAATGCTGCTGGGTTTCATGAAGATGAGTCAAATGTTTAACTGCACTTAATATACTGTATTTTA
TTTCTTTTTAATTTCTCCTGTAAAAGGTTCTTCAAATTTAAAAGACTACATTACAAAAAGTCTTTCGTCCTTTCATCTGACAAAGTCTAAACTTGTAACAGGATGGCTAT
GGTCCATATCCATAGTGATAATTGAGACTCAAATATTTCATTTTCATTTAAAATTTTAGTGAGTTTTTACTCGATCGA
Protein sequenceShow/hide protein sequence
MALALVESMDSMNPLKRNPFLGENYEFTLAQSIQNVLVEIRKGNFSFSQFTEGFYELIQARADPPLESIWFYSALTFRSRSFNINGDFLERVAAMKILFQLVCSCSAPCC
SSKTIALLSPVVSEVYKLIVDMLGKDLGSKRGKKAMREVKSLVEAILGFINLSSCKDSDKNGESLDFNLTTPFVDLISIWTHPNEGLDQFLPLVSSEVRGEFGSGVCDVR
RLAGVVIAETFLMKLCLDFNSGRSRQDLEKDLRIWAVGSITRMKNFYFFEFCNVLTSWSIVVSTEALVRFLLEATLPVTSLLSTEDEALLRKVLYDALILVDYSFLKSEK
AINLPAKHVSFLAVKRLILTHEAIEFYREHGDQSRAISYLNAFSSSLVSSQIIRWVKSQIPSNENVNYPNGSSPKIFLGTLLVIIFTLHSIHIDEWLLKAENQGVRVFDN
TISNLRAKLVLDTSKSVSLEGDKVDDDLLFYIDKQGENENGSEEDTAMDESVNAALVAVARTMSTTESGSGKKRQGKAKRKNKNIKFVKYDLVPNSDATQLRSAVDNNDT
DSEGEVHNPHSDEDSDMKD