CuGenDBv2

Chy4G080780 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy4G080780
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 2
Genome location: chrH04:20255547..20276918
RNA-Seq Expression: Chy4G080780
Synteny: Chy4G080780
Gene Ontology terms: GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
IPR007741 - Ribosomal protein/NADH dehydrogenase domain
IPR010765 - Protein of unknown function DUF1350
IPR029058 - Alpha/Beta hydrolase fold
IPR036249 - Thioredoxin-like superfamily
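
The genome location above is written as `chromosome:start..end`. As a minimal sketch, the string can be split into its parts and the locus span computed. The names `Locus` and `parse_location` are hypothetical, and the span assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'chrom:start..end' string into a Locus."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Locus(chrom, int(start), int(end))


locus = parse_location("chrH04:20255547..20276918")
print(locus.chrom, locus.span)  # prints "chrH04 21372"
```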


Homology
GenBank top hits (e value; % identity; alignment)
KAF5726608.1 hypothetical protein HS088_TW22G00288 [Tripterygium wilfordii]; e value 3.48e-284; identity 66.08%
Query:  MALNPNLFPNGMPVPFTNELFVLARDGVEFEIDKIPGANSD--RVKAKGTIYLSNVRMVFVSNKPDPVFTAFDMPLLYVRDEKFNQPIFFCNNISGLVEP
        MALNP LFPNGMPVPF NE+FV+ARDGVEF++DKIPGA+    +VKAKGTIYLSN+RMVFV+ KP   F AFDMPLL+V  EKFNQPIF CNNISG V+P
Subjt:  MALNPNLFPNGMPVPFTNELFVLARDGVEFEIDKIPGANSD--RVKAKGTIYLSNVRMVFVSNKPDPVFTAFDMPLLYVRDEKFNQPIFFCNNISGLVEP

Query:  VVPEDQHRALYSTHSFKILFKEGGCGTFVPLFFNLLSSVRQYNQ--HMNAGPRVDPLQAAQTPVDEMMRHAYVDPNDPTKIFLQQPTTESQLRR---RTY
        VVPE+++RALYSTHSFKILFKEGGCGTFVPLFFNLLSSVR +NQ  + N+ PRVDPLQAAQTPVDEMMRHA+     P  I   Q    S++       +
Subjt:  VVPEDQHRALYSTHSFKILFKEGGCGTFVPLFFNLLSSVRQYNQ--HMNAGPRVDPLQAAQTPVDEMMRHAYVDPNDPTKIFLQQPTTESQLRR---RTY

Query:  QSQPAENAMWAFIAGLMKFMFSEKNINNSKTLALTGRQSRHQFIRRTRIKCNYQDSGNEQPP------PSTSTALQLYSDIERLLTETVRQSQEAWGGLK
        +S+P                   K+++  + L++  R  R     R  + C+Y D     PP      PS  + +QLY  IERL+TETVR+SQ AWGG  
Subjt:  QSQPAENAMWAFIAGLMKFMFSEKNINNSKTLALTGRQSRHQFIRRTRIKCNYQDSGNEQPP------PSTSTALQLYSDIERLLTETVRQSQEAWGGLK

Query:  DWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSV
        DW+EVEGAWVLKP+++ P  VVHF+GG+FVGAAPQLTYRLFLERL++KGI +IATPYASGFD+F IADEVQFKFDRCHR   ++V+DLP FG+GHSLGSV
Subjt:  DWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSV

Query:  IHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVLVPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDF
        IHLL+ SRYAV+R+GN+LM+FNNKEAS A+PLFSPV+VPMAQS+GPLLSQIA SPT RLG                  +LP+VEQL PLYMDLV+GREDF
Subjt:  IHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVLVPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDF

Query:  TPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILAQLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVA
         PKPEETRR++KSYYGISRNLLIKF+DDTIDET +LAQ+LSSESAISS+LDMS R LPG+HGLPLQQ LPDIPPAMAD VNRGSE  +NLT GTPWETVA
Subjt:  TPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILAQLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVA

Query:  REVGNTLGVDSKILQAEASKDLNLLVEV
        +EVGNT+G DS+IL+AE+SKD+++LV+V
Subjt:  REVGNTLGVDSKILQAEASKDLNLLVEV

KAG6581645.1 hypothetical protein SDJN03_21647, partial [Cucurbita argyrosperma subsp. sororia]; e value 4.78e-254; identity 90.84%
Query:  KTLALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQ
        + + L GRQSRHQFIR +RIKC+++DSG+EQPP STSTALQLYSDIERLLTETVRQSQ+AWGGL+DWTEVEGAWVLKPRN+TPKYVVHFVGGIFVGAAPQ
Subjt:  KTLALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQ

Query:  LTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSP
        LTYRLFLERLSEKGIFIIATPYASGFDYF IADEVQFKFDRCHRAFL+SV+DLPIFG+GHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSP
Subjt:  LTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSP

Query:  VLVPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLI
        V VPMAQSMGPLLSQIASSPTFRLGAEMTMKQL+NLSPPI+KQ LPLVEQLPPLYMDLVRGREDFTPKPEETRR+VKSYYGISRNLLIKFKDD IDETL+
Subjt:  VLVPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLI

Query:  LAQLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV
        LAQLLSSE+AISSMLDMSTR+LPGNHGLPLQQ LPDIPPAMADAVNRGSELF+NL  GTPWETVA+EVGNTLGVDSK+L+AEASKDL+LLVEV
Subjt:  LAQLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV

XP_004141151.1 uncharacterized protein LOC101208391 [Cucumis sativus]; e value 3.44e-274; identity 99.74%
Query:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT
        +ALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT
Subjt:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT

Query:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
        YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
Subjt:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL

Query:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA
        VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA
Subjt:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA

Query:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV
        QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV
Subjt:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV

XP_008465170.1 PREDICTED: uncharacterized protein LOC103502836 isoform X1 [Cucumis melo]; e value 1.75e-269; identity 97.44%
Query:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT
        +AL GRQSRHQFI+R+RI CNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT
Subjt:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT

Query:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
        YRLFLERLSEKG+FIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLP FG+GHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
Subjt:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL

Query:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA
        VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGR+DFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA
Subjt:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA

Query:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV
        QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVA+EVGNTLGVDSKILQAEASKDLNLLVEV
Subjt:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV

XP_038904905.1 uncharacterized protein LOC120091123 [Benincasa hispida]; e value 9.95e-264; identity 94.88%
Query:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT
        +AL GRQSRHQFIR +RIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRN+TPKYVVHFVGGIFVGAAPQL 
Subjt:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT

Query:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
        YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSV+DLPIFG+GHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
Subjt:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL

Query:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA
        VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPI+KQVLPLVEQLPPLYMDLVRGREDF PKPEETRR++KSYYGISRNLL+KFKDDTIDET ILA
Subjt:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA

Query:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV
        QLLSSESAISSMLDMSTR+LPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVA+EVGNTLGVDSK+L+AE SKDL+LLVEV
Subjt:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV

TrEMBL top hits (e value; % identity; alignment)
A0A0A0LGA1 Uncharacterized protein; e value 1.2e-218; identity 99.74%
Query:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT
        +ALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT
Subjt:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT

Query:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
        YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
Subjt:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL

Query:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA
        VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA
Subjt:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA

Query:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV
        QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV
Subjt:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV

A0A1S3CN97 uncharacterized protein LOC103502836 isoform X1; e value 4.7e-215; identity 97.44%
Query:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT
        +AL GRQSRHQFI+R+RI CNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT
Subjt:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT

Query:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
        YRLFLERLSEKG+FIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLP FG+GHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
Subjt:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL

Query:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA
        VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGR+DFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA
Subjt:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA

Query:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV
        QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVA+EVGNTLGVDSKILQAEASKDLNLLVEV
Subjt:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV

A0A5A7T7X0 DUF1350 domain-containing protein; e value 4.7e-215; identity 97.44%
Query:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT
        +AL GRQSRHQFI+R+RI CNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT
Subjt:  LALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLT

Query:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
        YRLFLERLSEKG+FIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLP FG+GHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL
Subjt:  YRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVL

Query:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA
        VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGR+DFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA
Subjt:  VPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILA

Query:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV
        QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVA+EVGNTLGVDSKILQAEASKDLNLLVEV
Subjt:  QLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV

A0A6J1J727 uncharacterized protein LOC111481851; e value 1.6e-202; identity 90.33%
Query:  KTLALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQ
        + + L GRQSRHQFIR +RIKC+Y+DSG+EQPP STSTALQLYSDIERLLTETVRQSQ+AWGGL DWTEVEGAWVLKPRN+TPKYVVHFVGGIFVGAAPQ
Subjt:  KTLALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQ

Query:  LTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSP
        LTYRLFLERLSEKGIFIIATPYASGFDYF IADEVQFKFDRCHRAFL+SV+DLPIFG+GHSLGSVIHLLIGSRYAVERSGNVLMAFNNK+ASSAVPLFSP
Subjt:  LTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSP

Query:  VLVPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLI
        V VP+AQ MGPLLSQIASSPTFRLGAEMTMKQL+NLSPPI+KQ LPLVEQLPPLYMDLVRGREDFTPKPEETRR+VKSYYGISRNLL+KFKDD IDETL+
Subjt:  VLVPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLI

Query:  LAQLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV
        LAQLLSSE+AISSMLDMSTR+LPGNHGLPLQQ LPDIPPAMADAVNRGSELF+NLT GTPWETVA+EVGNTLGVDSK+L+AEASKDL+LLVEV
Subjt:  LAQLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEV

A0A7J7BXQ7 Uncharacterized protein; e value 1.5e-229; identity 66.08%
Query:  MALNPNLFPNGMPVPFTNELFVLARDGVEFEIDKIPGA--NSDRVKAKGTIYLSNVRMVFVSNKPDPVFTAFDMPLLYVRDEKFNQPIFFCNNISGLVEP
        MALNP LFPNGMPVPF NE+FV+ARDGVEF++DKIPGA  +  +VKAKGTIYLSN+RMVFV+ KP   F AFDMPLL+V  EKFNQPIF CNNISG V+P
Subjt:  MALNPNLFPNGMPVPFTNELFVLARDGVEFEIDKIPGA--NSDRVKAKGTIYLSNVRMVFVSNKPDPVFTAFDMPLLYVRDEKFNQPIFFCNNISGLVEP

Query:  VVPEDQHRALYSTHSFKILFKEGGCGTFVPLFFNLLSSVRQYNQ--HMNAGPRVDPLQAAQTPVDEMMRHAYVDPNDPTKIFLQQPTTESQ---LRRRTY
        VVPE+++RALYSTHSFKILFKEGGCGTFVPLFFNLLSSVR +NQ  + N+ PRVDPLQAAQTPVDEMMRHA+     P  I   Q    S+   +    +
Subjt:  VVPEDQHRALYSTHSFKILFKEGGCGTFVPLFFNLLSSVRQYNQ--HMNAGPRVDPLQAAQTPVDEMMRHAYVDPNDPTKIFLQQPTTESQ---LRRRTY

Query:  QSQPAENAMWAFIAGLMKFMFSEKNINNSKTLALTGRQSRHQFIRRTRIKCNYQDSGNEQP------PPSTSTALQLYSDIERLLTETVRQSQEAWGGLK
        +S+P                   K+++  + L++  R       RR  + C+Y D     P      PPS  + +QLY  IERL+TETVR+SQ AWGG  
Subjt:  QSQPAENAMWAFIAGLMKFMFSEKNINNSKTLALTGRQSRHQFIRRTRIKCNYQDSGNEQP------PPSTSTALQLYSDIERLLTETVRQSQEAWGGLK

Query:  DWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSV
        DW+EVEGAWVLKP+++ P  VVHF+GG+FVGAAPQLTYRLFLERL++KGI +IATPYASGFD+F IADEVQFKFDRCHR   ++V+DLP FG+GHSLGSV
Subjt:  DWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSV

Query:  IHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVLVPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDF
        IHLL+ SRYAV+R+GN+LM+FNNKEAS A+PLFSPV+VPMAQS+GPLLSQIA SPT RLG                  +LP+VEQL PLYMDLV+GREDF
Subjt:  IHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVLVPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDF

Query:  TPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILAQLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVA
         PKPEETRR++KSYYGISRNLLIKF+DDTIDET +LAQ+LSSESAISS+LDMS R LPG+HGLPLQQ LPDIPPAMAD VNRGSE  +NLT GTPWETVA
Subjt:  TPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILAQLLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVA

Query:  REVGNTLGVDSKILQAEASKDLNLLVEV
        +EVGNT+G DS+IL+AE+SKD+++LV+V
Subjt:  REVGNTLGVDSKILQAEASKDLNLLVEV

SwissProt top hits (e value; % identity; alignment)
P0CB79 NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 2; e value 3.0e-17; identity 50.59%
Query:  LKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLVKVGSS
        L+E+RI LCQ SP S   R F+EK Y +LK  NP  PILIRECS ++P+LWARY  G E+   L   S  Q+++ALE+++K  +S
Subjt:  LKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLVKVGSS

P0CB80 NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 2; e value 3.0e-17; identity 50.59%
Query:  LKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLVKVGSS
        L+E+RI LCQ SP S   R F+EK Y +LK  NP  PILIRECS ++P+LWARY  G E+   L   S  Q+++ALE+++K  +S
Subjt:  LKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLVKVGSS

Q02370 NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 2; e value 5.1e-17; identity 50.57%
Query:  RGQLSKNLKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLV
        RG+L   L+E+RI LCQ SP S   R F+EK Y +LK  NP  PILIRECS ++P+LWARY  G E+   L   S  Q+++ALE+++
Subjt:  RGQLSKNLKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLV

Q4R5E2 NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 2; e value 1.5e-16; identity 51.25%
Query:  LKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLV
        L+E+RI LCQ SP S   R F+EK Y +LK  NP  PILIRECS ++P+LWARY  G E+   L   S  Q+++ALE+++
Subjt:  LKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLV

Q9FIJ2 NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 2; e value 7.6e-37; identity 74.74%
Query:  MAWRGQLSKNLKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLVKVGSS
        MAWRG +SK++KELRILLCQSSP+SAP R FVEKNYKDLK+LNPK PILIRECSG++PQ+WARYDMG+ER   L+GL+E QI KALE+LVK G++
Subjt:  MAWRGQLSKNLKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLVKVGSS

Arabidopsis top hits (e value; % identity; alignment)
AT3G43540.1 Protein of unknown function (DUF1350); e value 8.9e-33; identity 32.18%
Query:  LYSDIERLLTETVRQSQEAWGGLKD---WTEVEGAWVL--KPRNTTPKYVVHFVGGIFVGAAPQLTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQ
        L+S   R L+ TV     A GG ++   +T ++   V+   P+   P+ +V F+GG FVGA P+LTY    E L+++G  I++ PY   FD+   A +V 
Subjt:  LYSDIERLLTETVRQSQEAWGGLKD---WTEVEGAWVL--KPRNTTPKYVVHFVGGIFVGAAPQLTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQ

Query:  FKFDRCHRAFLDS-----------VQDLPIFGVGHSLGSVIHLLIGSRYAVE-RSGNVLMAFNNKEASSAVPLFSPVLVPMAQSMGPL--------LSQI
         +F+ C    + S           + DLP+F VGHS G+++ +L GS +A +    N +++FNNK A+ AVP F   L P+ Q M P+        +++ 
Subjt:  FKFDRCHRAFLDS-----------VQDLPIFGVGHSLGSVIHLLIGSRYAVE-RSGNVLMAFNNKEASSAVPLFSPVLVPMAQSMGPL--------LSQI

Query:  ASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILAQLLSSE-SAISSML
        AS    +L  +     + N     +K    LV+QLP ++ ++ +G  +F P P E R   K  Y +   LL++F  D IDET +L + L     +I   L
Subjt:  ASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILAQLLSSE-SAISSML

Query:  DMSTRALPGNHGLPLQQ
        +     L GNH  P  Q
Subjt:  DMSTRALPGNHGLPLQQ

AT3G43540.2 Protein of unknown function (DUF1350); e value 8.3e-23; identity 31.3%
Query:  ERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDS-----------VQDLPIFGVGHSLGSVIHLLIGSRYAVE-RSGNVLMAFNNKEASSAV
        E L+++G  I++ PY   FD+   A +V  +F+ C    + S           + DLP+F VGHS G+++ +L GS +A +    N +++FNNK A+ AV
Subjt:  ERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDS-----------VQDLPIFGVGHSLGSVIHLLIGSRYAVE-RSGNVLMAFNNKEASSAV

Query:  PLFSPVLVPMAQSMGPL--------LSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLL
        P F   L P+ Q M P+        +++ AS    +L  +     + N     +K    LV+QLP ++ ++ +G  +F P P E R   K  Y +   LL
Subjt:  PLFSPVLVPMAQSMGPL--------LSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLL

Query:  IKFKDDTIDETLILAQLLSSE-SAISSMLDMSTRALPGNHGLPLQQ
        ++F  D IDET +L + L     +I   L+     L GNH  P  Q
Subjt:  IKFKDDTIDETLILAQLLSSE-SAISSMLDMSTRALPGNHGLPLQQ

AT5G11680.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cytosol, plasma membrane; EXPRESSED IN: 26 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: WW-domain-binding protein (InterPro:IPR018826); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink); e value 4.6e-90; identity 76.33%
Query:  MALNPNLFPNGMPVPFTNELFVLARDGVEFEIDKIPGANSDRVKAKGTIYLSNVRMVFVSNKPDPVFTAFDMPLLYVRDEKFNQPIFFCNNISGLVEPVV
        MALNP L PNGMPVPF NE+FVL RDGVEFE+DKIPG +   VKAKG IYLSN+RMVFVS+KP   F AFDMPLLY+  EKFNQPIF CNNI+G VEPVV
Subjt:  MALNPNLFPNGMPVPFTNELFVLARDGVEFEIDKIPGANSDRVKAKGTIYLSNVRMVFVSNKPDPVFTAFDMPLLYVRDEKFNQPIFFCNNISGLVEPVV

Query:  PEDQHRALYSTHSFKILFKEGGCGTFVPLFFNLLSSVRQYNQHMN-------AGPRVDPLQAAQTPVDEMMRHAYVDPNDPTKIFLQQPTTESQLRRRTY
        PE++HRALYSTHSFKILFKEGGCGTFVPLF NL+SSVRQYN+ M        A P VDPLQAAQTPVDEMMRHAYVDPNDPT+I+LQQP+ ESQLRRR Y
Subjt:  PEDQHRALYSTHSFKILFKEGGCGTFVPLFFNLLSSVRQYNQHMN-------AGPRVDPLQAAQTPVDEMMRHAYVDPNDPTKIFLQQPTTESQLRRRTY

Query:  QSQPAEN
         S  AE+
Subjt:  QSQPAEN

AT5G47860.1 Protein of unknown function (DUF1350); e value 5.6e-160; identity 76.4%
Query:  STALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQ
        ST +Q+Y +IERLLTETV+ SQ + GG  DW+EVEGAWVLKPRN+ PK VVHF+GGIFVGAAPQLTYRLFLERL+EK + +IATPYASGFD+F IADEVQ
Subjt:  STALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGIFVGAAPQLTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQ

Query:  FKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVLVPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENL
        FK+DRC R+  + VQDLP FG+GHSLGSVIHLLIGSRYAV+R+GNV MAFNNKEAS A+PLFSPVLVPMAQS+GPLLSQ+A+SPT RLGAEMT KQLE L
Subjt:  FKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLFSPVLVPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENL

Query:  SPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILAQLLSSESAISSMLDMSTRALPGNHGLPLQQGLPD
        SPPI+KQ+LPLVEQLPPLYMDLV+GREDF PKPEETRR+++SYYGISRNLLIKF+DD+IDET ILAQ+L  ES+ISS LDMS R LPG+HGLPLQQ LPD
Subjt:  SPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILAQLLSSESAISSMLDMSTRALPGNHGLPLQQGLPD

Query:  IPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVE
        +PP MA+AVNRGSE  +N+  GTPWE++A+EVG +LG+DSKIL+A+ SKDL  LV+
Subjt:  IPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVE

AT5G47890.1 NADH-ubiquinone oxidoreductase B8 subunit, putative; e value 5.4e-38; identity 74.74%
Query:  MAWRGQLSKNLKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLVKVGSS
        MAWRG +SK++KELRILLCQSSP+SAP R FVEKNYKDLK+LNPK PILIRECSG++PQ+WARYDMG+ER   L+GL+E QI KALE+LVK G++
Subjt:  MAWRGQLSKNLKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSEAQISKALEDLVKVGSS
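
In the pairwise alignments above, the unlabelled middle line follows the usual BLAST convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, percent identity can be estimated from that line alone. The function name `percent_identity` and its `aligned_length` parameter are hypothetical, and the result only approximates the reported value when the displayed alignment covers the whole scored region.

```python
from typing import Optional


def percent_identity(midline: str, aligned_length: Optional[int] = None) -> float:
    """Estimate percent identity from a BLAST-style midline.

    Letters count as identical columns; '+' and spaces do not.
    Pass aligned_length (for example the length of the Query line)
    if trailing spaces of the midline may have been stripped.
    """
    length = len(midline) if aligned_length is None else aligned_length
    if length <= 0:
        raise ValueError("alignment length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / length


# First 15 columns of the AT5G47890.1 midline shown above.
print(round(percent_identity("MAWRG +SK++KELR"), 2))  # prints 73.33
```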


Sequences
CDS sequence
ATGGCTCTCAACCCTAATCTTTTTCCAAATGGGATGCCCGTTCCCTTCACCAACGAGCTTTTCGTATTGGCTAGAGATGGCGTTGAATTTGAGATTGATAAGATC
CCCGGGGCTAATAGTGATCGTGTGAAAGCAAAAGGAACTATTTACTTATCAAATGTACGCATGGTCTTCGTCTCAAATAAACCAGATCCAGTCTTTACTGCATTC
GACATGCCTTTGCTCTATGTTCGTGATGAGAAATTTAACCAGCCAATATTTTTCTGCAACAATATATCTGGATTAGTTGAACCTGTGGTGCCTGAAGACCAGCAC
AGGGCTCTCTACTCAACTCACTCATTTAAGATTTTGTTCAAGGAAGGGGGATGCGGTACTTTTGTGCCACTCTTCTTCAACCTGCTGTCTTCAGTGAGGCAATAC
AATCAACATATGAATGCAGGTCCTCGAGTGGATCCATTGCAGGCAGCACAAACTCCTGTTGATGAAATGATGAGACATGCATATGTCGATCCAAATGATCCAACC
AAAATTTTCTTACAGCAGCCAACAACGGAGTCCCAGTTAAGGAGGCGAACATACCAATCCCAACCAGCTGAAAATGCAATGTGGGCTTTTATTGCCGGTTTAATG
AAGTTCATGTTTTCAGAAAAAAATATCAATAATTCCAAAACTTTAGCTCTAACTGGACGTCAATCTCGCCATCAGTTCATTCGACGCACCCGAATCAAGTGCAAT
TATCAAGACTCCGGCAACGAACAACCACCTCCTTCTACTTCCACTGCCTTGCAACTCTACTCTGATATTGAGAGATTACTAACTGAGACAGTTAGGCAATCACAA
GAGGCCTGGGGTGGTTTAAAAGACTGGACAGAAGTTGAGGGAGCATGGGTTCTCAAACCACGAAACACAACCCCAAAATATGTTGTACATTTTGTTGGGGGTATA
TTTGTTGGAGCTGCACCTCAGCTTACATATCGCTTGTTTTTGGAGCGCCTTTCCGAGAAGGGTATTTTCATCATTGCAACACCCTATGCCAGTGGATTTGACTAT
TTCTTAATTGCGGATGAAGTGCAGTTTAAATTTGATAGGTGTCATCGGGCATTTCTTGACTCAGTACAAGATCTTCCCATTTTTGGTGTTGGCCATTCTCTGGGA
TCTGTCATCCACCTTTTGATTGGATCAAGATATGCTGTGGAAAGAAGTGGAAATGTATTGATGGCATTCAATAACAAGGAGGCAAGCTCAGCTGTTCCTTTGTTC
TCCCCGGTGCTTGTTCCAATGGCTCAAAGCATGGGACCTCTCTTATCACAAATTGCATCATCACCGACTTTTCGTCTAGGGGCAGAGATGACAATGAAACAATTA
GAAAATCTTAGCCCTCCAATAGTGAAGCAAGTTCTTCCTTTAGTTGAGCAGCTGCCTCCCTTGTACATGGATTTGGTGAGGGGAAGAGAAGATTTTACTCCAAAG
CCGGAGGAAACTCGACGAATTGTGAAGTCATACTATGGCATCTCCCGCAATCTTCTCATAAAGTTCAAGGATGATACAATCGATGAAACTTTGATACTAGCTCAG
TTGCTTAGCTCGGAGTCTGCAATTAGTTCAATGCTGGACATGTCAACTCGGGCGTTGCCTGGCAATCATGGGTTACCGTTGCAGCAGGGCCTTCCCGACATCCCA
CCAGCAATGGCGGATGCTGTGAATAGAGGTAGTGAGCTCTTTTCCAATCTAACCGCCGGAACACCTTGGGAAACCGTTGCGAGGGAAGTGGGAAACACATTAGGT
GTAGACTCGAAAATTCTCCAAGCTGAAGCTTCCAAGGATTTAAACCTGCTTGTGGAAGTGTTTCTCACAGTGAATAGGAAATTGAAGGTGCATCGGGAGAAAAAG
GCGATTGGCATTTGGGAAGGGTCTGGGTTCGTTCATCAGAAGAAGTTCATTGACGAAAAACAGCGACAGAGGTTGGAAATGGCTTGGAGAGGACAGCTATCCAAG
AATTTGAAGGAGCTTCGAATCTTACTCTGCCAATCATCTCCTTCTAGCGCCCCCGCTAGAGCTTTTGTTGAAAAAAATTACAAGGACCTCAAGACTTTGAATCCT
AAATTTCCAATATTGATCCGCGAATGCAGTGGAATCGAGCCCCAATTATGGGCTAGATATGACATGGGCATTGAAAGGGTTGCTCGTCTGGAGGGGTTGAGTGAG
GCACAGATCTCAAAGGCACTCGAAGACCTAGTTAAAGTCGGGTCATCATGA
mRNA sequence
ATGGCTCTCAACCCTAATCTTTTTCCAAATGGGATGCCCGTTCCCTTCACCAACGAGCTTTTCGTATTGGCTAGAGATGGCGTTGAATTTGAGATTGATAAGATC
CCCGGGGCTAATAGTGATCGTGTGAAAGCAAAAGGAACTATTTACTTATCAAATGTACGCATGGTCTTCGTCTCAAATAAACCAGATCCAGTCTTTACTGCATTC
GACATGCCTTTGCTCTATGTTCGTGATGAGAAATTTAACCAGCCAATATTTTTCTGCAACAATATATCTGGATTAGTTGAACCTGTGGTGCCTGAAGACCAGCAC
AGGGCTCTCTACTCAACTCACTCATTTAAGATTTTGTTCAAGGAAGGGGGATGCGGTACTTTTGTGCCACTCTTCTTCAACCTGCTGTCTTCAGTGAGGCAATAC
AATCAACATATGAATGCAGGTCCTCGAGTGGATCCATTGCAGGCAGCACAAACTCCTGTTGATGAAATGATGAGACATGCATATGTCGATCCAAATGATCCAACC
AAAATTTTCTTACAGCAGCCAACAACGGAGTCCCAGTTAAGGAGGCGAACATACCAATCCCAACCAGCTGAAAATGCAATGTGGGCTTTTATTGCCGGTTTAATG
AAGTTCATGTTTTCAGAAAAAAATATCAATAATTCCAAAACTTTAGCTCTAACTGGACGTCAATCTCGCCATCAGTTCATTCGACGCACCCGAATCAAGTGCAAT
TATCAAGACTCCGGCAACGAACAACCACCTCCTTCTACTTCCACTGCCTTGCAACTCTACTCTGATATTGAGAGATTACTAACTGAGACAGTTAGGCAATCACAA
GAGGCCTGGGGTGGTTTAAAAGACTGGACAGAAGTTGAGGGAGCATGGGTTCTCAAACCACGAAACACAACCCCAAAATATGTTGTACATTTTGTTGGGGGTATA
TTTGTTGGAGCTGCACCTCAGCTTACATATCGCTTGTTTTTGGAGCGCCTTTCCGAGAAGGGTATTTTCATCATTGCAACACCCTATGCCAGTGGATTTGACTAT
TTCTTAATTGCGGATGAAGTGCAGTTTAAATTTGATAGGTGTCATCGGGCATTTCTTGACTCAGTACAAGATCTTCCCATTTTTGGTGTTGGCCATTCTCTGGGA
TCTGTCATCCACCTTTTGATTGGATCAAGATATGCTGTGGAAAGAAGTGGAAATGTATTGATGGCATTCAATAACAAGGAGGCAAGCTCAGCTGTTCCTTTGTTC
TCCCCGGTGCTTGTTCCAATGGCTCAAAGCATGGGACCTCTCTTATCACAAATTGCATCATCACCGACTTTTCGTCTAGGGGCAGAGATGACAATGAAACAATTA
GAAAATCTTAGCCCTCCAATAGTGAAGCAAGTTCTTCCTTTAGTTGAGCAGCTGCCTCCCTTGTACATGGATTTGGTGAGGGGAAGAGAAGATTTTACTCCAAAG
CCGGAGGAAACTCGACGAATTGTGAAGTCATACTATGGCATCTCCCGCAATCTTCTCATAAAGTTCAAGGATGATACAATCGATGAAACTTTGATACTAGCTCAG
TTGCTTAGCTCGGAGTCTGCAATTAGTTCAATGCTGGACATGTCAACTCGGGCGTTGCCTGGCAATCATGGGTTACCGTTGCAGCAGGGCCTTCCCGACATCCCA
CCAGCAATGGCGGATGCTGTGAATAGAGGTAGTGAGCTCTTTTCCAATCTAACCGCCGGAACACCTTGGGAAACCGTTGCGAGGGAAGTGGGAAACACATTAGGT
GTAGACTCGAAAATTCTCCAAGCTGAAGCTTCCAAGGATTTAAACCTGCTTGTGGAAGTGTTTCTCACAGTGAATAGGAAATTGAAGGTGCATCGGGAGAAAAAG
GCGATTGGCATTTGGGAAGGGTCTGGGTTCGTTCATCAGAAGAAGTTCATTGACGAAAAACAGCGACAGAGGTTGGAAATGGCTTGGAGAGGACAGCTATCCAAG
AATTTGAAGGAGCTTCGAATCTTACTCTGCCAATCATCTCCTTCTAGCGCCCCCGCTAGAGCTTTTGTTGAAAAAAATTACAAGGACCTCAAGACTTTGAATCCT
AAATTTCCAATATTGATCCGCGAATGCAGTGGAATCGAGCCCCAATTATGGGCTAGATATGACATGGGCATTGAAAGGGTTGCTCGTCTGGAGGGGTTGAGTGAG
GCACAGATCTCAAAGGCACTCGAAGACCTAGTTAAAGTCGGGTCATCATGA
Protein sequence
MALNPNLFPNGMPVPFTNELFVLARDGVEFEIDKIPGANSDRVKAKGTIYLSNVRMVFVSNKPDPVFTAFDMPLLYVRDEKFNQPIFFCNNISGLVEPVVPEDQH
RALYSTHSFKILFKEGGCGTFVPLFFNLLSSVRQYNQHMNAGPRVDPLQAAQTPVDEMMRHAYVDPNDPTKIFLQQPTTESQLRRRTYQSQPAENAMWAFIAGLM
KFMFSEKNINNSKTLALTGRQSRHQFIRRTRIKCNYQDSGNEQPPPSTSTALQLYSDIERLLTETVRQSQEAWGGLKDWTEVEGAWVLKPRNTTPKYVVHFVGGI
FVGAAPQLTYRLFLERLSEKGIFIIATPYASGFDYFLIADEVQFKFDRCHRAFLDSVQDLPIFGVGHSLGSVIHLLIGSRYAVERSGNVLMAFNNKEASSAVPLF
SPVLVPMAQSMGPLLSQIASSPTFRLGAEMTMKQLENLSPPIVKQVLPLVEQLPPLYMDLVRGREDFTPKPEETRRIVKSYYGISRNLLIKFKDDTIDETLILAQ
LLSSESAISSMLDMSTRALPGNHGLPLQQGLPDIPPAMADAVNRGSELFSNLTAGTPWETVAREVGNTLGVDSKILQAEASKDLNLLVEVFLTVNRKLKVHREKK
AIGIWEGSGFVHQKKFIDEKQRQRLEMAWRGQLSKNLKELRILLCQSSPSSAPARAFVEKNYKDLKTLNPKFPILIRECSGIEPQLWARYDMGIERVARLEGLSE
AQISKALEDLVKVGSS
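
The CDS and protein listings above can be cross-checked by translating the CDS with the standard genetic code and comparing the result with the protein sequence. The following is a minimal sketch using only the Python standard library. The names `translate` and `matches_protein` are hypothetical, and the use of the standard (nuclear) codon table is an assumption, not something the record states.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein; '*' marks a stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def matches_protein(cds: str, protein: str) -> bool:
    """True if the translated CDS, minus trailing stops, equals the protein."""
    return translate(cds).rstrip("*") == "".join(protein.split())


# First seven codons of the CDS listed above.
print(translate("ATGGCTCTCAACCCTAATCTT"))  # prints MALNPNL
```

Pasting the full multi-line CDS and protein listings into `matches_protein` works as is, since line breaks are removed before comparison.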