CuGenDBv2

Clc11G18970 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G18970
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Fiber Fb32-like protein isoform 3
Genome location: ClcChr11:29467451..29473674
RNA-Seq Expression: Clc11G18970
Synteny: Clc11G18970
Gene Ontology terms: GO:0005776 - autophagosome (cellular component)
                     GO:0061908 - phagophore (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146096.1 uncharacterized protein LOC101204627 [Cucumis sativus] | e value: 0.0e+00 | % identity: 65.25
Query:  MKMDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAAS
        MKMDLKHKGISWVGNMFQKFEAVC EVDNII+QDKVKYVENQVSSA ANVKRLYS+VVQG+LPP GDPM YEAK  AQRGH PINAYFRS SHNEGKAAS
Subjt:  MKMDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAAS

Query:  NVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENM--------------------------------
        NV+NKSSVGHGTST DQIDN S+A C+VP VNEE AQVPNH SLELNADLPL+KNDDV LDK   E+M                                
Subjt:  NVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENM--------------------------------

Query:  ----------------------------------------------------KENAVSELLSEKNDGSLTNKLTLMESDDSHPLSHSLSN----------
                                                            KEN V+ELLSEKNDGSLT+KL+LMESD S PLSHSL+N          
Subjt:  ----------------------------------------------------KENAVSELLSEKNDGSLTNKLTLMESDDSHPLSHSLSN----------

Query:  -------------------------------------------------------------------------VSTEINDTNKKASSVCDGFDMQLEDDV
                                                                                 VSTEIND+NKKAS VCD FDMQLEDDV
Subjt:  -------------------------------------------------------------------------VSTEINDTNKKASSVCDGFDMQLEDDV

Query:  LLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNE
        LLV NNDGVLTDKDESKSSEED++MKFNASDPLKH+AN T CEVK T++EAIL L+NSHLP+ESS LSWKN+ +LSN  S+EFLKK VTME NTADHLNE
Subjt:  LLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNE

Query:  NHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDA
        NHL+H WSGTNFV KEAD SN LLKSVV SGR+DHV+MDKD NKS +K AIFEDDP+S+LLN PR+ NGI FTNEEA MV DRNH QLETEILARKNDD 
Subjt:  NHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDA

Query:  LAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKG
        L VK+SNESL  DTILEL+HDAIYPLKNQPRCTS+  +YK EEVSSVSNDS  +L S  I  KN KA  DKA D SCKEQANLELS ELTLHCGE SIK 
Subjt:  LAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKG

Query:  TLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVEQASSILLLKM-------ETTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLK
        +LC++GN CEGDIVT NG  QE+SI CADVE IH VEQASS L+  +       ETTSK LENG+GYSSNA DATSSE  S+VLT GETVEET PVSSLK
Subjt:  TLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVEQASSILLLKM-------ETTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLK

Query:  PLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKS
        PLAKGSFSAFRSSV+NL S T+VHEKP E NA+ EC SR SF +  +PSYGN  S MK  SSRSSLSSMESL GTHASRANDT FLP   T  QG+ SKS
Subjt:  PLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKS

Query:  TSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSK
        TSS  PSFST  GCPHDS+ YILDAE+ETVDLGHKV+ +D+CD +DYKALHA+SRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTD+E STNS 
Subjt:  TSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSK

Query:  QKLDKENAST
        QKL+KEN ST
Subjt:  QKLDKENAST

XP_008463725.1 PREDICTED: uncharacterized protein LOC103501804 isoform X1 [Cucumis melo] | e value: 0.0e+00 | % identity: 61.9
Query:  MDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAASNV
        MDLKHKGISWVGNMFQKFEAVC EVDNII+QDKVKYVENQVSSA ANVKRLYS+VVQG+LPPIGDPM+YEAK  AQRGH P+NAYFRS  HNEGKAASNV
Subjt:  MDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAASNV

Query:  INKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAV-----------------------------
        +N SSVGHGTS+ DQIDN S+ASC+VP VNEE AQVPN S+LELN DLPL+KND V+LDK L+E+MKEN V                             
Subjt:  INKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAV-----------------------------

Query:  -------------------------------------------------------SELLSEKNDGSLTNKLTLMESDDSHPLSHSLSNV-----------
                                                               SELLSEKNDGSLT+KL+LME D S PLSHSLSNV           
Subjt:  -------------------------------------------------------SELLSEKNDGSLTNKLTLMESDDSHPLSHSLSNV-----------

Query:  ------------------------------------------------------------------------STEINDTNKKASSVCDGFDMQLEDDVLL
                                                                                STEIND+NKKAS VCD FDMQLEDDVLL
Subjt:  ------------------------------------------------------------------------STEINDTNKKASSVCDGFDMQLEDDVLL

Query:  VGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENH
        VGNN GVLTDKDESKSSEED+TMK NASDPLKH+AN TSCEVK T++EAIL L+NSHLPMESS LSWKND +LSN +S+EFLKK VTME NTADHLNENH
Subjt:  VGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENH

Query:  LSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDALA
         +H WSGTNFV KEAD SN LLKSVVLSG +DHVVMDKD ++S +K AIFEDDP+S+LLN PR+ NGI FTNEE  MV DRNH QL TEILARKNDDAL 
Subjt:  LSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDALA

Query:  VKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKGTL
        +K+SNESLKNDTILEL+HDA YPLKNQPRCTSS  KYK EEVSSVSNDS L+L+S  +  KN KA IDKA D SCKEQANLELS EL LHCGE SIK TL
Subjt:  VKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKGTL

Query:  CNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVE---------------------------------------QASSILLLK---------------ME
        C++GN  EGD+VT NG  QE+ I C DVE IH+ +                                       + +SI+L                 ME
Subjt:  CNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVE---------------------------------------QASSILLLK---------------ME

Query:  TTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTS
        TT K LENG+G SSNA DATS+E  S+VLT GETVEET PVSSLKPLAKGSFSAF  S +NL S T+VHEKP E NA+ EC SR SFE+  SPSYGN  S
Subjt:  TTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTS

Query:  KMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSR
         MK VSS+SSLSSMESLA THASRANDT FLP  YT  QG+ SKSTSSG PSFST  GCPHDSS YILDAEMETVDLGHKVT ++ECDV+DYKALHAVSR
Subjt:  KMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSR

Query:  RTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST
        RTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTD+E STNS QKL+KEN ST
Subjt:  RTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST

XP_038898347.1 uncharacterized protein LOC120086024 isoform X1 [Benincasa hispida] | e value: 0.0e+00 | % identity: 66.03
Query:  MKMDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAAS
        MKMDLKHKGISWVGNMFQKFEAVCQEVDNII+QDKVKYVENQVSSA ANVKRLYSDVVQGLLPP+GDPM+YEAK P QRGH PINAYFRS+SHNEGKAAS
Subjt:  MKMDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAAS

Query:  NVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAVSELLSEKNDGSLTNKLTLMESDDSHPL
        NV NKSSVGH   TIDQIDN S+ASC VP VNEE AQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAVSELLSEKNDGSLT+KLTLMESD S PL
Subjt:  NVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAVSELLSEKNDGSLTNKLTLMESDDSHPL

Query:  SHSLSNVSTEINDTNKKASSVCDGFDMQLEDDVLLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRL
        S SLSNVSTEINDTNK+ASSVCDGFDM+LEDDVLLVGN+D +LTDKDESKSSEED TMKFNASDPLKH+AN TSCEVK T+EE IL L+NSHLP+ESSR 
Subjt:  SHSLSNVSTEINDTNKKASSVCDGFDMQLEDDVLLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRL

Query:  SWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNE
         WKND DLSN NS+EFLKKVVTMEPNTADHLNENHLSH WSGTNFVSKEAD SNL  +SVVLS RI H +MDKD NKSPVK AIFEDDP SYLLN PR+ 
Subjt:  SWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNE

Query:  N---------------------------------------------------------------------------------------------------
        N                                                                                                   
Subjt:  N---------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  GIIFTNEEASMVPDRNHQQLETEILARKNDDALAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKAS
        GI FTN+EA MV DRNHQQL TEILARKNDDAL VKY NESLKNDTILEL+H A YPL N+PRCTSS I+YKNEEVS+VSN S L+LESE IF KNS A 
Subjt:  GIIFTNEEASMVPDRNHQQLETEILARKNDDALAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKAS

Query:  IDKAPDASCKEQANLELSAELTLHCGEVSIKGTLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVEQASSI-------LLLKMETTSKDLENGVGYS
        IDKA DASCKEQANLELS ELTLHCGE SIK TLC++GN  EGDIVTSNG PQ++SI CADV+ IH V+QAS I       L  + ETTSK LENGV YS
Subjt:  IDKAPDASCKEQANLELSAELTLHCGEVSIKGTLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVEQASSI-------LLLKMETTSKDLENGVGYS

Query:  SNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSS
        SNA DAT     S+VLT GETVEET PVSSLKPLAK SFSAFRS V+NL ++T++HEKP EQNAYIEC SRPSFE++ SPSYGNK SKMKFVSS+SSLSS
Subjt:  SNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSS

Query:  MESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQ
        +E LA  HASRAND AFLP  YT SQGEFSKSTSSGIPS ST GGCPHDSS Y   ++METVDLGHKVTL+DE DVVDYK LHAVSRRTQKLRSYKKRIQ
Subjt:  MESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQ

Query:  DAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST
        DAF+SKKRLAKEYEQLAIWYGDTDLE STN+ QKL+KENAST
Subjt:  DAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST

XP_038898348.1 uncharacterized protein LOC120086024 isoform X2 [Benincasa hispida] | e value: 0.0e+00 | % identity: 65.93
Query:  MKMDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAAS
        MKMDLKHKGISWVGNMFQKFEAVCQEVDNII+QDKVKYVENQVSSA ANVKRLYSDVVQGLLPP+GDPM+YEAK P QRGH PINAYFRS+SHNEGKAAS
Subjt:  MKMDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAAS

Query:  NVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAVSELLSEKNDGSLTNKLTLMESDDSHPL
        NV NKSSVGH   TIDQIDN S+ASC VP VNEE AQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAVSELLSEKNDGSLT+KLTLMESD S PL
Subjt:  NVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAVSELLSEKNDGSLTNKLTLMESDDSHPL

Query:  SHSLSNVSTEINDTNKKASSVCDGFDMQLEDDVLLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRL
        S SLSNVSTEINDTNK+ASSVCDGFDM+LEDDVLLVGN+D +LTDKDESKSSEED TMKFNASDPLKH+AN TSCEVK T+EE IL L+NSHLP+ESSR 
Subjt:  SHSLSNVSTEINDTNKKASSVCDGFDMQLEDDVLLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRL

Query:  SWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNE
         WKND DLSN NS+EFLKKVVTMEPNTADHLNENHLSH WSGTNFVSKEAD SNL  +SVVLS RI H +MDKD NKSPVK AIFEDDP SYLLN PR+ 
Subjt:  SWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNE

Query:  N---------------------------------------------------------------------------------------------------
        N                                                                                                   
Subjt:  N---------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  GIIFTNEEASMVPDRNHQQLETEILARKNDDALAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKAS
        GI FTN+EA MV DRNHQQL TEILARKNDDAL VKY NESLKNDTILEL+H A YPL N+PRCTSS I+YKNEEVS+VSN S L+LESE IF KNS A 
Subjt:  GIIFTNEEASMVPDRNHQQLETEILARKNDDALAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKAS

Query:  IDKAPDASCKEQANLELSAELTLHCGEVSIKGTLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVEQASSI-------LLLKMETTSKDLENGVGYS
        IDKA DASCKEQANLELS ELTLHCGE SIK TLC++GN  EGDIVTSNG PQ++SI CADV+ IH V+QAS I       L  + ETTSK LENGV YS
Subjt:  IDKAPDASCKEQANLELSAELTLHCGEVSIKGTLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVEQASSI-------LLLKMETTSKDLENGVGYS

Query:  SNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSS
        SNA DAT     S+VLT GETVEET PVSSLKPLAK SFSAFRS V+NL ++T++HEKP EQNAYIEC SRPSFE++ SPSYGNK SKMKFVSS+SSLSS
Subjt:  SNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSS

Query:  MESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQ
        +E LA  HASRAND AFLP  YT SQGEFSKSTSSGIPS ST GGCPHDSS Y   ++METVDLGHKVTL+DE DVVDYK LHAVSRRTQKLRSY KRIQ
Subjt:  MESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQ

Query:  DAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST
        DAF+SKKRLAKEYEQLAIWYGDTDLE STN+ QKL+KENAST
Subjt:  DAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST

XP_038898349.1 uncharacterized protein LOC120086024 isoform X3 [Benincasa hispida] | e value: 0.0e+00 | % identity: 64.49
Query:  MKMDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAAS
        MKMDLKHKGISWVGNMFQKFEAVCQEVDNII+QDKVKYVENQVSSA ANVKRLYSDVVQGLLPP+GDPM+YEAK P QRGH PINAYFRS+SHNEGKAAS
Subjt:  MKMDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAAS

Query:  NVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAVSELLSEKNDGSLTNKLTLMESDDSHPL
        NV NKSSVGH   TIDQIDN S+ASC VP VNEE AQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAVSELLSEKNDGSLT+KLTLMESD S PL
Subjt:  NVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAVSELLSEKNDGSLTNKLTLMESDDSHPL

Query:  SHSLSNVSTEINDTNKKASSVCDGFDMQLEDDVLLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRL
        S SLSNVSTEINDTNK+ASSVCDGFDM+LEDDVLLVGN+D +LTDKDESKSSEED TMKFNASDPLKH+AN TSCEVK T+EE IL L+NSHLP+ESSR 
Subjt:  SHSLSNVSTEINDTNKKASSVCDGFDMQLEDDVLLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRL

Query:  SWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNE
         WKND DLSN NS+EFLKKVVTMEPNTADHLNENHLSH WSGTNFVSKEAD SNL  +SVVLS RI H +MDKD NKSPVK AIFEDDP SYLLN PR+ 
Subjt:  SWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNE

Query:  N---------------------------------------------------------------------------------------------------
        N                                                                                                   
Subjt:  N---------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  GIIFTNEEASMVPDRNHQQLETEILARKNDDALAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKAS
        GI FTN+EA MV DRNHQQL TEILARKNDDAL VKY NESLKNDTILEL+H A YPL N+PRCTSS I+YKNEEVS+VSN S L+LESE IF KNS A 
Subjt:  GIIFTNEEASMVPDRNHQQLETEILARKNDDALAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKAS

Query:  IDKAPDASCKEQANLELSAELTLHCGEVSIKGTLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVEQASSI-------LLLKMETTSKDLENGVGYS
        IDKA DASCKEQANLELS ELTLHCGE SIK TLC++GN  EGDIVTSNG PQ++SI CADV+ IH V+QAS I       L  + ETTSK LENGV YS
Subjt:  IDKAPDASCKEQANLELSAELTLHCGEVSIKGTLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVEQASSI-------LLLKMETTSKDLENGVGYS

Query:  SNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSS
        SNA DAT     S+VLT GETVEET PVSSLKPLAK SFSAFRS V+NL ++T++HEKP EQNAYIEC SRPSFE++ SPSYGNK SKMKFVSS+SSLSS
Subjt:  SNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSS

Query:  MESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQ
        +E LA  HASRAND AFLP  YT SQ                  GCPHDSS Y   ++METVDLGHKVTL+DE DVVDYK LHAVSRRTQKLRSYKKRIQ
Subjt:  MESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQ

Query:  DAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST
        DAF+SKKRLAKEYEQLAIWYGDTDLE STN+ QKL+KENAST
Subjt:  DAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KZJ5 Uncharacterized protein | e value: 0.0e+00 | % identity: 65.25
Query:  MKMDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAAS
        MKMDLKHKGISWVGNMFQKFEAVC EVDNII+QDKVKYVENQVSSA ANVKRLYS+VVQG+LPP GDPM YEAK  AQRGH PINAYFRS SHNEGKAAS
Subjt:  MKMDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAAS

Query:  NVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENM--------------------------------
        NV+NKSSVGHGTST DQIDN S+A C+VP VNEE AQVPNH SLELNADLPL+KNDDV LDK   E+M                                
Subjt:  NVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENM--------------------------------

Query:  ----------------------------------------------------KENAVSELLSEKNDGSLTNKLTLMESDDSHPLSHSLSN----------
                                                            KEN V+ELLSEKNDGSLT+KL+LMESD S PLSHSL+N          
Subjt:  ----------------------------------------------------KENAVSELLSEKNDGSLTNKLTLMESDDSHPLSHSLSN----------

Query:  -------------------------------------------------------------------------VSTEINDTNKKASSVCDGFDMQLEDDV
                                                                                 VSTEIND+NKKAS VCD FDMQLEDDV
Subjt:  -------------------------------------------------------------------------VSTEINDTNKKASSVCDGFDMQLEDDV

Query:  LLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNE
        LLV NNDGVLTDKDESKSSEED++MKFNASDPLKH+AN T CEVK T++EAIL L+NSHLP+ESS LSWKN+ +LSN  S+EFLKK VTME NTADHLNE
Subjt:  LLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNE

Query:  NHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDA
        NHL+H WSGTNFV KEAD SN LLKSVV SGR+DHV+MDKD NKS +K AIFEDDP+S+LLN PR+ NGI FTNEEA MV DRNH QLETEILARKNDD 
Subjt:  NHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDA

Query:  LAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKG
        L VK+SNESL  DTILEL+HDAIYPLKNQPRCTS+  +YK EEVSSVSNDS  +L S  I  KN KA  DKA D SCKEQANLELS ELTLHCGE SIK 
Subjt:  LAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKG

Query:  TLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVEQASSILLLKM-------ETTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLK
        +LC++GN CEGDIVT NG  QE+SI CADVE IH VEQASS L+  +       ETTSK LENG+GYSSNA DATSSE  S+VLT GETVEET PVSSLK
Subjt:  TLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVEQASSILLLKM-------ETTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLK

Query:  PLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKS
        PLAKGSFSAFRSSV+NL S T+VHEKP E NA+ EC SR SF +  +PSYGN  S MK  SSRSSLSSMESL GTHASRANDT FLP   T  QG+ SKS
Subjt:  PLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKS

Query:  TSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSK
        TSS  PSFST  GCPHDS+ YILDAE+ETVDLGHKV+ +D+CD +DYKALHA+SRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTD+E STNS 
Subjt:  TSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSK

Query:  QKLDKENAST
        QKL+KEN ST
Subjt:  QKLDKENAST

A0A1S3CJX6 uncharacterized protein LOC103501804 isoform X1 | e value: 0.0e+00 | % identity: 61.9
Query:  MDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAASNV
        MDLKHKGISWVGNMFQKFEAVC EVDNII+QDKVKYVENQVSSA ANVKRLYS+VVQG+LPPIGDPM+YEAK  AQRGH P+NAYFRS  HNEGKAASNV
Subjt:  MDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAASNV

Query:  INKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAV-----------------------------
        +N SSVGHGTS+ DQIDN S+ASC+VP VNEE AQVPN S+LELN DLPL+KND V+LDK L+E+MKEN V                             
Subjt:  INKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAV-----------------------------

Query:  -------------------------------------------------------SELLSEKNDGSLTNKLTLMESDDSHPLSHSLSNV-----------
                                                               SELLSEKNDGSLT+KL+LME D S PLSHSLSNV           
Subjt:  -------------------------------------------------------SELLSEKNDGSLTNKLTLMESDDSHPLSHSLSNV-----------

Query:  ------------------------------------------------------------------------STEINDTNKKASSVCDGFDMQLEDDVLL
                                                                                STEIND+NKKAS VCD FDMQLEDDVLL
Subjt:  ------------------------------------------------------------------------STEINDTNKKASSVCDGFDMQLEDDVLL

Query:  VGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENH
        VGNN GVLTDKDESKSSEED+TMK NASDPLKH+AN TSCEVK T++EAIL L+NSHLPMESS LSWKND +LSN +S+EFLKK VTME NTADHLNENH
Subjt:  VGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENH

Query:  LSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDALA
         +H WSGTNFV KEAD SN LLKSVVLSG +DHVVMDKD ++S +K AIFEDDP+S+LLN PR+ NGI FTNEE  MV DRNH QL TEILARKNDDAL 
Subjt:  LSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDALA

Query:  VKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKGTL
        +K+SNESLKNDTILEL+HDA YPLKNQPRCTSS  KYK EEVSSVSNDS L+L+S  +  KN KA IDKA D SCKEQANLELS EL LHCGE SIK TL
Subjt:  VKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKGTL

Query:  CNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVE---------------------------------------QASSILLLK---------------ME
        C++GN  EGD+VT NG  QE+ I C DVE IH+ +                                       + +SI+L                 ME
Subjt:  CNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVE---------------------------------------QASSILLLK---------------ME

Query:  TTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTS
        TT K LENG+G SSNA DATS+E  S+VLT GETVEET PVSSLKPLAKGSFSAF  S +NL S T+VHEKP E NA+ EC SR SFE+  SPSYGN  S
Subjt:  TTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTS

Query:  KMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSR
         MK VSS+SSLSSMESLA THASRANDT FLP  YT  QG+ SKSTSSG PSFST  GCPHDSS YILDAEMETVDLGHKVT ++ECDV+DYKALHAVSR
Subjt:  KMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSR

Query:  RTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST
        RTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTD+E STNS QKL+KEN ST
Subjt:  RTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST

A0A1S3CKE3 uncharacterized protein LOC103501804 isoform X2 | e value: 6.7e-310 | % identity: 59.15
Query:  MDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAASNV
        MDLKHKGISWVGNMFQKFEAVC EVDNII+QDKVKYVENQVSSA ANVKRLYS+VVQG+LPPIGDPM+YEAK  AQRGH P+NAYFRS  HNEGKAASNV
Subjt:  MDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAASNV

Query:  INKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAV-----------------------------
        +N SSVGHGTS+ DQIDN S+ASC+VP VNEE AQVPN S+LELN DLPL+KND V+LDK L+E+MKEN V                             
Subjt:  INKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAV-----------------------------

Query:  -------------------------------------------------------SELLSEKNDGSLTNKLTLMESDDSHPLSHSLSNV-----------
                                                               SELLSEKNDGSLT+KL+LME D S PLSHSLSNV           
Subjt:  -------------------------------------------------------SELLSEKNDGSLTNKLTLMESDDSHPLSHSLSNV-----------

Query:  ------------------------------------------------------------------------STEINDTNKKASSVCDGFDMQLEDDVLL
                                                                                STEIND+NKKAS VCD FDMQLEDDVLL
Subjt:  ------------------------------------------------------------------------STEINDTNKKASSVCDGFDMQLEDDVLL

Query:  VGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENH
        VGNN GVLTDKDESKSSEED+TMK NASDPLKH+AN TSCEVK T++EAIL L+NSHLPMESS LSWKND +LSN +S+EFLKK VTME NTADHLNENH
Subjt:  VGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENH

Query:  LSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDALA
         +H WSGTNFV KEAD SN LLKSVVLSG +DHVVMDKD ++S +K AIFEDDP+S+LLN PR+ NGI FTNEE  MV DRNH QL TEILARKNDDAL 
Subjt:  LSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDALA

Query:  VKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKGTL
        +K+SNESLKNDTILEL+HDA YPLKNQPRCTSS  KYK EEVSSVSNDS L+L+S  +  KN KA IDKA D SCKEQANLELS EL LHCGE SIK TL
Subjt:  VKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKGTL

Query:  CNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVE---------------------------------------QASSILLLK---------------ME
        C++GN  EGD+VT NG  QE+ I C DVE IH+ +                                       + +SI+L                 ME
Subjt:  CNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVE---------------------------------------QASSILLLK---------------ME

Query:  TTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTS
        TT K LENG+G SSNA DATS+E  S+VLT GETVEET PVSSLKPLAKGSFSAF  S +NL S T+VHEKP E NA+ EC SR SFE+  SPSYGN  S
Subjt:  TTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTS

Query:  KMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSR
         MK VSS+SSLSSMESL                                        GCPHDSS YILDAEMETVDLGHKVT ++ECDV+DYKALHAVSR
Subjt:  KMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSR

Query:  RTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST
        RTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTD+E STNS QKL+KEN ST
Subjt:  RTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST

A0A5A7VK64 Fiber Fb32-like protein isoform 3 | e value: 0.0e+00 | % identity: 59.47
Query:  MDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAASNV
        MDLKHKGISWVGNMFQKFEAVC EVDNII+QDKVKYVENQVSSA ANVKRLYS+VVQG+LPPIGDPM+YEAK  AQRGH P+NAYFRS  HNEGKAASNV
Subjt:  MDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAASNV

Query:  INKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAV-----------------------------
        +N SSVGHGTS+ DQIDN S+ASC+VP VNEE AQVPN S+LELN DLPL+KND V+LDK L+E+MKEN V                             
Subjt:  INKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAV-----------------------------

Query:  -------------------------------------------------------SELLSEKNDGSLTNKLTLMESDDSHPLSHSLSNV-----------
                                                               SELLSEKNDGSLT+KL+LME D S PLSHSLSNV           
Subjt:  -------------------------------------------------------SELLSEKNDGSLTNKLTLMESDDSHPLSHSLSNV-----------

Query:  ------------------------------------------------------------------------STEINDTNKKASSVCDGFDMQLEDDVLL
                                                                                STEIND+NKKAS VCD FDMQLEDDVLL
Subjt:  ------------------------------------------------------------------------STEINDTNKKASSVCDGFDMQLEDDVLL

Query:  VGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENH
        VGNN GVLTDKDESKSSEED+TMK NASDPLKH+AN TSCEVK T++EAIL L+NSHLPMESS LSWKND +LSN +S+EFLKK VTME NTADHLNENH
Subjt:  VGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENH

Query:  LSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDALA
         +H WSGTNFV KEAD SN LLKSVVLSG +DHVVMDKD ++S +K AIFEDDP+S+LLN PR+ NGI FTNEE  MV DRNH QL TEILARKNDDAL 
Subjt:  LSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDALA

Query:  VKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKGTL
        +K+SNESLKNDTILEL+HDA YPLKNQPRCTSS  KYK EEVSSVSNDS L+L+S  +  KN KA IDKA D SCKEQANLELS EL LHCGE SIK TL
Subjt:  VKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKGTL

Query:  CNHGNVCEGDIVTSNGYPQESSIRCADVECIHE-------------------------------------------------------------------
        C++GN  EGD+VT NG  QE+ I C DVE IH+                                                                   
Subjt:  CNHGNVCEGDIVTSNGYPQESSIRCADVECIHE-------------------------------------------------------------------

Query:  ---------------------VEQASSILL--------------LKMETTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSF
                              EQAS +L                 METT K LENG+G SSNA DATSSE  S+VLT GETVEET PVSSLKPLAKGSF
Subjt:  ---------------------VEQASSILL--------------LKMETTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSF

Query:  SAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPS
        SAF  S +NL S T+VHEKP E NA+ EC SR SFE+  SPSYGN  S MK VSS+SSLSSMESLA THASRANDT FLP  YT  QG+ SKSTSSG PS
Subjt:  SAFRSSVNNLPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPS

Query:  FSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKEN
        FST  GCPHDSS YILDAEMETVDLGHKVT ++ECDV+DYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTD+E STNS QKL+KEN
Subjt:  FSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKEN

Query:  AST
         ST
Subjt:  AST

A0A5D3DW70 Fiber Fb32-like protein isoform 3    2.3e-285    56.96
Query:  MRYEAKPPAQRGHAPINAYFRSVSHNEGKAASNVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENM
        M+YEAK  AQRGH P+NAYFRS  HNEGKAASNV+N SSVGHGTS+ DQIDN S+ASC+VP VNEE AQVPN S+LELN DLPL+KND V+LDK L+E+M
Subjt:  MRYEAKPPAQRGHAPINAYFRSVSHNEGKAASNVINKSSVGHGTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENM

Query:  KENAV------------------------------------------------------------------------------------SELLSEKNDGS
        KEN V                                                                                    SELLSEKNDGS
Subjt:  KENAV------------------------------------------------------------------------------------SELLSEKNDGS

Query:  LTNKLTLMESDDSHPLSHSLSNV-----------------------------------------------------------------------------
        LT+KL+LME D S PLSHSLSNV                                                                             
Subjt:  LTNKLTLMESDDSHPLSHSLSNV-----------------------------------------------------------------------------

Query:  ------STEINDTNKKASSVCDGFDMQLEDDVLLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLS
              STEIND+NKKAS VCD FDMQLEDDVLLVGNN GVLTDKDESKSSEED+TMK NASDPLKH+AN TSCEVK T++EAIL L+NSHLPMESS LS
Subjt:  ------STEINDTNKKASSVCDGFDMQLEDDVLLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLS

Query:  WKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNEN
        WKND +LSN +S+EFLKK VTME NTADHLNENH +H WSGTNFV KEAD SN LLKSVVLSG +DHVVMDKD ++S +K AIFEDDP+S+LLN PR+ N
Subjt:  WKNDCDLSNGNSNEFLKKVVTMEPNTADHLNENHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNEN

Query:  GIIFTNEEASMVPDRNHQQLETEILARKNDDALAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKAS
        GI FTNEE  MV DRNH QL TEILARKNDDAL +K+SNESLKNDTILEL+HDA YPLKNQPRCTSS  KYK EEVSSVSNDS L+L+S  +  KN KA 
Subjt:  GIIFTNEEASMVPDRNHQQLETEILARKNDDALAVKYSNESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKAS

Query:  IDKAPDASCKEQANLELSAELTLHCGEVSIKGTLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVE-------------------------------
        IDKA D SCKEQANLELS EL LHCGE SIK TLC++GN  EGD+VT NG  QE+ I C DVE IH+ +                               
Subjt:  IDKAPDASCKEQANLELSAELTLHCGEVSIKGTLCNHGNVCEGDIVTSNGYPQESSIRCADVECIHEVE-------------------------------

Query:  --------QASSILLLK---------------METTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSST
                + +SI+L                 METT K LENG+G SSNA DATSSE  S+VLT GETVEET PVSSLKPLAKGSFSAF  S +NL S T
Subjt:  --------QASSILLLK---------------METTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSST

Query:  IVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSSMESL-------------------------------------------------
        +VHEKP E NA+ EC SR SFE+  SPSYGN  S MK VSS+SSLSSMESL                                                 
Subjt:  IVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSSMESL-------------------------------------------------

Query:  ---AGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQD
           A THASRANDT FLP  YT  QG+ SKSTSSG PSFST  GCPHDSS YILDAEMETVDLGHKVT ++ECDV+DYKALHAVSRRTQKLRSYKKRIQD
Subjt:  ---AGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQD

Query:  AFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST
        AFTSKKRLAKEYEQLAIWYGDTD+E STNS QKL+KEN ST
Subjt:  AFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT1G17780.2 unknown protein    8.2e-09    26.9
Query:  LPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPH
        L  S ++ EK  E   Y +C    +     +    ++++     S R+ ++  +    + +  + D + L    T +     +   +   SFS      +
Subjt:  LPSSTIVHEKPAEQNAYIECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPH

Query:  DSSAYILDAE---METVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST
            +I D +   M+T+DL + +T +++    D   L+A+  RT++LRS+K++I DA  SK+R  KEYEQLAIW+GD D+     +    DKE ++T
Subjt:  DSSAYILDAE---METVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENAST

AT1G73130.1 unknown protein    5.3e-08    38.55
Query:  VDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDK-ENASTICYLKLQRETQDLEMI
        V+   L+A+  RT+KLRS+K+++ D  TSK+R  KEYEQL IWYGD  +     +K++  + E   +   L L+ E    E++
Subjt:  VDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDK-ENASTICYLKLQRETQDLEMI

AT2G16575.1 unknown protein    5.7e-10    47.76
Query:  METVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDL
        M T+DL + +T +++    D   L+A+  RT++LRS+K++I DA  SK+R  KEYEQLAIW+GD D+
Subjt:  METVDLGHKVTLKDECDVVDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDL

AT2G31130.1 unknown protein    5.5e-13    62.5
Query:  KGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLP
        KGI WVGN++QKFEA+C EV+ II QD  KYVENQV +   +VK+  SDVV  LLP
Subjt:  KGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLP


Sequences
CDS sequence
ATGAAAATGGATTTGAAACATAAAGGTATATCATGGGTTGGGAACATGTTCCAAAAGTTTGAAGCGGTGTGCCAGGAAGTGGATAATATTATAAGCCAGGATAAGGTTAA
ATATGTTGAAAACCAGGTTAGTTCAGCACGTGCAAATGTGAAGAGATTATACTCTGATGTTGTTCAAGGTTTACTTCCACCTATAGGAGATCCCATGAGGTATGAAGCTA
AACCACCGGCCCAGAGGGGGCATGCTCCAATTAATGCATATTTCAGGTCAGTGTCACACAATGAAGGAAAAGCTGCAAGTAATGTTATTAATAAATCATCTGTGGGGCAT
GGTACTAGTACAATTGATCAAATAGATAACTGGAGTCGAGCATCTTGTAAAGTTCCCATTGTAAATGAAGAATTTGCTCAAGTTCCTAATCATTCTTCTCTAGAGTTGAA
TGCTGATTTACCTTTGGAAAAGAATGATGATGTCTTGTTAGATAAAGACTTATACGAGAACATGAAAGAAAATGCCGTTAGTGAACTACTTTCAGAGAAAAATGATGGCT
CATTGACAAATAAGCTTACCCTCATGGAGTCAGATGATAGTCATCCTTTGAGTCACTCACTGAGCAACGTAAGTACTGAAATTAATGATACTAATAAAAAAGCTTCTTCG
GTTTGTGATGGCTTTGATATGCAATTGGAGGATGATGTACTTTTAGTAGGGAACAATGATGGGGTTTTGACAGATAAAGATGAAAGTAAGAGTTCTGAAGAGGATACCAC
CATGAAGTTCAATGCTAGTGATCCTTTGAAGCATCTGGCTAATAGTACATCTTGTGAAGTTAAAGCTACTGATGAAGAAGCAATTCTGTTTCTGAATAATTCTCATTTGC
CAATGGAATCTTCCAGACTCTCGTGGAAAAATGACTGCGACTTATCAAATGGGAACTCAAATGAGTTTCTAAAGAAGGTTGTCACCATGGAGCCTAACACTGCGGATCAT
TTGAATGAAAATCATCTTAGTCATGAATGGAGTGGGACAAACTTTGTAAGTAAAGAAGCTGATCATTCTAATTTGCTTTTGAAGTCTGTGGTGCTTTCAGGCAGAATTGA
TCATGTCGTGATGGATAAAGACTCCAATAAGAGTCCTGTGAAGTGTGCTATCTTTGAGGATGATCCTAAAAGTTATTTGTTAAATCAGCCCAGGAATGAAAATGGAATTA
TCTTCACCAACGAAGAAGCTAGTATGGTTCCTGATAGAAACCATCAGCAGTTGGAGACTGAGATACTTGCTAGAAAGAATGATGATGCCTTGGCAGTTAAATACTCCAAT
GAAAGTTTAAAAAATGATACCATCTTGGAGTTGAAGCATGATGCAATTTATCCTTTAAAGAACCAGCCAAGATGCACATCAAGCATCATAAAATATAAAAATGAAGAAGT
TTCTTCAGTTTCAAATGATTCTTTGCTAGAGTTGGAGAGTGAGGATATTTTTGAGAAGAATAGTAAAGCTTCAATAGATAAAGCACCAGATGCAAGTTGTAAAGAACAGG
CCAATTTAGAATTATCAGCTGAGTTAACTTTGCATTGTGGTGAAGTGTCAATCAAGGGAACTTTGTGCAATCATGGTAATGTATGTGAAGGGGATATTGTGACCTCGAAT
GGATATCCACAGGAAAGTTCAATTCGTTGTGCAGATGTTGAATGCATCCATGAAGTAGAACAAGCATCCAGCATCTTGTTACTAAAGATGGAGACAACTTCGAAGGACTT
GGAAAATGGAGTTGGTTATTCTTCTAATGCTGCAGATGCTACTTCTTCTGAACTGGATTCAGTAGTTTTAACTTGTGGGGAAACTGTGGAAGAGACAAATCCAGTCTCCT
CTTTGAAACCCCTAGCAAAAGGTTCTTTTTCTGCTTTCAGAAGTTCAGTCAACAACCTTCCTAGTAGCACCATTGTTCATGAAAAACCTGCTGAACAGAATGCATACATT
GAATGTGGATCTCGTCCATCATTTGAACTGGTCACTAGTCCATCTTATGGAAACAAGACTTCGAAGATGAAATTTGTCTCCTCCAGAAGCTCCTTATCATCAATGGAATC
ATTAGCTGGGACTCATGCTTCAAGAGCCAATGATACTGCATTTCTTCCTGACGTCTATACTAGTAGTCAGGGTGAGTTTTCCAAATCTACTAGTTCTGGGATTCCAAGTT
TCTCTACTGGAGGAGGTTGTCCACATGATTCCAGTGCTTATATTCTGGATGCTGAAATGGAAACAGTGGATTTGGGACATAAAGTGACCCTCAAAGACGAGTGTGATGTT
GTTGACTATAAAGCTCTCCATGCTGTCTCTCGCAGAACTCAAAAGCTCCGTTCTTACAAGAAGAGAATCCAGGATGCTTTTACTTCCAAAAAGAGGTTGGCAAAGGAGTA
TGAACAACTAGCAATCTGGTATGGAGATACTGATCTGGAACTCAGTACAAACAGTAAACAGAAGTTGGACAAGGAGAATGCATCAACTATTTGTTACCTCAAGCTTCAGA
GAGAGACACAAGACTTGGAAATGATCTATGAAACTCACAAGTACACATTCAAGATGATCACACATCAATATTTGGAGAGAGATTTTTATGAGACTACAAAAGCTGATTTA
GGTACTGCTAGAAATGAAGAAATTAAGTGGATTTTGAAAAGAAGTTTCCCTTCAACTTTGTTGGATTAG
mRNA sequence
AAAAAAGTTAAAACCTTGTTCCTGGGCAGCTGCCGATGAAGAACAGCAGCCTTCCAACCCTGTGATCATCGACAGGCTGCTCAGGTGAATGAAAATGGATTTGAAACATA
AAGGTATATCATGGGTTGGGAACATGTTCCAAAAGTTTGAAGCGGTGTGCCAGGAAGTGGATAATATTATAAGCCAGGATAAGGTTAAATATGTTGAAAACCAGGTTAGT
TCAGCACGTGCAAATGTGAAGAGATTATACTCTGATGTTGTTCAAGGTTTACTTCCACCTATAGGAGATCCCATGAGGTATGAAGCTAAACCACCGGCCCAGAGGGGGCA
TGCTCCAATTAATGCATATTTCAGGTCAGTGTCACACAATGAAGGAAAAGCTGCAAGTAATGTTATTAATAAATCATCTGTGGGGCATGGTACTAGTACAATTGATCAAA
TAGATAACTGGAGTCGAGCATCTTGTAAAGTTCCCATTGTAAATGAAGAATTTGCTCAAGTTCCTAATCATTCTTCTCTAGAGTTGAATGCTGATTTACCTTTGGAAAAG
AATGATGATGTCTTGTTAGATAAAGACTTATACGAGAACATGAAAGAAAATGCCGTTAGTGAACTACTTTCAGAGAAAAATGATGGCTCATTGACAAATAAGCTTACCCT
CATGGAGTCAGATGATAGTCATCCTTTGAGTCACTCACTGAGCAACGTAAGTACTGAAATTAATGATACTAATAAAAAAGCTTCTTCGGTTTGTGATGGCTTTGATATGC
AATTGGAGGATGATGTACTTTTAGTAGGGAACAATGATGGGGTTTTGACAGATAAAGATGAAAGTAAGAGTTCTGAAGAGGATACCACCATGAAGTTCAATGCTAGTGAT
CCTTTGAAGCATCTGGCTAATAGTACATCTTGTGAAGTTAAAGCTACTGATGAAGAAGCAATTCTGTTTCTGAATAATTCTCATTTGCCAATGGAATCTTCCAGACTCTC
GTGGAAAAATGACTGCGACTTATCAAATGGGAACTCAAATGAGTTTCTAAAGAAGGTTGTCACCATGGAGCCTAACACTGCGGATCATTTGAATGAAAATCATCTTAGTC
ATGAATGGAGTGGGACAAACTTTGTAAGTAAAGAAGCTGATCATTCTAATTTGCTTTTGAAGTCTGTGGTGCTTTCAGGCAGAATTGATCATGTCGTGATGGATAAAGAC
TCCAATAAGAGTCCTGTGAAGTGTGCTATCTTTGAGGATGATCCTAAAAGTTATTTGTTAAATCAGCCCAGGAATGAAAATGGAATTATCTTCACCAACGAAGAAGCTAG
TATGGTTCCTGATAGAAACCATCAGCAGTTGGAGACTGAGATACTTGCTAGAAAGAATGATGATGCCTTGGCAGTTAAATACTCCAATGAAAGTTTAAAAAATGATACCA
TCTTGGAGTTGAAGCATGATGCAATTTATCCTTTAAAGAACCAGCCAAGATGCACATCAAGCATCATAAAATATAAAAATGAAGAAGTTTCTTCAGTTTCAAATGATTCT
TTGCTAGAGTTGGAGAGTGAGGATATTTTTGAGAAGAATAGTAAAGCTTCAATAGATAAAGCACCAGATGCAAGTTGTAAAGAACAGGCCAATTTAGAATTATCAGCTGA
GTTAACTTTGCATTGTGGTGAAGTGTCAATCAAGGGAACTTTGTGCAATCATGGTAATGTATGTGAAGGGGATATTGTGACCTCGAATGGATATCCACAGGAAAGTTCAA
TTCGTTGTGCAGATGTTGAATGCATCCATGAAGTAGAACAAGCATCCAGCATCTTGTTACTAAAGATGGAGACAACTTCGAAGGACTTGGAAAATGGAGTTGGTTATTCT
TCTAATGCTGCAGATGCTACTTCTTCTGAACTGGATTCAGTAGTTTTAACTTGTGGGGAAACTGTGGAAGAGACAAATCCAGTCTCCTCTTTGAAACCCCTAGCAAAAGG
TTCTTTTTCTGCTTTCAGAAGTTCAGTCAACAACCTTCCTAGTAGCACCATTGTTCATGAAAAACCTGCTGAACAGAATGCATACATTGAATGTGGATCTCGTCCATCAT
TTGAACTGGTCACTAGTCCATCTTATGGAAACAAGACTTCGAAGATGAAATTTGTCTCCTCCAGAAGCTCCTTATCATCAATGGAATCATTAGCTGGGACTCATGCTTCA
AGAGCCAATGATACTGCATTTCTTCCTGACGTCTATACTAGTAGTCAGGGTGAGTTTTCCAAATCTACTAGTTCTGGGATTCCAAGTTTCTCTACTGGAGGAGGTTGTCC
ACATGATTCCAGTGCTTATATTCTGGATGCTGAAATGGAAACAGTGGATTTGGGACATAAAGTGACCCTCAAAGACGAGTGTGATGTTGTTGACTATAAAGCTCTCCATG
CTGTCTCTCGCAGAACTCAAAAGCTCCGTTCTTACAAGAAGAGAATCCAGGATGCTTTTACTTCCAAAAAGAGGTTGGCAAAGGAGTATGAACAACTAGCAATCTGGTAT
GGAGATACTGATCTGGAACTCAGTACAAACAGTAAACAGAAGTTGGACAAGGAGAATGCATCAACTATTTGTTACCTCAAGCTTCAGAGAGAGACACAAGACTTGGAAAT
GATCTATGAAACTCACAAGTACACATTCAAGATGATCACACATCAATATTTGGAGAGAGATTTTTATGAGACTACAAAAGCTGATTTAGGTACTGCTAGAAATGAAGAAA
TTAAGTGGATTTTGAAAAGAAGTTTCCCTTCAACTTTGTTGGATTAGATTATTTGGCTAAGACAACAACATGAGCCATGTGTATTTATTGGGAGTTTCATATCAAACAAG
AGGGCTATGAAATACTTCAAAAGATAACAAGAAGAGATACTCTGAGAGGCTGTAAAT
Protein sequence
MKMDLKHKGISWVGNMFQKFEAVCQEVDNIISQDKVKYVENQVSSARANVKRLYSDVVQGLLPPIGDPMRYEAKPPAQRGHAPINAYFRSVSHNEGKAASNVINKSSVGH
GTSTIDQIDNWSRASCKVPIVNEEFAQVPNHSSLELNADLPLEKNDDVLLDKDLYENMKENAVSELLSEKNDGSLTNKLTLMESDDSHPLSHSLSNVSTEINDTNKKASS
VCDGFDMQLEDDVLLVGNNDGVLTDKDESKSSEEDTTMKFNASDPLKHLANSTSCEVKATDEEAILFLNNSHLPMESSRLSWKNDCDLSNGNSNEFLKKVVTMEPNTADH
LNENHLSHEWSGTNFVSKEADHSNLLLKSVVLSGRIDHVVMDKDSNKSPVKCAIFEDDPKSYLLNQPRNENGIIFTNEEASMVPDRNHQQLETEILARKNDDALAVKYSN
ESLKNDTILELKHDAIYPLKNQPRCTSSIIKYKNEEVSSVSNDSLLELESEDIFEKNSKASIDKAPDASCKEQANLELSAELTLHCGEVSIKGTLCNHGNVCEGDIVTSN
GYPQESSIRCADVECIHEVEQASSILLLKMETTSKDLENGVGYSSNAADATSSELDSVVLTCGETVEETNPVSSLKPLAKGSFSAFRSSVNNLPSSTIVHEKPAEQNAYI
ECGSRPSFELVTSPSYGNKTSKMKFVSSRSSLSSMESLAGTHASRANDTAFLPDVYTSSQGEFSKSTSSGIPSFSTGGGCPHDSSAYILDAEMETVDLGHKVTLKDECDV
VDYKALHAVSRRTQKLRSYKKRIQDAFTSKKRLAKEYEQLAIWYGDTDLELSTNSKQKLDKENASTICYLKLQRETQDLEMIYETHKYTFKMITHQYLERDFYETTKADL
GTARNEEIKWILKRSFPSTLLD