; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007636 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007636
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDUF4220 domain-containing protein
Genome locationChr10:8883122..8884931
RNA-Seq ExpressionHG10007636
SyntenyHG10007636
Gene Ontology termsNA
InterPro domainsIPR007658 - Protein of unknown function DUF594
IPR025315 - Domain of unknown function DUF4220


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037446.1 uncharacterized protein E6C27_scaffold277G00320 [Cucumis melo var. makuwa]5.8e-17959.23Show/hide
Query:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI
        MGLKRKHSSSQFLR FLL  YSF+DWI NFSF +LVE+YG+GCYD+F  P YIIRAFLA FLLLHLGGSDTITAYSMEDNELWLR LLS+L  L AS+YI
Subjt:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI

Query:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK
        FL AL PTSLNYV+IPV +AGIIK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D +MLH+ Y FFNRDKRLFVGLGPTSYD  QNRL YY+K
Subjt:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK

Query:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------
        F S S FKIIELELGFMYDFFYTK+ INHS+ G L RLTTFSSL                                                        
Subjt:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------

Query:  ------------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINE
                                                   + SYY KF  TK+MAAFSVQ RPISNNLE  IFQQLK+KL      L        NE
Subjt:  ------------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINE

Query:  VGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKIN
        +GWSL  DLD++ILLWHIATD CY+SS   + +EE S+S      SI +SNFLAY +VH PSLFPSGMSQIRHKATSE VL+ ++DKKL      MLK N
Subjt:  VGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKIN

Query:  LELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTM------EEH
        LELKIE  KE +  SM+ DACR+A  LEK+E  +KWEIIGNVWVELL RISCE EWYDHAK LTQGGNL+TRVWILMHHLG  KP DV TM      ++ 
Subjt:  LELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTM------EEH

Query:  RPLLDHEIIPDSVVAQMFDVIFN
         PLL H+I+ D VV QM +VIFN
Subjt:  RPLLDHEIIPDSVVAQMFDVIFN

XP_004139148.1 uncharacterized protein LOC101222078 [Cucumis sativus]4.6e-17657.69Show/hide
Query:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI
        +GLKRK SSSQFLR FLL  Y+F+DWI NFSF +LVE+YG+GCYD+F  P Y+IRAFLA FLLLHLGGSDTITAYSMEDNELWLR LLSML  L AS+YI
Subjt:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI

Query:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK
        FL AL PTSLNY++IPV +AG+IK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D ++L + Y FF RDKRLFVGLGPTSYD  QNRL YY+K
Subjt:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK

Query:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------
        F SKS FKIIELELGFMYDFFYTK+ INHS+ G L RLTTFSSL                                                        
Subjt:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------

Query:  -----------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEV
                                                 KN SYY KF  TK++AAFSVQ RPISNNLE  IFQQLKQKL      L        NE+
Subjt:  -----------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEV

Query:  GWSLNFDLDETILLWHIATDICYHSSKIEERAEESS-----KSSILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKINLE
        GWSL  DLD++IL+WHIATD CYHSS   + +EES      + S+ +SNFLAY +VH PSLFPSGMSQIRHKATSEHVL+ ++D+KL      MLK NLE
Subjt:  GWSLNFDLDETILLWHIATDICYHSSKIEERAEESS-----KSSILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKINLE

Query:  LKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLT------MEEHRP
        L IE  KE +  S + DA R+A  LEK+E  +KWEIIGNVWVELL RISCECEWYDHAK LTQGG+L+TRVWILMHHLGYLK  DV T       ++  P
Subjt:  LKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLT------MEEHRP

Query:  LLDHEIIPDSVVAQMFDVIFNIAS
        LL HEI+ D VV QM +V+FN +S
Subjt:  LLDHEIIPDSVVAQMFDVIFNIAS

XP_008458716.1 PREDICTED: uncharacterized protein LOC103498043 [Cucumis melo]3.7e-17858.91Show/hide
Query:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI
        MGLKRKHSSSQF R FLL  YSF+DWI NFSF +LVE+YG+GCYD+F  P YIIRAFLA FLLLHLGG DTITAYSMEDNELWLR LLS+L  L AS+YI
Subjt:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI

Query:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK
        FL AL PTSLNYV+IPV +AGIIK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D +MLH+ Y FFNRDKRLFVGLGPTSYD  QNRLCYY+K
Subjt:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK

Query:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------
        F S S FKIIELELGFMYDFFYTK+ INHS+ G L RLTTFSSL                                                        
Subjt:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------

Query:  ------------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINE
                                                   + SYY KF  TK+MAAFSVQ RPISNNLE  IFQQLK+KL      L        NE
Subjt:  ------------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINE

Query:  VGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKIN
        +GWSL  DLD++ILLWHIATD CY+SS   + +EE S+S      SI +SNFLAY +VH PSLFPS MSQIRHKATSE VL+ ++DKKL      MLK N
Subjt:  VGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKIN

Query:  LELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTM------EEH
        LELKIE  KE +  SM+ DACR+A  LEK+E  +KWEIIGNVWVELL RISCE EWYDHAK LTQGGNL+TRVWILMHHLG +KP DV TM      ++ 
Subjt:  LELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTM------EEH

Query:  RPLLDHEIIPDSVVAQMFDVIFN
         PLL H+I+ D VV QM +VIFN
Subjt:  RPLLDHEIIPDSVVAQMFDVIFN

XP_022141971.1 uncharacterized protein LOC111012216 [Momordica charantia]3.6e-10439.97Show/hide
Query:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI
        MGL+RK+SS+  LR+ LL  Y  ADW    S G LV+ YGS   D FF   +I    LAPF+LLHLGGSDTITAYSMEDN+LW R+     +Q+  + YI
Subjt:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI

Query:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLV-VSNHSPITIH----------------------NQEEVLDLQMLHLTYNFFNRDK
         LLALQP  L+++ IP+FVAGIIKY E+IW FRS S +RL D L+  +  SPI I+                             L +LH+ Y FF  +K
Subjt:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLV-VSNHSPITIH----------------------NQEEVLDLQMLHLTYNFFNRDK

Query:  RLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSS----------------------------------
         LFV L  TSYD  Q+ L Y+ +F+S+  FK+IELELGFMYDFFYTK+ I HS WG +LRLTT  S                                  
Subjt:  RLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSS----------------------------------

Query:  ------------------------------------------------------------LKNQSY--YSKF--TKSMAAFSVQRRPISNNLETQIFQQL
                                                                    LK   Y  Y K+  T  ++      R IS+ L+T+IFQQL
Subjt:  ------------------------------------------------------------LKNQSY--YSKF--TKSMAAFSVQRRPISNNLETQIFQQL

Query:  KQKLEGSSSTLEDSKKV---------IINEVGWSLNFDLDETILLWHIATDICYHSSKIEERAEES-SKSSILVSNFLAYLVVHCPSLFPSGMSQIRHKA
         QKLE +    E+++K+           N++GWSL  D D++ILLWHIAT+ICYH  K  E +  S  +   L+S+FL YL+V+  SLF  GMS+IR   
Subjt:  KQKLEGSSSTLEDSKKV---------IINEVGWSLNFDLDETILLWHIATDICYHSSKIEERAEES-SKSSILVSNFLAYLVVHCPSLFPSGMSQIRHKA

Query:  TSEHVLQFVEDKKLM--------LKINLELKIEEAKESKSMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWI
        T +  ++F++ +K +          ++LE          S+ F  CR+AR+L+ +EG ++WEII +VWVE+LA ISCEC WY+HAK L  GGNLLT VW+
Subjt:  TSEHVLQFVEDKKLM--------LKINLELKIEEAKESKSMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWI

Query:  LMHHLGYLKPADVLTMEEHR-PLLDHEI
        LMHHLGY+KPA++ TM++ +  +LDH +
Subjt:  LMHHLGYLKPADVLTMEEHR-PLLDHEI

XP_038880416.1 uncharacterized protein LOC120072067 [Benincasa hispida]1.3e-14958.3Show/hide
Query:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI
        MGL RK +S+QFLR+FLLF YSFADWIT+FSFGILVEKYGSGCYDEF  PTYIIRA LAPFLLLHLGGSDTITAYSMED ELWLR LL ML QL+AS Y+
Subjt:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI

Query:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK
        FLLALQPTSL YVAIP+FVAGIIKY EKIWA R+ASAERLRDF+ VS  S I  H+QEE+ D+QMLH  Y+FFN+DKR+FVGLGPTS+DRHQN L YY++
Subjt:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK

Query:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSLKNQSYYSKFTKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINE
        FNSK  FKIIELELGFMYDFFYTK+ INHS  G L  L TFSSL                                                        
Subjt:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSLKNQSYYSKFTKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINE

Query:  VGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKSSILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL-----MLKINLELKIE-
                                                      +  +V +C       M   + K TSEHVL+ ++DKKL     MLK N+ELKIE 
Subjt:  VGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKSSILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL-----MLKINLELKIE-

Query:  -----EAKESKSMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTMEEHRPLLDHEII
             E + +KSML D CR+ARQLE++E  KKWEIIGNVW+ELL RISCECEWYDHAK LTQGGNLLTRVWILMHHLGY+KP++V TMEE +PLLDHEI+
Subjt:  -----EAKESKSMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTMEEHRPLLDHEII

Query:  PDSVVAQMFDVIFNIASL
        PD V+ QMFDVIFNI SL
Subjt:  PDSVVAQMFDVIFNIASL

TrEMBL top hitse value%identityAlignment
A0A0A0LZZ2 DUF4220 domain-containing protein2.2e-17657.69Show/hide
Query:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI
        +GLKRK SSSQFLR FLL  Y+F+DWI NFSF +LVE+YG+GCYD+F  P Y+IRAFLA FLLLHLGGSDTITAYSMEDNELWLR LLSML  L AS+YI
Subjt:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI

Query:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK
        FL AL PTSLNY++IPV +AG+IK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D ++L + Y FF RDKRLFVGLGPTSYD  QNRL YY+K
Subjt:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK

Query:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------
        F SKS FKIIELELGFMYDFFYTK+ INHS+ G L RLTTFSSL                                                        
Subjt:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------

Query:  -----------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEV
                                                 KN SYY KF  TK++AAFSVQ RPISNNLE  IFQQLKQKL      L        NE+
Subjt:  -----------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEV

Query:  GWSLNFDLDETILLWHIATDICYHSSKIEERAEESS-----KSSILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKINLE
        GWSL  DLD++IL+WHIATD CYHSS   + +EES      + S+ +SNFLAY +VH PSLFPSGMSQIRHKATSEHVL+ ++D+KL      MLK NLE
Subjt:  GWSLNFDLDETILLWHIATDICYHSSKIEERAEESS-----KSSILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKINLE

Query:  LKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLT------MEEHRP
        L IE  KE +  S + DA R+A  LEK+E  +KWEIIGNVWVELL RISCECEWYDHAK LTQGG+L+TRVWILMHHLGYLK  DV T       ++  P
Subjt:  LKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLT------MEEHRP

Query:  LLDHEIIPDSVVAQMFDVIFNIAS
        LL HEI+ D VV QM +V+FN +S
Subjt:  LLDHEIIPDSVVAQMFDVIFNIAS

A0A1S3C8L7 uncharacterized protein LOC1034980431.8e-17858.91Show/hide
Query:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI
        MGLKRKHSSSQF R FLL  YSF+DWI NFSF +LVE+YG+GCYD+F  P YIIRAFLA FLLLHLGG DTITAYSMEDNELWLR LLS+L  L AS+YI
Subjt:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI

Query:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK
        FL AL PTSLNYV+IPV +AGIIK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D +MLH+ Y FFNRDKRLFVGLGPTSYD  QNRLCYY+K
Subjt:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK

Query:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------
        F S S FKIIELELGFMYDFFYTK+ INHS+ G L RLTTFSSL                                                        
Subjt:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------

Query:  ------------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINE
                                                   + SYY KF  TK+MAAFSVQ RPISNNLE  IFQQLK+KL      L        NE
Subjt:  ------------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINE

Query:  VGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKIN
        +GWSL  DLD++ILLWHIATD CY+SS   + +EE S+S      SI +SNFLAY +VH PSLFPS MSQIRHKATSE VL+ ++DKKL      MLK N
Subjt:  VGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKIN

Query:  LELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTM------EEH
        LELKIE  KE +  SM+ DACR+A  LEK+E  +KWEIIGNVWVELL RISCE EWYDHAK LTQGGNL+TRVWILMHHLG +KP DV TM      ++ 
Subjt:  LELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTM------EEH

Query:  RPLLDHEIIPDSVVAQMFDVIFN
         PLL H+I+ D VV QM +VIFN
Subjt:  RPLLDHEIIPDSVVAQMFDVIFN

A0A5B7BVN3 DUF4220 domain-containing protein4.2e-7431.12Show/hide
Query:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI
        +G +RK+++  +LR+ L  +Y  ADW+   + G+L    G     +   P+Y+I AF APFLL+HLGG DTITAY++EDNELWLR LL +++Q+  +LY+
Subjt:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI

Query:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVS--------------------------NHSPITIHNQEEVLDLQMLHLTYNFFN
        F+ +L PT LN+VA+P+FVAGIIKY E+ W  RSAS++  R+ L+                              +P          D   L+  Y+FF 
Subjt:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVS--------------------------NHSPITIHNQEEVLDLQMLHLTYNFFN

Query:  RDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL------------------------------
          +RLF  L   S+   +  L ++ + + ++ F++IE+ELGFMYD  YTK+ I +S+WG  LR T  SS                               
Subjt:  RDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL------------------------------

Query:  ---------------------------------KNQSYYS------KFTKSMAAFSV-------------------------------QRRPISNNLETQ
                                         K  S ++      K++ SMA +++                                   +S NL+  
Subjt:  ---------------------------------KNQSYYS------KFTKSMAAFSV-------------------------------QRRPISNNLETQ

Query:  IFQQLKQKLEGSSSTLEDSKKV-------------IINEVGWSLNFDLDETILLWHIATDICYHSSKIEERA----EESSKSSILVSNFLAYLVVHCPSL
        IF+QL +  +G+S    D KK+              + ++GWS+  + D +ILLWHIATD+CY+S   +E A        K S L+SN++ Y++V  P +
Subjt:  IFQQLKQKLEGSSSTLEDSKKV-------------IINEVGWSLNFDLDETILLWHIATDICYHSSKIEERA----EESSKSSILVSNFLAYLVVHCPSL

Query:  FPSGMSQIRHKATSEHVLQFVEDK---------------KLMLKINLELKIEEAK--ESKSMLFDACRVARQLEKVEGLK------KWEIIGNVWVELLA
         P+G+ QIR + T    ++F E+                K +L++++++   E K   SKS+LFDAC++A+ L+ +E  K      KWE++ +VW+E+L+
Subjt:  FPSGMSQIRHKATSEHVLQFVEDK---------------KLMLKINLELKIEEAK--ESKSMLFDACRVARQLEKVEGLK------KWEIIGNVWVELLA

Query:  RISCECEWYDHAKMLTQGGNLLTRVWILMHHLG
          + +C W  HA+ L +GG LLT VW+LM HLG
Subjt:  RISCECEWYDHAKMLTQGGNLLTRVWILMHHLG

A0A5D3BS41 DUF4220 domain-containing protein2.8e-17959.23Show/hide
Query:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI
        MGLKRKHSSSQFLR FLL  YSF+DWI NFSF +LVE+YG+GCYD+F  P YIIRAFLA FLLLHLGGSDTITAYSMEDNELWLR LLS+L  L AS+YI
Subjt:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI

Query:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK
        FL AL PTSLNYV+IPV +AGIIK  EKIWA RSASAERLRDFL VS  SPIT HN+EEV D +MLH+ Y FFNRDKRLFVGLGPTSYD  QNRL YY+K
Subjt:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDK

Query:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------
        F S S FKIIELELGFMYDFFYTK+ INHS+ G L RLTTFSSL                                                        
Subjt:  FNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL--------------------------------------------------------

Query:  ------------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINE
                                                   + SYY KF  TK+MAAFSVQ RPISNNLE  IFQQLK+KL      L        NE
Subjt:  ------------------------------------------KNQSYYSKF--TKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINE

Query:  VGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKIN
        +GWSL  DLD++ILLWHIATD CY+SS   + +EE S+S      SI +SNFLAY +VH PSLFPSGMSQIRHKATSE VL+ ++DKKL      MLK N
Subjt:  VGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKS------SILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKL------MLKIN

Query:  LELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTM------EEH
        LELKIE  KE +  SM+ DACR+A  LEK+E  +KWEIIGNVWVELL RISCE EWYDHAK LTQGGNL+TRVWILMHHLG  KP DV TM      ++ 
Subjt:  LELKIEEAKESK--SMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTM------EEH

Query:  RPLLDHEIIPDSVVAQMFDVIFN
         PLL H+I+ D VV QM +VIFN
Subjt:  RPLLDHEIIPDSVVAQMFDVIFN

A0A6J1CKT2 uncharacterized protein LOC1110122161.7e-10439.97Show/hide
Query:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI
        MGL+RK+SS+  LR+ LL  Y  ADW    S G LV+ YGS   D FF   +I    LAPF+LLHLGGSDTITAYSMEDN+LW R+     +Q+  + YI
Subjt:  MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYI

Query:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLV-VSNHSPITIH----------------------NQEEVLDLQMLHLTYNFFNRDK
         LLALQP  L+++ IP+FVAGIIKY E+IW FRS S +RL D L+  +  SPI I+                             L +LH+ Y FF  +K
Subjt:  FLLALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLV-VSNHSPITIH----------------------NQEEVLDLQMLHLTYNFFNRDK

Query:  RLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSS----------------------------------
         LFV L  TSYD  Q+ L Y+ +F+S+  FK+IELELGFMYDFFYTK+ I HS WG +LRLTT  S                                  
Subjt:  RLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSS----------------------------------

Query:  ------------------------------------------------------------LKNQSY--YSKF--TKSMAAFSVQRRPISNNLETQIFQQL
                                                                    LK   Y  Y K+  T  ++      R IS+ L+T+IFQQL
Subjt:  ------------------------------------------------------------LKNQSY--YSKF--TKSMAAFSVQRRPISNNLETQIFQQL

Query:  KQKLEGSSSTLEDSKKV---------IINEVGWSLNFDLDETILLWHIATDICYHSSKIEERAEES-SKSSILVSNFLAYLVVHCPSLFPSGMSQIRHKA
         QKLE +    E+++K+           N++GWSL  D D++ILLWHIAT+ICYH  K  E +  S  +   L+S+FL YL+V+  SLF  GMS+IR   
Subjt:  KQKLEGSSSTLEDSKKV---------IINEVGWSLNFDLDETILLWHIATDICYHSSKIEERAEES-SKSSILVSNFLAYLVVHCPSLFPSGMSQIRHKA

Query:  TSEHVLQFVEDKKLM--------LKINLELKIEEAKESKSMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWI
        T +  ++F++ +K +          ++LE          S+ F  CR+AR+L+ +EG ++WEII +VWVE+LA ISCEC WY+HAK L  GGNLLT VW+
Subjt:  TSEHVLQFVEDKKLM--------LKINLELKIEEAKESKSMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWI

Query:  LMHHLGYLKPADVLTMEEHR-PLLDHEI
        LMHHLGY+KPA++ TM++ +  +LDH +
Subjt:  LMHHLGYLKPADVLTMEEHR-PLLDHEI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G45460.1 unknown protein7.6e-2830.14Show/hide
Query:  RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLA
        RK +  + L + +  +Y  ADW  NF+ G++ +  G     +       + A  APFLLLHLGG DTITA+++EDN LWLR +  ++ Q +A +Y+ L +
Subjt:  RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLA

Query:  LQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVV----------------------------------SNHSPITIHN--------QEEVLD
        L P SL    + VF++G IKY E+  A  SAS ++ RD ++                                     H P  + +        ++E+  
Subjt:  LQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVV----------------------------------SNHSPITIHN--------QEEVLD

Query:  LQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL
        L++    Y FFN  K L V L  +  +R Q+   + +  + +   +IIE+ELGF+YD  +TK+ + H++ G + R+    SL
Subjt:  LQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL

AT5G45470.1 Protein of unknown function (DUF594)2.9e-2729.79Show/hide
Query:  RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLA
        RK +  + L + +  +Y  ADW  NF+ G++ +  G     +       + A  APFLLLHLGG DTITA+++EDN LWLR +  ++ Q +A +Y+ +++
Subjt:  RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLA

Query:  LQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVS-----NHSPI-------------------------------------TIHNQEEVLD
        L P SL  V + VFV+G IKY E+  A  SAS ++ RD ++ +     N++ +                                     +   ++++ D
Subjt:  LQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVS-----NHSPI-------------------------------------TIHNQEEVLD

Query:  LQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL
        L+++   Y FFN  K L V L  +  +R ++   + +  + +   +IIE+ELGF+YD  +TK  I H+  G + R+    +L
Subjt:  LQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL

AT5G45470.1 Protein of unknown function (DUF594)1.5e-2329.76Show/hide
Query:  VQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSL------------------NFDLDETILLWHIATDICYHSSK---IEERAEESSKS-
        V   P++  L   IF++LK K +   S  E++K++ +    W+L                    D D+++L+WHIAT++CY   +   I E  +E  K  
Subjt:  VQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSL------------------NFDLDETILLWHIATDICYHSSK---IEERAEESSKS-

Query:  -----SILVSNFLAYLVVHCPSLFP--SGMSQIRHKATSEHVLQFVEDKKL----------MLKINLELKIE----EAKESKSMLFDACRVAR---QLEK
             S ++S+++ YL++  P L    +G+ +IR + T     +F + + +          +  +++E +IE    +   SKS+LFDA R+A+   ++EK
Subjt:  -----SILVSNFLAYLVVHCPSLFP--SGMSQIRHKATSEHVLQFVEDKKL----------MLKINLELKIE----EAKESKSMLFDACRVAR---QLEK

Query:  VEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLG
             KWEI+  VWVELL   +C C+   H + L++GG L+  VW+LM H G
Subjt:  VEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLG

AT5G45480.1 Protein of unknown function (DUF594)9.3e-2631.52Show/hide
Query:  KRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLL
        +RK SS + L  F+   Y  ADW  NF+ G + +  G          +  + AF  PFLLLHLGG DTITA ++EDNELWLR LL +  Q VA++Y+ L 
Subjt:  KRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLL

Query:  ALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVV-----SNHSPI------------------------------TIHNQEEVLDLQMLHL
        +L P +L    + VF  G+IKY E+  A   AS ++ +D ++       N++ +                               +   +    L +L  
Subjt:  ALQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVV-----SNHSPI------------------------------TIHNQEEVLDLQMLHL

Query:  TYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL
         Y +FN  K L V L  T   R +++  ++D   ++   +I+E+EL F+Y   YTK+ I H+  G L R      L
Subjt:  TYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSSL

AT5G45480.1 Protein of unknown function (DUF594)4.5e-2027.7Show/hide
Query:  LEGSSSTLEDSKKVIINEVGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKSSILVSNFLAYLVVHCPSLFPS--GMSQIRHKATSEHVLQFVED
        ++G   T +  +K++     + +  D D+++L+WHIAT++ Y + K  +    + + S ++S+++ YL++  P+L  +  G+ +IR + T E   +F + 
Subjt:  LEGSSSTLEDSKKVIINEVGWSLNFDLDETILLWHIATDICYHSSKIEERAEESSKSSILVSNFLAYLVVHCPSLFPS--GMSQIRHKATSEHVLQFVED

Query:  KKLM-----------------LKINLELKIE----EAKESKSMLFDACRVARQLE-----KVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGN
        + +M                 L + +  K E    +   SKS+LFD   +A++L+     K +  + W+I+  VWVELL+  + +C   +HA  L++GG 
Subjt:  KKLM-----------------LKINLELKIE----EAKESKSMLFDACRVARQLE-----KVEGLKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGN

Query:  LLTRVWILMHHLG
        L++ VW+LM H G
Subjt:  LLTRVWILMHHLG

AT5G45530.1 Protein of unknown function (DUF594)2.4e-2921.27Show/hide
Query:  RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLA
        RK +S + L   L   Y  ADW  N++   + +  G             + A  APFLLLHLGG DTITA ++EDN LW R L  ++ Q +A +Y  + +
Subjt:  RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLA

Query:  LQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVV-----SNHS------------------------------PITIHNQEEVLDLQMLHLT
        L+      + + +F+ G IKY E+  A  SAS ++ +D ++      SN++                              P  +    ++ DL+++   
Subjt:  LQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVV-----SNHS------------------------------PITIHNQEEVLDLQMLHLT

Query:  YNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSS------------LKNQSYYSK-----
        + FFN  K L V L  +  +R ++R  ++ +       +IIE ELGF+Y+  YTK+ I H+  G L RL +F S            LK++ ++       
Subjt:  YNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLRLTTFSS------------LKNQSYYSK-----

Query:  ---------------------------------------------------------------------------FTKSMAA------------------
                                                                                   FT+  +                   
Subjt:  ---------------------------------------------------------------------------FTKSMAA------------------

Query:  -----------------------------------------------------------------------------------FSV----------QRRP
                                                                                           FSV           R P
Subjt:  -----------------------------------------------------------------------------------FSV----------QRRP

Query:  ISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSL----------------NFDLDETILLWHIATDICYHS------SKIEERAEESSKSSILVS
        ++ N    IF ++K K  G + T E +KKV      W+L                  D D+++LLWHIAT++C+         K+     +  + S ++S
Subjt:  ISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSL----------------NFDLDETILLWHIATDICYHS------SKIEERAEESSKSSILVS

Query:  NFLAYLVVHCPSLFP--SGMSQIRHKATSEHVLQFVEDKK--------------LMLKINLELKIEEAKESKSMLFDACRVARQLEKVEGLK----KWEI
        +++ YL++  P L    +G+  IR + T     +F + ++              L++  ++E  + +   SKS+LFDA  +A++L+ ++       KW +
Subjt:  NFLAYLVVHCPSLFP--SGMSQIRHKATSEHVLQFVEDKK--------------LMLKINLELKIEEAKESKSMLFDACRVARQLEKVEGLK----KWEI

Query:  IGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLG
        +  VWVELL   +  C+  +H   L++GG LL  VW+LM H G
Subjt:  IGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLG

AT5G45540.1 Protein of unknown function (DUF594)2.5e-3122.03Show/hide
Query:  RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLA
        R+ ++ +   + +   Y  ADW  +++ G + +                + AF +PFLLLHLGG DTITA ++EDNELW R L S++ Q VA++Y+ LL+
Subjt:  RKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLA

Query:  LQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLV-----VSNHSPI------------------------------TIHNQEEVLDLQMLHLT
        + P  L    + +FV G+IKY E+  A  SAS ++ +D ++      +N++ +                               +    E+  LQ++   
Subjt:  LQPTSLNYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLV-----VSNHSPI------------------------------TIHNQEEVLDLQMLHLT

Query:  YNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLR----------LTTFSSLKNQSY----------
        Y +FN  K L V L  T+ +R ++R  ++DK  ++   +IIE+ELG +YD  +TK+ I H+  G + R          L  F   K   Y          
Subjt:  YNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDFFYTKSFINHSMWGCLLR----------LTTFSSLKNQSY----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----YSKFTKSMAAFSVQ------------------------------------------------------------------RRPISNNLETQ----
             +S F +++   S+                                                                   R  +S+ L  +    
Subjt:  -----YSKFTKSMAAFSVQ------------------------------------------------------------------RRPISNNLETQ----

Query:  IFQQLKQKLEGSSSTLEDSKKVIINEVGWSL----------------------NFDLDETILLWHIATDICY-------------HSSKIEERAEESSKS
        IF +++QK    +   E +K +      W+L                        D D++ILLWHIAT++ Y             HS+  E+    + + 
Subjt:  IFQQLKQKLEGSSSTLEDSKKVIINEVGWSL----------------------NFDLDETILLWHIATDICY-------------HSSKIEERAEESSKS

Query:  SILVSNFLAYLVVHCPSLFP--SGMSQIRHKATSEHVLQFVE----DK-------------KLMLKINLELKIEEAK--ESKSMLFDACRVARQLEKVEG
        S ++S+++ YL++  P+L    SG+++IR + T E    F +    DK             + +L +N E+     K   SKS+LFDA  +A++L   EG
Subjt:  SILVSNFLAYLVVHCPSLFP--SGMSQIRHKATSEHVLQFVE----DK-------------KLMLKINLELKIEEAK--ESKSMLFDACRVARQLEKVEG

Query:  LKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLG
           WE++  VWVELL   S  C+  +HA  L++GG L+  VW+LM H G
Subjt:  LKKWEIIGNVWVELLARISCECEWYDHAKMLTQGGNLLTRVWILMHHLG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTTGAAGAGAAAACACAGCTCAAGCCAATTCCTTCGCATGTTTCTCTTATTTACATACTCCTTTGCTGATTGGATCACCAATTTTTCATTCGGCATACTCGTCGA
AAAATATGGTAGCGGCTGCTACGACGAGTTCTTTCACCCCACATACATCATTAGAGCCTTCTTAGCTCCATTTTTACTACTTCACTTAGGTGGCTCTGACACTATCACTG
CCTACTCAATGGAAGACAATGAGCTATGGCTTAGAGCTCTTCTTTCTATGCTCCTTCAACTTGTTGCTTCACTTTACATCTTCTTACTTGCCCTACAGCCAACTTCTCTC
AACTATGTAGCCATTCCAGTTTTTGTTGCTGGCATAATCAAGTATGATGAGAAGATTTGGGCTTTCAGATCTGCTTCCGCCGAACGCTTGCGAGATTTTCTTGTTGTTTC
AAATCATTCTCCAATCACTATCCATAATCAGGAGGAAGTGCTAGATCTTCAGATGCTTCACCTTACTTACAACTTTTTCAATAGAGACAAAAGGTTGTTTGTAGGTTTAG
GTCCGACTTCCTATGATCGCCATCAAAACCGTCTTTGCTATTATGACAAATTCAACTCTAAATCCGTCTTTAAAATCATTGAGCTTGAACTTGGATTTATGTATGATTTC
TTCTACACAAAATCCTTCATCAACCACTCTATGTGGGGTTGTCTTCTACGCCTCACAACCTTTTCTTCTCTTAAAAACCAAAGCTATTATTCCAAGTTCACCAAAAGCAT
GGCAGCATTTTCAGTACAACGACGTCCCATCTCGAATAACCTCGAAACACAAATCTTCCAACAACTAAAGCAAAAGTTGGAAGGATCATCAAGTACATTAGAAGACAGCA
AGAAGGTAATTATTAATGAAGTTGGTTGGAGCCTAAATTTTGATTTGGACGAAACCATCCTCCTCTGGCACATCGCTACCGATATTTGCTATCATTCTTCAAAAATTGAA
GAAAGGGCGGAAGAATCATCAAAGTCAAGCATATTGGTGTCTAATTTCTTGGCTTACCTTGTAGTGCACTGTCCATCCTTATTTCCCAGTGGAATGAGTCAAATAAGGCA
TAAAGCCACTAGTGAACATGTCCTTCAATTTGTGGAAGACAAGAAGTTAATGTTGAAGATCAATTTGGAGTTGAAGATTGAGGAAGCTAAGGAAAGCAAGTCAATGTTGT
TTGATGCTTGTCGTGTTGCAAGGCAGCTTGAGAAAGTAGAAGGGTTAAAGAAGTGGGAGATAATAGGGAATGTGTGGGTAGAATTGTTAGCACGTATTTCATGTGAATGT
GAATGGTATGACCATGCTAAAATGCTTACACAAGGAGGTAATTTGTTAACACGTGTCTGGATTTTGATGCATCATCTTGGATATCTCAAACCAGCCGATGTCTTGACCAT
GGAAGAACATCGACCACTTCTAGACCATGAAATCATACCTGATTCTGTGGTCGCTCAAATGTTTGATGTTATATTTAATATTGCCTCCCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTTGAAGAGAAAACACAGCTCAAGCCAATTCCTTCGCATGTTTCTCTTATTTACATACTCCTTTGCTGATTGGATCACCAATTTTTCATTCGGCATACTCGTCGA
AAAATATGGTAGCGGCTGCTACGACGAGTTCTTTCACCCCACATACATCATTAGAGCCTTCTTAGCTCCATTTTTACTACTTCACTTAGGTGGCTCTGACACTATCACTG
CCTACTCAATGGAAGACAATGAGCTATGGCTTAGAGCTCTTCTTTCTATGCTCCTTCAACTTGTTGCTTCACTTTACATCTTCTTACTTGCCCTACAGCCAACTTCTCTC
AACTATGTAGCCATTCCAGTTTTTGTTGCTGGCATAATCAAGTATGATGAGAAGATTTGGGCTTTCAGATCTGCTTCCGCCGAACGCTTGCGAGATTTTCTTGTTGTTTC
AAATCATTCTCCAATCACTATCCATAATCAGGAGGAAGTGCTAGATCTTCAGATGCTTCACCTTACTTACAACTTTTTCAATAGAGACAAAAGGTTGTTTGTAGGTTTAG
GTCCGACTTCCTATGATCGCCATCAAAACCGTCTTTGCTATTATGACAAATTCAACTCTAAATCCGTCTTTAAAATCATTGAGCTTGAACTTGGATTTATGTATGATTTC
TTCTACACAAAATCCTTCATCAACCACTCTATGTGGGGTTGTCTTCTACGCCTCACAACCTTTTCTTCTCTTAAAAACCAAAGCTATTATTCCAAGTTCACCAAAAGCAT
GGCAGCATTTTCAGTACAACGACGTCCCATCTCGAATAACCTCGAAACACAAATCTTCCAACAACTAAAGCAAAAGTTGGAAGGATCATCAAGTACATTAGAAGACAGCA
AGAAGGTAATTATTAATGAAGTTGGTTGGAGCCTAAATTTTGATTTGGACGAAACCATCCTCCTCTGGCACATCGCTACCGATATTTGCTATCATTCTTCAAAAATTGAA
GAAAGGGCGGAAGAATCATCAAAGTCAAGCATATTGGTGTCTAATTTCTTGGCTTACCTTGTAGTGCACTGTCCATCCTTATTTCCCAGTGGAATGAGTCAAATAAGGCA
TAAAGCCACTAGTGAACATGTCCTTCAATTTGTGGAAGACAAGAAGTTAATGTTGAAGATCAATTTGGAGTTGAAGATTGAGGAAGCTAAGGAAAGCAAGTCAATGTTGT
TTGATGCTTGTCGTGTTGCAAGGCAGCTTGAGAAAGTAGAAGGGTTAAAGAAGTGGGAGATAATAGGGAATGTGTGGGTAGAATTGTTAGCACGTATTTCATGTGAATGT
GAATGGTATGACCATGCTAAAATGCTTACACAAGGAGGTAATTTGTTAACACGTGTCTGGATTTTGATGCATCATCTTGGATATCTCAAACCAGCCGATGTCTTGACCAT
GGAAGAACATCGACCACTTCTAGACCATGAAATCATACCTGATTCTGTGGTCGCTCAAATGTTTGATGTTATATTTAATATTGCCTCCCTCTAA
Protein sequenceShow/hide protein sequence
MGLKRKHSSSQFLRMFLLFTYSFADWITNFSFGILVEKYGSGCYDEFFHPTYIIRAFLAPFLLLHLGGSDTITAYSMEDNELWLRALLSMLLQLVASLYIFLLALQPTSL
NYVAIPVFVAGIIKYDEKIWAFRSASAERLRDFLVVSNHSPITIHNQEEVLDLQMLHLTYNFFNRDKRLFVGLGPTSYDRHQNRLCYYDKFNSKSVFKIIELELGFMYDF
FYTKSFINHSMWGCLLRLTTFSSLKNQSYYSKFTKSMAAFSVQRRPISNNLETQIFQQLKQKLEGSSSTLEDSKKVIINEVGWSLNFDLDETILLWHIATDICYHSSKIE
ERAEESSKSSILVSNFLAYLVVHCPSLFPSGMSQIRHKATSEHVLQFVEDKKLMLKINLELKIEEAKESKSMLFDACRVARQLEKVEGLKKWEIIGNVWVELLARISCEC
EWYDHAKMLTQGGNLLTRVWILMHHLGYLKPADVLTMEEHRPLLDHEIIPDSVVAQMFDVIFNIASL