CuGenDBv2

Spg021985 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg021985
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DUF4220 domain-containing protein
Genome location: scaffold2:15458000..15459019
RNA-Seq Expression: Spg021985
Synteny: Spg021985
Gene Ontology terms: NA
InterPro domains: IPR025315 - Domain of unknown function DUF4220
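The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the span it covers can be computed as below. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes 1-based coordinates with both ends inclusive, which this page does not state explicitly.

```python
import re


def parse_location(location: str):
    """Split 'seqid:start..end' into (seqid, start, end) with integer coordinates."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: 1-based coordinates, both ends inclusive.
    return end - start + 1


seqid, start, end = parse_location("scaffold2:15458000..15459019")
print(seqid, span_length(start, end))  # scaffold2 1020
```

The span describes the whole gene model on the scaffold, so it need not equal the length of the CDS listed further down.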


Homology
GenBank top hits (e value, % identity, alignment)
GFZ17712.1 transmembrane protein, putative [Actinidia rufa] (e value: 6.5e-14; % identity: 25.38)
Query:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTH-------------------------
        +ILS   L++S  +F    +K AF+V+ +ELGF++D+LYTK ++I+ + WG  L   S SS VIA +AFC  + H                         
Subjt:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTH-------------------------

Query:  ---------------RDPKFLRNFYTV-----------------------------------------------------TTSQPISDEFKKRIFEQLKE
                        D K +   YTV                                                      TS+ +  + K+ IF QL E
Subjt:  ---------------RDPKFLRNFYTV-----------------------------------------------------TTSQPISDEFKKRIFEQLKE

Query:  ---------MLEMVDYVDNDRILPYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDH-----------KNKSLNSSF--LEETKLFSNYLIYL
                 + + +     DR+L  W        + +GWS +++   S+LLW+IAT++CY+SD            K++ L+S F   E +KL S+Y++Y+
Subjt:  ---------MLEMVDYVDNDRILPYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDH-----------KNKSLNSSF--LEETKLFSNYLIYL

Query:  LAYRHSLFSNGMGRTRFDATVLDTKSF
        L  R ++  NG+G+ RF  T  +   F
Subjt:  LAYRHSLFSNGMGRTRFDATVLDTKSF
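In BLAST-style alignments like the one above, the middle line conventionally shows a letter where query and subject are identical and `+` where the substitution scores positively. A rough sketch of counting both from a midline follows. The function names are hypothetical, and the percent identity reported on this page is computed over the full alignment, not over a single displayed segment.

```python
def midline_counts(midline: str):
    """Return (identities, positives) for one BLAST-style midline segment."""
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    # Positives include identities plus conservative substitutions.
    return identities, identities + conservative


def percent_identity(identities: int, aligned_columns: int) -> float:
    """Identities as a percentage of aligned columns (gaps included)."""
    return 100.0 * identities / aligned_columns


# Final segment of the alignment above (27 aligned columns).
segment = "L  R ++  NG+G+ RF  T  +   F"
print(midline_counts(segment))  # (9, 14)
```

To reproduce a whole-hit figure, the counts would need to be summed over every segment and divided by the total number of aligned columns.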

KAF8389223.1 hypothetical protein HHK36_025916 [Tetracentron sinense] (e value: 2.7e-15; % identity: 33.64)
Query:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTHRDPKFLRNFYTVTTSQPISDEFKKR
        +ILS    + S  YF     + AFKVVELELG+++D LYTK S + +T+ G  L LI+ SS V   VAF   N                   +S E K  
Subjt:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTHRDPKFLRNFYTVTTSQPISDEFKKR

Query:  IFEQLKEMLEMVDYVDNDRIL--PYWVRRKRIFYSRIGW----STKLDSVQSMLLWYIATEVCYYS--DHKNKSLNSSFLEETKLFSNYLIYLLAYRHSL
        IF+QL E  + +     + ++      RR  +      +    +T+ D  Q +LLW+IAT++CYYS  D  +    S   E T L S Y++YLL     +
Subjt:  IFEQLKEMLEMVDYVDNDRIL--PYWVRRKRIFYSRIGW----STKLDSVQSMLLWYIATEVCYYS--DHKNKSLNSSFLEETKLFSNYLIYLLAYRHSL

Query:  FSNGMGRTRFDATVLDTKSF
          NG+G+ RF  T  +  +F
Subjt:  FSNGMGRTRFDATVLDTKSF

TKY64267.1 DUF594 family protein [Spatholobus suberectus] (e value: 8.5e-14; % identity: 31.94)
Query:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTHRDPKFLRNFYTVTTSQPISDEFKKR
        +ILS   + +S     N   K  F+V+E+ELGF++DL YTK + + ++  G+ L L++ S  V    AF              F T     P  D F   
Subjt:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTHRDPKFLRNFYTVTTSQPISDEFKKR

Query:  IFEQLKEMLEMVDYVDNDRILPYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCY---YSDHKNKSLNSSFLEETKLFSNYLIYLLAYRHSLFSNGM
        +       LE+      DR+L  +   K      + WS K +  QS+LLW+IAT++CY     +  + S   SF E +KL S Y++YLL  R S+  NG+
Subjt:  IFEQLKEMLEMVDYVDNDRILPYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCY---YSDHKNKSLNSSFLEETKLFSNYLIYLLAYRHSLFSNGM

Query:  GRTRFDATVLDTKSFL
        G  RF  T  +   F+
Subjt:  GRTRFDATVLDTKSFL

XP_022141971.1 uncharacterized protein LOC111012216 [Momordica charantia] (e value: 1.5e-31; % identity: 33.66)
Query:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAF-------------------------------
        + L+   L+QSL+YF  FD + AFKV+ELELGF++D  YTK S+I H+RWG  L L +  SIV+  V F                               
Subjt:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAF-------------------------------

Query:  ------------------------------------------------------CTSNTH--RDPKFLRNFY---TVTTSQPISDEFKKRIFEQLKEMLE
                                                              C   T   R  K+ R  Y    +T S+ ISDE K RIF+QL + LE
Subjt:  ------------------------------------------------------CTSNTH--RDPKFLRNFY---TVTTSQPISDEFKKRIFEQLKEMLE

Query:  MVDYVDNDRILPYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDHKNKSLNSSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTRFDATVLDTK
        +    + +R LP W+ RK   Y+++GWS +LDS QS+LLW+IAT +CY+ D + ++ N S LE+  L S++L YLL Y HSLF +GM   RF  TV    
Subjt:  MVDYVDNDRILPYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDHKNKSLNSSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTRFDATVLDTK

Query:  SFL
         F+
Subjt:  SFL

XP_030458463.1 uncharacterized protein LOC115679061 [Syzygium oleosum] (e value: 1.8e-16; % identity: 26.45)
Query:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFC------------------------------
        +ILS   L++S  +F   D++ AFKV+E+ELGF++D+L+TK +++ H  WG  L  IS SS   A  AFC                              
Subjt:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFC------------------------------

Query:  ------------------------------------------------TSNTHRD------------PKFLRNFYTVTTSQPISDEFKKRIFEQLKE-ML
                                                         S+  +D             K L N Y     +P+SD  K  +FEQL E   
Subjt:  ------------------------------------------------TSNTHRD------------PKFLRNFYTVTTSQPISDEFKKRIFEQLKE-ML

Query:  EMVDYVDNDRILPY---WVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDHKNKSL---NSSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTRFD
          VD+     +  Y   WV +K     ++GWS  ++  QS++LW+IAT++C Y D  NK+L   + S  E  +L SNY++++L  R  +  +G+G+ R  
Subjt:  EMVDYVDNDRILPY---WVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDHKNKSL---NSSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTRFD

Query:  ATVLDTKSFL
         T  + ++F+
Subjt:  ATVLDTKSFL

TrEMBL top hits (e value, % identity, alignment)
A0A5B7BVN3 DUF4220 domain-containing protein (e value: 1.5e-16; % identity: 25.37)
Query:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTH-------------------------
        +ILS   L++SL +F    ++ AF+V+E+ELGF++D+LYTK ++++   WG  L     SS +IA V FCT + H                         
Subjt:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTH-------------------------

Query:  ----------------------RDPKFLRNFYTVT---------------------------------------------TSQPISDEFKKRIFEQLKEM
                                 K +  F  VT                                             +S+ +S   K+ IF+QL E 
Subjt:  ----------------------RDPKFLRNFYTVT---------------------------------------------TSQPISDEFKKRIFEQLKEM

Query:  LE-MVDYVDNDRILP---YWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDHKNKSLN---SSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTR
         +   ++ D  ++       V +K     ++GWS K +   S+LLW+IAT++CYYSD+ ++  N    +  +E+KL SNY++Y+L  R  +  NG+G+ R
Subjt:  LE-MVDYVDNDRILP---YWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDHKNKSLN---SSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTR

Query:  FDATVLDTKSFLGRHVPKLL---GTVDYAYEYILQ
        F  T  +   F      K      T++ A + +LQ
Subjt:  FDATVLDTKSFLGRHVPKLL---GTVDYAYEYILQ

A0A6J0ZWT9 uncharacterized protein LOC110412747 (e value: 5.4e-14; % identity: 25.23)
Query:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCT-----------------------------
        +ILS   +++S  YF    +  AFKV+E+ELGF++DL YTK S+++ + WG  L  +S SS +I    F                               
Subjt:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCT-----------------------------

Query:  -------------SNTHRDP-----KFLRNFYTVT-------------------------------------------TSQPISDEFKKRIFEQLKEMLE
                     S   + P     K +  F  +T                                           TS+ +S   K+ +FE+L E  +
Subjt:  -------------SNTHRDP-----KFLRNFYTVT-------------------------------------------TSQPISDEFKKRIFEQLKEMLE

Query:  MVDYVDNDRIL----PYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDHKN--KSLNSSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTRFDA
        +       R L       V  ++    R+GWS +++   S+LLW+IAT +CY  D K    S+  S  + +KL S YL+Y+L  R S+  NG+G+ RF  
Subjt:  MVDYVDNDRIL----PYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDHKN--KSLNSSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTRFDA

Query:  TVLDTKSFLGRHVPKLLGTVDYAYEYILQ
        T+ +   F+     K +     A E +LQ
Subjt:  TVLDTKSFLGRHVPKLLGTVDYAYEYILQ

A0A6J1CKT2 uncharacterized protein LOC111012216 (e value: 7.5e-32; % identity: 33.66)
Query:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAF-------------------------------
        + L+   L+QSL+YF  FD + AFKV+ELELGF++D  YTK S+I H+RWG  L L +  SIV+  V F                               
Subjt:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAF-------------------------------

Query:  ------------------------------------------------------CTSNTH--RDPKFLRNFY---TVTTSQPISDEFKKRIFEQLKEMLE
                                                              C   T   R  K+ R  Y    +T S+ ISDE K RIF+QL + LE
Subjt:  ------------------------------------------------------CTSNTH--RDPKFLRNFY---TVTTSQPISDEFKKRIFEQLKEMLE

Query:  MVDYVDNDRILPYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDHKNKSLNSSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTRFDATVLDTK
        +    + +R LP W+ RK   Y+++GWS +LDS QS+LLW+IAT +CY+ D + ++ N S LE+  L S++L YLL Y HSLF +GM   RF  TV    
Subjt:  MVDYVDNDRILPYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDHKNKSLNSSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTRFDATVLDTK

Query:  SFL
         F+
Subjt:  SFL

A0A7J0H3V1 Transmembrane protein, putative (e value: 3.2e-14; % identity: 25.38)
Query:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTH-------------------------
        +ILS   L++S  +F    +K AF+V+ +ELGF++D+LYTK ++I+ + WG  L   S SS VIA +AFC  + H                         
Subjt:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTH-------------------------

Query:  ---------------RDPKFLRNFYTV-----------------------------------------------------TTSQPISDEFKKRIFEQLKE
                        D K +   YTV                                                      TS+ +  + K+ IF QL E
Subjt:  ---------------RDPKFLRNFYTV-----------------------------------------------------TTSQPISDEFKKRIFEQLKE

Query:  ---------MLEMVDYVDNDRILPYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDH-----------KNKSLNSSF--LEETKLFSNYLIYL
                 + + +     DR+L  W        + +GWS +++   S+LLW+IAT++CY+SD            K++ L+S F   E +KL S+Y++Y+
Subjt:  ---------MLEMVDYVDNDRILPYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDH-----------KNKSLNSSF--LEETKLFSNYLIYL

Query:  LAYRHSLFSNGMGRTRFDATVLDTKSF
        L  R ++  NG+G+ RF  T  +   F
Subjt:  LAYRHSLFSNGMGRTRFDATVLDTKSF

A5BA61 Uncharacterized protein (e value: 5.4e-14; % identity: 26.32)
Query:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTHRDPK-------------FLRNFYTV
        ++L P+ L+ S  +F N  +  AF+V+E+ELGF++D+  TK  ++ +  WG     I   S +   + F T + H                  ++  Y++
Subjt:  MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTHRDPK-------------FLRNFYTV

Query:  -----------------TTSQPISDEF----------------------------KKRIFEQLKEMLEMVDYVDNDRIL----PYWVRRKRIFYSRIGWS
                          T   I +EF                            K  IFEQL E   +   ++  + L      W+  K   +SR+GWS
Subjt:  -----------------TTSQPISDEF----------------------------KKRIFEQLKEMLEMVDYVDNDRIL----PYWVRRKRIFYSRIGWS

Query:  TKLDSVQSMLLWYIATEVCYYSDHKNKS--LNSSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTRF
        T  +  QS+LLW++AT++CYY+D   KS    +S  + +KL S+Y++YLL     +  +G+G+ RF
Subjt:  TKLDSVQSMLLWYIATEVCYYSDHKNKS--LNSSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTRF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G19080.1 Protein of unknown function (DUF594) (e value: 1.0e-04; % identity: 29.33)
Query:  TKLDSVQSMLLWYIATEVCYYSDHKNKSLNSSFLEETKLFSNYLIYLLAYRHSLFSN--GMGRTRFDATVLDTKS
        T+++  Q +L+W++ATE+ + ++  N + N    E +K  S+Y++YLL  + SL S   G+ + +F   + + K+
Subjt:  TKLDSVQSMLLWYIATEVCYYSDHKNKSLNSSFLEETKLFSNYLIYLLAYRHSLFSN--GMGRTRFDATVLDTKS

AT4G19090.1 Protein of unknown function (DUF594) (e value: 2.0e-05; % identity: 33.33)
Query:  TKLDSVQSMLLWYIATEVCYYSDHK-----NKSLNSSFLEETKLFSNYLIYLLAYRHSLFSN--GMGRTRFDATVLDTKSF
        T +D   S+L+W+IATE+CY  +       +KS   +  + +K+ S+Y++YLL  +  L S   G+G+ RF  T+ +   F
Subjt:  TKLDSVQSMLLWYIATEVCYYSDHK-----NKSLNSSFLEETKLFSNYLIYLLAYRHSLFSN--GMGRTRFDATVLDTKSF

AT5G45470.1 Protein of unknown function (DUF594) (e value: 1.6e-10; % identity: 31.08)
Query:  QPISDEFKKRIFEQLKEMLEMVDYVDNDRILPY----WVRRKRIFYSR-----IGWSTKLDSVQSMLLWYIATEVCYYSDHKNKSLNSSFLEETKLFSN-
        +P++ E  K IFE+LK   +  D  +N + +      W  R+ +         + + TK+D  QS+L+W+IATE+C Y  H+ +++   + E+ K +SN 
Subjt:  QPISDEFKKRIFEQLKEMLEMVDYVDNDRILPY----WVRRKRIFYSR-----IGWSTKLDSVQSMLLWYIATEVCYYSDHKNKSLNSSFLEETKLFSN-

Query:  --------YLIYLLAYRHSLFSN--GMGRTRFDATVLDT-KSFLGRHV
                Y++YLL  +  L S   G+G+ RF  T+ +T K F  RH+
Subjt:  --------YLIYLLAYRHSLFSN--GMGRTRFDATVLDT-KSFLGRHV

AT5G45480.1 Protein of unknown function (DUF594) (e value: 4.1e-06; % identity: 37.18)
Query:  KLDSVQSMLLWYIATEVCYYSDHKNKSLNSSFLEETKLFSNYLIYLLAYRHSLFSN--GMGRTRFDATVLDTKSFLGR
        ++D  QS+L+W+IATE+ Y +  K    N S  E +K+ S+Y++YLL  + +L S   G+G+ RF  T  + + F  R
Subjt:  KLDSVQSMLLWYIATEVCYYSDHKNKSLNSSFLEETKLFSNYLIYLLAYRHSLFSN--GMGRTRFDATVLDTKSFLGR

AT5G45530.1 Protein of unknown function (DUF594) (e value: 2.4e-06; % identity: 37.08)
Query:  KLDSVQSMLLWYIATEVCYYSDHKNKSLNSS-----FLEETKLFSNYLIYLLAYRHSLFSN--GMGRTRFDATVLDTKSFL-GRHVPKL
        K+D  QS+LLW+IATE+C+  +   K    S       E +K+ S+Y++YLL  R  L S   G+G  RF  T  + + F  GR +  L
Subjt:  KLDSVQSMLLWYIATEVCYYSDHKNKSLNSS-----FLEETKLFSNYLIYLLAYRHSLFSN--GMGRTRFDATVLDTKSFL-GRHVPKL


Sequences
CDS sequence
ATGATTCTGAGCCCGAGTCATCTCAAGCAAAGCCTTTACTACTTTGCGAATTTTGATTTTAAAACCGCTTTCAAAGTGGTTGAGCTTGAGCTTGGATTTTTGCATGATTT
ACTCTACACAAAAGTCTCTCTCATATTCCATACTCGTTGGGGTGCTCCCCTTCACCTTATTAGTTCATCTTCAATTGTCATAGCCACCGTCGCCTTTTGCACGAGCAATA
CACACAGAGACCCCAAGTTCTTGAGAAATTTTTACACTGTGACGACTTCACAACCAATCTCTGATGAATTTAAAAAACGAATATTTGAACAGTTAAAAGAGATGCTAGAA
ATGGTGGATTATGTAGACAATGATAGGATATTGCCTTATTGGGTTAGAAGAAAGCGTATATTTTATTCTCGTATTGGTTGGAGCACGAAGTTGGATTCTGTTCAAAGCAT
GCTCCTTTGGTACATCGCCACTGAAGTTTGCTATTATTCCGATCATAAAAACAAGTCTTTAAATTCTTCTTTTCTTGAAGAAACTAAATTGTTTAGTAATTACCTCATCT
ACCTTTTAGCGTATCGTCATTCATTATTCTCCAATGGAATGGGTCGAACTAGATTTGATGCCACGGTTCTTGACACCAAAAGTTTCCTAGGACGACATGTCCCAAAACTA
TTGGGAACAGTCGACTACGCTTATGAGTATATATTGCAAGGTTTGGACGTGAGGCATTGTTGTGGAGTAGATGTTAAAAGAATTGGCAATCGGGTCAGAGTTTGA
mRNA sequence
ATGATTCTGAGCCCGAGTCATCTCAAGCAAAGCCTTTACTACTTTGCGAATTTTGATTTTAAAACCGCTTTCAAAGTGGTTGAGCTTGAGCTTGGATTTTTGCATGATTT
ACTCTACACAAAAGTCTCTCTCATATTCCATACTCGTTGGGGTGCTCCCCTTCACCTTATTAGTTCATCTTCAATTGTCATAGCCACCGTCGCCTTTTGCACGAGCAATA
CACACAGAGACCCCAAGTTCTTGAGAAATTTTTACACTGTGACGACTTCACAACCAATCTCTGATGAATTTAAAAAACGAATATTTGAACAGTTAAAAGAGATGCTAGAA
ATGGTGGATTATGTAGACAATGATAGGATATTGCCTTATTGGGTTAGAAGAAAGCGTATATTTTATTCTCGTATTGGTTGGAGCACGAAGTTGGATTCTGTTCAAAGCAT
GCTCCTTTGGTACATCGCCACTGAAGTTTGCTATTATTCCGATCATAAAAACAAGTCTTTAAATTCTTCTTTTCTTGAAGAAACTAAATTGTTTAGTAATTACCTCATCT
ACCTTTTAGCGTATCGTCATTCATTATTCTCCAATGGAATGGGTCGAACTAGATTTGATGCCACGGTTCTTGACACCAAAAGTTTCCTAGGACGACATGTCCCAAAACTA
TTGGGAACAGTCGACTACGCTTATGAGTATATATTGCAAGGTTTGGACGTGAGGCATTGTTGTGGAGTAGATGTTAAAAGAATTGGCAATCGGGTCAGAGTTTGA
Protein sequence
MILSPSHLKQSLYYFANFDFKTAFKVVELELGFLHDLLYTKVSLIFHTRWGAPLHLISSSSIVIATVAFCTSNTHRDPKFLRNFYTVTTSQPISDEFKKRIFEQLKEMLE
MVDYVDNDRILPYWVRRKRIFYSRIGWSTKLDSVQSMLLWYIATEVCYYSDHKNKSLNSSFLEETKLFSNYLIYLLAYRHSLFSNGMGRTRFDATVLDTKSFLGRHVPKL
LGTVDYAYEYILQGLDVRHCCGVDVKRIGNRVRV
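The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code; the `translate` helper is hypothetical and is shown on the first 30 and last 18 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATTCTGAGCCCGAGTCATCTCAAGCAA"))  # MILSPSHLKQ
print(translate("AATCGGGTCAGAGTTTGA"))              # NRVRV
```

Both fragments agree with the start (`MILSPSHLKQ`) and end (`NRVRV`) of the protein sequence listed above; passing the full CDS, with its line breaks, to the same function would allow a complete comparison.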