; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G022140 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G022140
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function, DUF599
Genome locationchr04:29195718..29197467
RNA-Seq ExpressionLsi04G022140
SyntenyLsi04G022140
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006747 - Protein of unknown function DUF599


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038728.1 uncharacterized protein E6C27_scaffold92G002740 [Cucumis melo var. makuwa]1.3e-8594.51Show/hide
Query:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA
        DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPG A
Subjt:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA

Query:  AVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV
        ++T++YISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS ATGK+K TIVAG GNGELS V
Subjt:  AVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV

XP_004136437.1 uncharacterized protein LOC101209101 [Cucumis sativus]3.0e-8793.55Show/hide
Query:  NSINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI
        +SI  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI
Subjt:  NSINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI

Query:  PPGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        PPG A++T+DYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS  T KRKTIVAG GNGELS V
Subjt:  PPGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

XP_008466268.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103503728 [Cucumis melo]2.4e-8492.47Show/hide
Query:  SINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP
        SI  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP
Subjt:  SINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP

Query:  PGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV
        P   ++T++YISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS ATGK+K TIVAG GNGELS V
Subjt:  PGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV

XP_023536044.1 uncharacterized protein LOC111797297 [Cucurbita pepo subsp. pepo]1.8e-8492.82Show/hide
Query:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA
        DNEKKNILAVQ+LRNTIMGCTLMATTSILLC GLAAVLSSTYSIKKPLNDAVYGAHGDFML LKYVTLLTLFLFSFFCHSLSIRFINQ NILINIPPG A
Subjt:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA

Query:  AVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        AVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLD VCS A GKRKT+ AG+ NGELS V
Subjt:  AVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

XP_038899190.1 uncharacterized protein LOC120086553 [Benincasa hispida]1.8e-8794.62Show/hide
Query:  NSINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI
        +SI  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI
Subjt:  NSINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI

Query:  PPGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        P G AAVTSDYISDLLDKGFILN VGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS+AT KRKTIVAGEGNGEL+ V
Subjt:  PPGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

TrEMBL top hitse value%identityAlignment
A0A0A0LEJ6 Uncharacterized protein1.4e-8793.55Show/hide
Query:  NSINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI
        +SI  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI
Subjt:  NSINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI

Query:  PPGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        PPG A++T+DYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS  T KRKTIVAG GNGELS V
Subjt:  PPGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

A0A1S3CQV1 LOW QUALITY PROTEIN: uncharacterized protein LOC1035037281.1e-8492.47Show/hide
Query:  SINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP
        SI  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP
Subjt:  SINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP

Query:  PGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV
        P   ++T++YISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS ATGK+K TIVAG GNGELS V
Subjt:  PGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV

A0A5A7T794 Uncharacterized protein6.1e-8694.51Show/hide
Query:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA
        DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPG A
Subjt:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA

Query:  AVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV
        ++T++YISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS ATGK+K TIVAG GNGELS V
Subjt:  AVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV

A0A6J1FE98 uncharacterized protein LOC1114432815.7e-8492.27Show/hide
Query:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA
        DNEKKNILAVQ+LRNTIMGCTLMATTSILLC GLAAVLSSTYSIKKPLNDAVYGAHGDFML LKYVTLLTLFLFSFFCHSLSIRFINQ NILINIPPG A
Subjt:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA

Query:  AVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        AVTS YISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLD VCS A GKRKT+ AG+ NGELS V
Subjt:  AVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

A0A6J1IF19 uncharacterized protein LOC1114760392.4e-8290.61Show/hide
Query:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA
        DNEKKNILAVQ+LRN IMGCTLMATTSILLC GLAAVLSSTYSIKKPLND VYGAHGDFML LKYVTLLTLFLFSFFCHSLSIRFINQ NILINIPPG  
Subjt:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA

Query:  AVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        AVTS YISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLD VCS A GKRKT+ AG+ NGELS V
Subjt:  AVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G18215.1 Protein of unknown function, DUF5993.4e-2035.44Show/hide
Query:  KKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA-AV
        K   LAVQ++RN IM  TL+ATT+I LC+ +   +S++ S K    + +YG+    +   K   +L  FL +F C+  SIR+   V+ L+ +P       
Subjt:  KKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA-AV

Query:  TSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS
          +Y+S  L++     ++G R FY + P+ LW FGP+ +FVC   M  +LY LD   S
Subjt:  TSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS

AT4G31330.1 Protein of unknown function, DUF5993.5e-6273.65Show/hide
Query:  SINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP
        SI  DN+KKNILAVQ+LRN IMG TLMATTSILLC GLAAVLSSTY++KKPLNDAV+GA G+FM+ LKYVT+LT+FLFSFF HSLSIRFINQVNILIN P
Subjt:  SINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP

Query:  -------PGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLD
                       +Y+++LL++GFILNTVGNRLFYAALP++LWIFGPVLVF+CSV MVP+LYNLD
Subjt:  -------PGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLD

AT5G10580.1 Protein of unknown function, DUF5995.1e-6172.51Show/hide
Query:  SINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP
        +I  DNEKKNILAVQ+LRNTIMG TLMATT ILLC GLAAVLSSTYSIKKPLNDAVYGAHGDF + LKYVT+LT+FLF+FF HSLSIRFINQVNILIN P
Subjt:  SINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP

Query:  PG---------FAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVV
                    + VT +Y+S+LL+K F+LNTVGNRLFY  LP++LWIFGPVLVF+ S  ++PVLYNLD V
Subjt:  PG---------FAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVV

AT5G10580.2 Protein of unknown function, DUF5991.1e-4472.18Show/hide
Query:  SINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP
        +I  DNEKKNILAVQ+LRNTIMG TLMATT ILLC GLAAVLSSTYSIKKPLNDAVYGAHGDF + LKYVT+LT+FLF+FF HSLSIRFINQVNILIN P
Subjt:  SINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP

Query:  PG---------FAAVTSDYISDLLDKGFILNTV
                    + VT +Y+S+LL+K F+LNT+
Subjt:  PG---------FAAVTSDYISDLLDKGFILNTV

AT5G24790.1 Protein of unknown function, DUF5997.9e-5465.24Show/hide
Query:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI-----
        DN+K NILAVQ+LRN +MG TLMATT +LLC GLAAVLSSTYSIKKPLNDAV+GAHGDF + +KY+T+LT+F+FSFF HSLSIRF+NQV IL+NI     
Subjt:  DNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI-----

Query:  -PPGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVV
         P G   +TS+++S++ +KG  LNTVGNRLFYA   ++LWIFGP+LVF   + MV VL +LD V
Subjt:  -PPGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGGCTTTGGCATAGAGTTAGAACTCAGCCTTTCACAACTCTCATTGGCATTAACACTAATGCTAGACGTTTTTGGATCTCTTCCATTTTACAGAATAGACACATA
TGATAATTCAATAAATGTCGATAATGAGAAGAAAAACATCCTGGCCGTCCAATCGTTGAGGAACACGATAATGGGATGCACCCTAATGGCCACCACTTCGATTCTCCTCT
GTACGGGTCTAGCGGCAGTATTGAGTAGCACATATAGCATAAAAAAGCCCCTAAACGACGCCGTTTACGGGGCCCACGGCGATTTCATGTTGGGCCTTAAATACGTGACT
TTACTCACACTATTCCTCTTCTCCTTCTTCTGCCATTCCCTCTCCATTAGATTCATAAATCAGGTTAATATTCTCATCAACATTCCCCCCGGCTTCGCCGCCGTCACCTC
CGATTACATTTCCGATCTTCTTGACAAAGGATTTATTCTCAACACCGTCGGCAACCGCCTCTTCTACGCCGCCCTGCCGATGCTCCTCTGGATTTTTGGCCCTGTTTTGG
TCTTCGTTTGCTCTGTTTCTATGGTCCCTGTGCTTTATAATCTTGACGTCGTTTGCTCCGCCGCCACCGGAAAAAGGAAGACGATCGTCGCCGGTGAGGGTAATGGTGAG
CTCAGTATTGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATTGCCTTCTCTCTTTTCTTTGTATATATAAGTGCTTTGTTTCTGCAGCCACTTTGGCAGTGCAAAGAGCAAAGATCTGATCTAGCTAGAGATAATATTGTTATTATTGT
TATTATTATTGAAGAAATAATAACAATAACAATAATAAATATTAATAATTAATTAATGGGGGCAGAATGGAGGAGCTGTTATTTGGACATAATATTGGTTCCATTAGGGT
TTTTGATAAGCACTGGATATCATGCATGGCTTTGGCATAGAGTTAGAACTCAGCCTTTCACAACTCTCATTGGCATTAACACTAATGCTAGACGTTTTTGGATCTCTTCC
ATTTTACAGAATAGACACATATGATAATTCAATAAATGTCGATAATGAGAAGAAAAACATCCTGGCCGTCCAATCGTTGAGGAACACGATAATGGGATGCACCCTAATGG
CCACCACTTCGATTCTCCTCTGTACGGGTCTAGCGGCAGTATTGAGTAGCACATATAGCATAAAAAAGCCCCTAAACGACGCCGTTTACGGGGCCCACGGCGATTTCATG
TTGGGCCTTAAATACGTGACTTTACTCACACTATTCCTCTTCTCCTTCTTCTGCCATTCCCTCTCCATTAGATTCATAAATCAGGTTAATATTCTCATCAACATTCCCCC
CGGCTTCGCCGCCGTCACCTCCGATTACATTTCCGATCTTCTTGACAAAGGATTTATTCTCAACACCGTCGGCAACCGCCTCTTCTACGCCGCCCTGCCGATGCTCCTCT
GGATTTTTGGCCCTGTTTTGGTCTTCGTTTGCTCTGTTTCTATGGTCCCTGTGCTTTATAATCTTGACGTCGTTTGCTCCGCCGCCACCGGAAAAAGGAAGACGATCGTC
GCCGGTGAGGGTAATGGTGAGCTCAGTATTGTTTAGGATTTGGTTTCTTTGACTCTATGGATATGTATTATGATCTGATATTATGTATGTATTTATGTAAATTGATCAGT
GCACTGAAATTTGAGGCTGTTTGTGTCTTTAGGGTTTGAAACGAACTTTTGAATGGGAATAATTGAAAGTGCAAATGTCTCTGATTTCTCACCATCA
Protein sequenceShow/hide protein sequence
MHGFGIELELSLSQLSLALTLMLDVFGSLPFYRIDTYDNSINVDNEKKNILAVQSLRNTIMGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVT
LLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGE
LSIV