; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023439 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023439
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLEA_2 domain-containing protein
Genome locationchr7:48225408..48226136
RNA-Seq ExpressionLag0023439
SyntenyLag0023439
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607382.1 hypothetical protein SDJN03_00724, partial [Cucurbita argyrosperma subsp. sororia]3.6e-7060.17Show/hide
Query:  NDVGTQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNS
        N V  QL+RVPS D KG+RRVAF+DSLPKHR+TS  +     S    RLFA C WIC+ VFGI + +LILGVIF+SFLQSGLPEITV+ LDLSK +IQNS
Subjt:  NDVGTQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNS

Query:  TNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIG
        TNQ+ A LN++V+M++++KNKNEK+EL+YS++ + LVSE++ LG++VI  FS  PG+TT  N+T +V  DS DR++   LEDD+KK Q+ V++TM  ++G
Subjt:  TNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIG

Query:  FHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPTR
        FHLGIF LNKVPIHV C + Q L+LYR ++PPC+I MFPTR
Subjt:  FHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPTR

XP_008457557.1 PREDICTED: uncharacterized protein LOC103497223 [Cucumis melo]2.4e-8265.43Show/hide
Query:  MNDVG-TQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQ
        MN++G  +L+R+PSQ+ KGSRRVAF+DSLPKHR   GD      ++CCPRLFACC WIC+G+FGIV+ +LILGVIF+SFLQSGLPEITVR L+LS FEI+
Subjt:  MNDVG-TQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQ

Query:  NSTNQ--DNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTME
        NSTNQ  +NA LN+++ MS+EM+NKNEK+EL+YS+I +NLVSEDV LG+SVI  FSH+PG+TT  N+T +V   STD++N  +LEDD+KKVQMDVQV ME
Subjt:  NSTNQ--DNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTME

Query:  ATIGFHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFP
        A +GFH+GIF+L  VPIHV C + Q+L++YR  +PPCNIRMFP
Subjt:  ATIGFHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFP

XP_022949169.1 uncharacterized protein LOC111452600 [Cucurbita moschata]6.8e-6959.34Show/hide
Query:  NDVGTQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNS
        N V  QL+RVPS D KG+RRVAF+DSLPKHR+TS  +     S     L A C WIC+ VFGI + +LILGVIF+SFLQSGLPEITV+ LDLSK +IQNS
Subjt:  NDVGTQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNS

Query:  TNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIG
        TNQ+ A LN++V+M++++KNKNEK+EL+YS++ + LVSE++ LG++VI  FS  PG+TT  N+T +V  DS DR++   LEDD+KK Q+ V++TM  ++G
Subjt:  TNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIG

Query:  FHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPTR
        FHLGIF LNKVPIHV C + Q L+LYR ++PPC+I MFPTR
Subjt:  FHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPTR

XP_022998792.1 uncharacterized protein LOC111493353 [Cucurbita maxima]9.8e-6858.51Show/hide
Query:  NDVGTQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNS
        N V  QL+RVPS D KG+RRVAF+DSLPKHR+ S  +     S     LFA C WIC+ VFGI + +LILGVIF+SFLQS LPEITV+ LDLSK +IQNS
Subjt:  NDVGTQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNS

Query:  TNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIG
        TNQ+ A LN++V+M+++++NKNEK+EL+YS++ + LVSE++ LG++VI  FS  PG+TT  N+T  V  DS DR++   LEDD+KK Q+ V++TM  ++G
Subjt:  TNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIG

Query:  FHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPTR
        FHLGIF LNKVPIHV C + Q L+LYR ++PPC+I MFPTR
Subjt:  FHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPTR

XP_038895624.1 uncharacterized protein LOC120083816 [Benincasa hispida]3.3e-8465.7Show/hide
Query:  MNDVGT-QLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQ
        MN+V + QL+R+PSQ+ +GSRRVAF++SLP+HR  SG+      S+CCPRLFACC WIC+ VFGIVL VLILGVIF+SFLQSGLP+ITV+ L+LSKFE  
Subjt:  MNDVGT-QLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQ

Query:  NSTNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEAT
        NSTNQ+N  LN++V +S+EM+NKN+K+EL+YSNI +NL S+DV LG+SVI GF+H PG+TT FN+T +V G STD++N  +LEDD+K+VQM+VQVTME+T
Subjt:  NSTNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEAT

Query:  IGFHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPT
        +GFH+GIF+LN VPIHV C + Q L+LYR  +PPCNIRMFPT
Subjt:  IGFHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPT

TrEMBL top hitse value%identityAlignment
A0A1S3C5S1 uncharacterized protein LOC1034972231.2e-8265.43Show/hide
Query:  MNDVG-TQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQ
        MN++G  +L+R+PSQ+ KGSRRVAF+DSLPKHR   GD      ++CCPRLFACC WIC+G+FGIV+ +LILGVIF+SFLQSGLPEITVR L+LS FEI+
Subjt:  MNDVG-TQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQ

Query:  NSTNQ--DNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTME
        NSTNQ  +NA LN+++ MS+EM+NKNEK+EL+YS+I +NLVSEDV LG+SVI  FSH+PG+TT  N+T +V   STD++N  +LEDD+KKVQMDVQV ME
Subjt:  NSTNQ--DNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTME

Query:  ATIGFHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFP
        A +GFH+GIF+L  VPIHV C + Q+L++YR  +PPCNIRMFP
Subjt:  ATIGFHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFP

A0A5A7V2C7 Putative transmembrane protein1.2e-8265.43Show/hide
Query:  MNDVG-TQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQ
        MN++G  +L+R+PSQ+ KGSRRVAF+DSLPKHR   GD      ++CCPRLFACC WIC+G+FGIV+ +LILGVIF+SFLQSGLPEITVR L+LS FEI+
Subjt:  MNDVG-TQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQ

Query:  NSTNQ--DNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTME
        NSTNQ  +NA LN+++ MS+EM+NKNEK+EL+YS+I +NLVSEDV LG+SVI  FSH+PG+TT  N+T +V   STD++N  +LEDD+KKVQMDVQV ME
Subjt:  NSTNQ--DNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTME

Query:  ATIGFHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFP
        A +GFH+GIF+L  VPIHV C + Q+L++YR  +PPCNIRMFP
Subjt:  ATIGFHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFP

A0A6J1E0W1 uncharacterized protein LOC1110249231.6e-6358.95Show/hide
Query:  KGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNSTNQ--DNAPLNSRVQ
        KG RRV F++SLP HR TS      +G++C  RLFA CG ICIG FGI+L +LI+ VIF+SFLQSGLPEI+++TL LSKFEI +STNQ  +NA L++RV 
Subjt:  KGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNSTNQ--DNAPLNSRVQ

Query:  MSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIGFHLGIFHLNKVPI
        +S+ ++NKN+K+EL+Y +I +N+ S+DV LGKSVI GFSH PG+TT  N+TT+V GD  DRENALE++++KK+V+M  QV MEA IGFH GIF + KVPI
Subjt:  MSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIGFHLGIFHLNKVPI

Query:  HVGC-VYHQSLILYRAQQPPCNIRMFPTR
        HV C    Q L++ R ++  CNIRMFP R
Subjt:  HVGC-VYHQSLILYRAQQPPCNIRMFPTR

A0A6J1GC15 uncharacterized protein LOC1114526003.3e-6959.34Show/hide
Query:  NDVGTQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNS
        N V  QL+RVPS D KG+RRVAF+DSLPKHR+TS  +     S     L A C WIC+ VFGI + +LILGVIF+SFLQSGLPEITV+ LDLSK +IQNS
Subjt:  NDVGTQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNS

Query:  TNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIG
        TNQ+ A LN++V+M++++KNKNEK+EL+YS++ + LVSE++ LG++VI  FS  PG+TT  N+T +V  DS DR++   LEDD+KK Q+ V++TM  ++G
Subjt:  TNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIG

Query:  FHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPTR
        FHLGIF LNKVPIHV C + Q L+LYR ++PPC+I MFPTR
Subjt:  FHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPTR

A0A6J1K8Y2 uncharacterized protein LOC1114933534.7e-6858.51Show/hide
Query:  NDVGTQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNS
        N V  QL+RVPS D KG+RRVAF+DSLPKHR+ S  +     S     LFA C WIC+ VFGI + +LILGVIF+SFLQS LPEITV+ LDLSK +IQNS
Subjt:  NDVGTQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNS

Query:  TNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIG
        TNQ+ A LN++V+M+++++NKNEK+EL+YS++ + LVSE++ LG++VI  FS  PG+TT  N+T  V  DS DR++   LEDD+KK Q+ V++TM  ++G
Subjt:  TNQDNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIG

Query:  FHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPTR
        FHLGIF LNKVPIHV C + Q L+LYR ++PPC+I MFPTR
Subjt:  FHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMFPTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G17620.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.1e-0623.86Show/hide
Query:  PKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNSTNQDNAPLNSRVQMSVEMKNKNEKMEL
        P +R  +G         CC R   CC W    +  ++L+V     +     +   P  TV  L +S     ++       L + + +SV  +N N+ +  
Subjt:  PKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNSTNQDNAPLNSRVQMSVEMKNKNEKMEL

Query:  TYSNIALNLV------SEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDD-KKKVQMDVQVTMEATIGFHLGIFHLNKVPIHVGC
         Y    + L        +DV +GK  IA FSH   +TT    T     D  D  +A +L+ D K K  + +++ + + +   +G     K  I V C
Subjt:  TYSNIALNLV------SEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDD-KKKVQMDVQVTMEATIGFHLGIFHLNKVPIHVGC

AT2G30505.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.6e-1227.51Show/hide
Query:  CCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNSTNQ--DNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIA
        CC   C+ V  ++++VL++G+   S ++S LP++ V  L  S+ +I  S+     NA LN+ +Q+S    N N+K  L YS +  ++ SE++ LGK  ++
Subjt:  CCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNSTNQ--DNAPLNSRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIA

Query:  GFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIGFHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMF
        GF  +PG+ T   + T +        +A  L + +K ++  V V +   +      F ++ +PI + C   +   +    +P C++R+F
Subjt:  GFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIGFHLGIFHLNKVPIHVGCVYHQSLILYRAQQPPCNIRMF

AT4G01110.1 unknown protein1.7e-0624.44Show/hide
Query:  SRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNSTNQDN-APLNSRVQMSVEMKNKNEKMELTYSNIALNL-VSED
        SRC  R+F CC  +CI V  ++L++++   +F  +    LP + + +  +S F        D  + L +     ++ +N N K+   Y N+ + + V ED
Subjt:  SRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNSTNQDN-APLNSRVQMSVEMKNKNEKMELTYSNIALNL-VSED

Query:  ---VGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIGFHLGIFHLNKVPIHVGC
             LG + + GF   PG+ T   +   V     D      L  D K  ++ V+V  +  +G  +G   +  V + + C
Subjt:  ---VGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIGFHLGIFHLNKVPIHVGC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGATGTTGGTACTCAACTTAATCGAGTTCCAAGTCAAGATCTCAAAGGATCGCGTCGGGTCGCCTTCACCGATTCCCTTCCTAAACACCGCACGACATCCGGCGA
TTCTGAGCATGGCACTGGCAGCAGATGTTGTCCCCGTTTATTTGCTTGTTGCGGATGGATATGCATTGGGGTGTTCGGAATTGTTCTCGTCGTCCTCATCCTTGGCGTGA
TATTTTTGTCCTTCCTTCAATCAGGATTGCCAGAGATCACCGTGAGAACCTTGGATTTGTCCAAGTTTGAGATTCAAAACTCCACAAATCAGGACAACGCCCCCCTAAAT
TCCAGAGTACAGATGTCTGTCGAGATGAAGAACAAGAATGAGAAAATGGAGTTGACTTATAGCAATATTGCGCTGAATCTGGTGTCAGAGGACGTGGGATTGGGCAAGAG
CGTGATTGCTGGTTTCTCTCATAATCCTGGAGATACCACACGTTTCAATCTAACCACAGACGTCGCAGGAGATTCCACAGATAGAGAGAACGCATTGGAACTAGAAGATG
ACAAAAAAAAGGTGCAAATGGATGTGCAGGTGACAATGGAAGCTACAATTGGTTTTCATCTCGGGATATTCCACCTGAACAAGGTGCCGATCCATGTAGGATGTGTTTAT
CATCAGTCTCTTATTTTGTATCGCGCACAGCAGCCCCCATGTAATATTAGAATGTTTCCCACCAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACGATGTTGGTACTCAACTTAATCGAGTTCCAAGTCAAGATCTCAAAGGATCGCGTCGGGTCGCCTTCACCGATTCCCTTCCTAAACACCGCACGACATCCGGCGA
TTCTGAGCATGGCACTGGCAGCAGATGTTGTCCCCGTTTATTTGCTTGTTGCGGATGGATATGCATTGGGGTGTTCGGAATTGTTCTCGTCGTCCTCATCCTTGGCGTGA
TATTTTTGTCCTTCCTTCAATCAGGATTGCCAGAGATCACCGTGAGAACCTTGGATTTGTCCAAGTTTGAGATTCAAAACTCCACAAATCAGGACAACGCCCCCCTAAAT
TCCAGAGTACAGATGTCTGTCGAGATGAAGAACAAGAATGAGAAAATGGAGTTGACTTATAGCAATATTGCGCTGAATCTGGTGTCAGAGGACGTGGGATTGGGCAAGAG
CGTGATTGCTGGTTTCTCTCATAATCCTGGAGATACCACACGTTTCAATCTAACCACAGACGTCGCAGGAGATTCCACAGATAGAGAGAACGCATTGGAACTAGAAGATG
ACAAAAAAAAGGTGCAAATGGATGTGCAGGTGACAATGGAAGCTACAATTGGTTTTCATCTCGGGATATTCCACCTGAACAAGGTGCCGATCCATGTAGGATGTGTTTAT
CATCAGTCTCTTATTTTGTATCGCGCACAGCAGCCCCCATGTAATATTAGAATGTTTCCCACCAGGTAA
Protein sequenceShow/hide protein sequence
MNDVGTQLNRVPSQDLKGSRRVAFTDSLPKHRTTSGDSEHGTGSRCCPRLFACCGWICIGVFGIVLVVLILGVIFLSFLQSGLPEITVRTLDLSKFEIQNSTNQDNAPLN
SRVQMSVEMKNKNEKMELTYSNIALNLVSEDVGLGKSVIAGFSHNPGDTTRFNLTTDVAGDSTDRENALELEDDKKKVQMDVQVTMEATIGFHLGIFHLNKVPIHVGCVY
HQSLILYRAQQPPCNIRMFPTR