CuGenDBv2

MC08g0760 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC08g0760
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Thioredoxin-like_fold domain-containing protein
Genome location: MC08:6194584..6198829
RNA-Seq Expression: MC08g0760
Synteny: MC08g0760
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR012336 - Thioredoxin-like fold
                  IPR036249 - Thioredoxin-like superfamily


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_022144362.1 uncharacterized protein LOC111014065 [Momordica charantia] | e value: 3.58e-190 | % identity: 100
Query:  MNDVYFANLVEILVTKDRRNSSTKLQSKAKMAKLETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSR
        MNDVYFANLVEILVTKDRRNSSTKLQSKAKMAKLETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSR
Subjt:  MNDVYFANLVEILVTKDRRNSSTKLQSKAKMAKLETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSR

Query:  DSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTG
        DSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTG
Subjt:  DSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTG

Query:  FNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGFLAPDKGSAINYTGWRDLIDPLVGNKKRIG
        FNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGFLAPDKGSAINYTGWRDLIDPLVGNKKRIG
Subjt:  FNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGFLAPDKGSAINYTGWRDLIDPLVGNKKRIG

XP_022951987.1 uncharacterized protein LOC111454702 [Cucurbita moschata] | e value: 3.83e-129 | % identity: 78.72
Query:  LETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAY
        +E   S    L S A  FFFF FF   A AQ+LPP KFDGF YGN + D +T+ IEAFYDPVCPDSRDSWPPLKKALDYYGSRV LVIHLLPLPYHDNAY
Subjt:  LETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAY

Query:  AASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGF
        AASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAET YLSR S+VD IVKFG +VLG+SY+ ALV GFNDR+TDLLTRVSFKFSTSRGVYGTPFFFINGF
Subjt:  AASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGF

Query:  LAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL
        LAPD+GS +N+TGWR LIDPL+  KKR  S HLSL
Subjt:  LAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL

XP_023002487.1 uncharacterized protein LOC111496311 [Cucurbita maxima] | e value: 2.49e-130 | % identity: 79.32
Query:  LETRHSQSPPLHSAA--FYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDN
        +E   S    L S A  F+FFFF FF   A AQ+LPP KFDGF YGN + D +T+ IEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDN
Subjt:  LETRHSQSPPLHSAA--FYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDN

Query:  AYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFIN
        AYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAET YLSR S+VD IVKFG +VLG+SY+ ALV GFNDR+TDLLTRVSFKFSTSRGVYGTPFFFIN
Subjt:  AYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFIN

Query:  GFLAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL
        GFLAPD+GS +NYTGWR LIDPL+  KKR  S HLSL
Subjt:  GFLAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL

XP_023538607.1 uncharacterized protein LOC111799321 [Cucurbita pepo subsp. pepo] | e value: 3.03e-128 | % identity: 79.15
Query:  LETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAY
        +E   S    L S A  FFFF FF   A AQ+LPP KFDGF YGN + D +T+ IEAFYDPVCPDSRDSWPPLKKALDYYGSRV LVIHLLPLPYHDNAY
Subjt:  LETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAY

Query:  AASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGF
        AASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAET YLSR S+VD IVKFG +VLG+SY+ ALV GFNDR+TDLLTRVSFKFSTSRGVYGTPFFFINGF
Subjt:  AASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGF

Query:  LAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL
        LAPD+GS +NYTGWR LIDPL+  KKR  S HLSL
Subjt:  LAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL

XP_038886167.1 uncharacterized protein LOC120076418 [Benincasa hispida] | e value: 5.71e-131 | % identity: 79.15
Query:  LETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAY
        +E   +   PL S AF F F A FL  A AQ+LPP KFDGF YGN + D DT+LIEAF+DPVCPDSRDSWPPLKKALD+YGSRVRLVIHLLPLPYHDNAY
Subjt:  LETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAY

Query:  AASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGF
        AASRALHIVDLVNPS TF LLEAFFG QKQFYNAET YLSR S+VD IVKFG EVLG SY+  L+TGFNDR+TDLLTRVSFKFSTSRGVYGTPFFFINGF
Subjt:  AASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGF

Query:  LAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL
        LAPDKGS INYTGWR+LIDPL+   KR GS HLSL
Subjt:  LAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0LKC9 Thioredoxin-like_fold domain-containing protein | e value: 1.29e-127 | % identity: 78.39
Query:  LETRHSQSPPLHSA-AFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNA
        +E  +S S  L S  AF F F A F  TA AQ+LPPPKFDGF YGN + DF+T+ IEAF+DPVCPDSRDSWPPLKKALD+YGSRVRLVIHLLPLPYHDNA
Subjt:  LETRHSQSPPLHSA-AFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNA

Query:  YAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFING
        YAASRALHIVDLVNPS TF LLEAFFG QKQFYNAET YLSR +IVD +VKFG EVLG SY+  LVTGFNDR TDLLTRVSFKFSTSRGVYGTPFFFING
Subjt:  YAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFING

Query:  FLAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL
        FLAPDKGS +NYT WR+LIDPL+   KR GS HLSL
Subjt:  FLAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL

A0A1S3BD67 uncharacterized protein LOC103488395 | e value: 7.61e-125 | % identity: 75.21
Query:  MAKLETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHD
        M K  T  S    L + +F+F F A FL TA AQ+LPP KFDGF YGN + D +T+LIEAF+DPVCPDSRDSWPPLKKALD+YGSRVRLVIHLLPLPYH+
Subjt:  MAKLETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHD

Query:  NAYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFI
        NAYA SRALHIVDLVNPS TF LLEAFFG QKQFYNAET YLSR +++D IVKFG EVLG SY+  LVTGF+DR TDLLTRVSFKFSTSRGVYGTPFFFI
Subjt:  NAYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFI

Query:  NGFLAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL
        NGFLAPDKGS +NY  WR+LIDPL+   KR GS HLSL
Subjt:  NGFLAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL

A0A6J1CTH2 uncharacterized protein LOC111014065 | e value: 1.73e-190 | % identity: 100
Query:  MNDVYFANLVEILVTKDRRNSSTKLQSKAKMAKLETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSR
        MNDVYFANLVEILVTKDRRNSSTKLQSKAKMAKLETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSR
Subjt:  MNDVYFANLVEILVTKDRRNSSTKLQSKAKMAKLETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSR

Query:  DSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTG
        DSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTG
Subjt:  DSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTG

Query:  FNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGFLAPDKGSAINYTGWRDLIDPLVGNKKRIG
        FNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGFLAPDKGSAINYTGWRDLIDPLVGNKKRIG
Subjt:  FNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGFLAPDKGSAINYTGWRDLIDPLVGNKKRIG

A0A6J1GJ54 uncharacterized protein LOC111454702 | e value: 1.86e-129 | % identity: 78.72
Query:  LETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAY
        +E   S    L S A  FFFF FF   A AQ+LPP KFDGF YGN + D +T+ IEAFYDPVCPDSRDSWPPLKKALDYYGSRV LVIHLLPLPYHDNAY
Subjt:  LETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAY

Query:  AASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGF
        AASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAET YLSR S+VD IVKFG +VLG+SY+ ALV GFNDR+TDLLTRVSFKFSTSRGVYGTPFFFINGF
Subjt:  AASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGF

Query:  LAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL
        LAPD+GS +N+TGWR LIDPL+  KKR  S HLSL
Subjt:  LAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL

A0A6J1KJN0 uncharacterized protein LOC111496311 | e value: 1.21e-130 | % identity: 79.32
Query:  LETRHSQSPPLHSAA--FYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDN
        +E   S    L S A  F+FFFF FF   A AQ+LPP KFDGF YGN + D +T+ IEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDN
Subjt:  LETRHSQSPPLHSAA--FYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDN

Query:  AYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFIN
        AYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAET YLSR S+VD IVKFG +VLG+SY+ ALV GFNDR+TDLLTRVSFKFSTSRGVYGTPFFFIN
Subjt:  AYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFIN

Query:  GFLAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL
        GFLAPD+GS +NYTGWR LIDPL+  KKR  S HLSL
Subjt:  GFLAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G20225.1 Thioredoxin superfamily protein | e value: 2.0e-64 | % identity: 56.59
Query:  FFAFFLTTAL-AQNLPPPKFDGFPY-GNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSAT
        F  FF+ T + AQ +PP + DGF Y   R  D DT+LIEA+ DPVCPD RD+W PLK A+D+YGSRV LV+HL+PLP+HDNA+ ASRALHIVD +N +AT
Subjt:  FFAFFLTTAL-AQNLPPPKFDGFPY-GNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSAT

Query:  FTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGFLAPDKGSAINYTGWRDL
        F LLE  F  Q  FYN++T  +SR ++V++++K G   LG SY   L +GF++  +DL TRVSFK+S SRGV  TP F++NGF  P  GS  +Y GWRD 
Subjt:  FTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGFLAPDKGSAINYTGWRDL

Query:  IDPLV
        IDPLV
Subjt:  IDPLV

AT1G76020.1 Thioredoxin superfamily protein | e value: 4.0e-68 | % identity: 59.71
Query:  FFAFFLTTAL-AQNLPPPKFDGFPY--GNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSA
        F  F + T + AQ +PP + DGF Y  G+R +D DT+LIEA++DPVCPDSRDSWPPLK+AL +YGSRV L++HLLPLPYHDNAY  SRALHIV+ V+ +A
Subjt:  FFAFFLTTAL-AQNLPPPKFDGFPY--GNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKALDYYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSA

Query:  TFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGFLAPDKGSAINYTGWRD
        TF+LLE FF  Q  FYNA+T  LSR ++V+KIV+ G   LG SY+  L +GF+D+ +D  TRVSFK+S SRGVYGTP F++NGF+  D  S  N+ GW+ 
Subjt:  TFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSRGVYGTPFFFINGFLAPDKGSAINYTGWRD

Query:  LIDPLV
        +IDPLV
Subjt:  LIDPLV
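
The % identity values listed for the hits above look consistent with the usual BLAST convention of identical columns divided by alignment length (for example, 185 identical columns out of 235 would give 78.72). The sketch below assumes that convention and assumes the unlabeled middle line of each alignment block marks identical residues with a letter, conservative substitutions with `+`, and mismatches with a space. The function names are hypothetical and not part of CuGenDB.

```python
def count_identities(midline: str) -> int:
    """Count identical columns: letters on the alignment's middle line."""
    return sum(ch.isalpha() for ch in midline)


def percent_identity(query: str, midline: str) -> float:
    """Percent identity = identical columns / alignment length * 100.

    `query` is the aligned query string (gap characters included), so its
    length is taken as the alignment length. This is an assumed convention.
    """
    if not query:
        raise ValueError("empty alignment")
    return 100.0 * count_identities(midline) / len(query)


# Last block of the Cucurbita moschata alignment, copied from this record.
query_block = "LAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL"
mid_block = "LAPD+GS +N+TGWR LIDPL+  KKR  S HLSL"
print(count_identities(mid_block), "identical of", len(query_block))
```

To reproduce a hit's overall figure, the blocks of one alignment would be concatenated before calling `percent_identity`.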


Sequences
CDS sequence
ATGAATGATGTATATTTCGCTAACCTTGTGGAAATTTTGGTAACCAAAGACAGAAGAAACTCCTCTACGAAGCTCCAAAGCAAAGCAAAAATGGCGAAACTGGAAACCCG
TCACTCACAGTCACCACCGCTTCATTCAGCTGCCTTTTACTTCTTTTTCTTCGCCTTCTTCCTTACGACCGCGCTTGCTCAGAATCTACCACCGCCAAAATTCGATGGAT
TTCCGTACGGAAATCGCACCTACGACTTCGATACCGTTCTCATCGAAGCCTTCTACGACCCGGTTTGCCCCGATAGCAGAGACTCCTGGCCGCCGCTCAAGAAGGCCCTC
GACTACTACGGCTCTCGCGTTCGTCTCGTCATCCACCTCCTCCCACTGCCTTATCATGACAACGCATATGCTGCGTCTCGTGCTTTACACATCGTGGATTTGGTGAATCC
CTCGGCTACATTCACATTGTTGGAGGCATTCTTCGGTCAACAGAAGCAATTTTACAATGCCGAAACTCATTATTTGTCAAGGGTTTCTATTGTGGATAAAATCGTGAAGT
TTGGAGCGGAAGTACTCGGGAAGTCTTACGAAAAGGCTCTAGTAACTGGCTTCAATGATAGGAACACTGATCTTTTGACACGTGTTTCTTTCAAGTTTAGCACCTCAAGA
GGAGTCTACGGAACACCCTTTTTCTTTATAAATGGATTCCTAGCTCCTGATAAAGGTTCTGCCATAAATTACACCGGATGGAGGGACTTGATCGACCCTTTGGTCGGGAA
TAAGAAGAGAATAGGGTCTCCACATCTATCGCTATGA
mRNA sequence
ATTGCCCATATCCAAAGTTTAATTATGAATGATGTATATTTCGCTAACCTTGTGGAAATTTTGGTAACCAAAGACAGAAGAAACTCCTCTACGAAGCTCCAAAGCAAAGC
AAAAATGGCGAAACTGGAAACCCGTCACTCACAGTCACCACCGCTTCATTCAGCTGCCTTTTACTTCTTTTTCTTCGCCTTCTTCCTTACGACCGCGCTTGCTCAGAATC
TACCACCGCCAAAATTCGATGGATTTCCGTACGGAAATCGCACCTACGACTTCGATACCGTTCTCATCGAAGCCTTCTACGACCCGGTTTGCCCCGATAGCAGAGACTCC
TGGCCGCCGCTCAAGAAGGCCCTCGACTACTACGGCTCTCGCGTTCGTCTCGTCATCCACCTCCTCCCACTGCCTTATCATGACAACGCATATGCTGCGTCTCGTGCTTT
ACACATCGTGGATTTGGTGAATCCCTCGGCTACATTCACATTGTTGGAGGCATTCTTCGGTCAACAGAAGCAATTTTACAATGCCGAAACTCATTATTTGTCAAGGGTTT
CTATTGTGGATAAAATCGTGAAGTTTGGAGCGGAAGTACTCGGGAAGTCTTACGAAAAGGCTCTAGTAACTGGCTTCAATGATAGGAACACTGATCTTTTGACACGTGTT
TCTTTCAAGTTTAGCACCTCAAGAGGAGTCTACGGAACACCCTTTTTCTTTATAAATGGATTCCTAGCTCCTGATAAAGGTTCTGCCATAAATTACACCGGATGGAGGGA
CTTGATCGACCCTTTGGTCGGGAATAAGAAGAGAATAGGGTCTCCACATCTATCGCTATGAAGCGTTTGGTCTATATAAACTTTCTAGTCTCCATGGCCATACATCTTTA
CTTACTTTATACATATCTATATGGTTTGCAGATGGAAAACAAATACTGGAAAGCATACAATTGCAGACTTTGCATTTATGTGATACCCCATTATCTCAGAAAAAGGTATA
GCAAGATTGGTTATGCAATAAATGACTCTATGTTTTGTCTTTCTTTTTATGTTTTGTAATTGTCAAGTAATGTGAAATATAAACTCAGACAGAGATGCTTTATATGGTTC
TTCTCTTTTTATGAAGAAAATATACATCACTGGCTTAGTTACTTCGCATGGAGCTGCCTCGTTCTACCAGTTGCTTATCCACTTCATGATGCCATTTTATACCCCTGCTA
ATTCTCTAGTTACGATACTTTTGATAAAGGATTTGAAAAAAAACATTTTTTTTTTATTTTTATAAAATATCTCTTTTCCTTTGTTAATGCATAATTGTGTGTTTTGCTTC
ATTTTCTATTAGAGGACTTGAGTACACCTTGAATTTGTCTGTCTAGCATAGAAAAAAGGGAATAATTTAAAGGTTTGCTTTTGGGTTGGATCATTAGGGGTCCAAGAGGC
TGACTTGTTGGGCTGGTCCAATATTGAAATAAAGGGTTGGCCTCACTTGGCCAGTTAAACTTTGATCTTTGGCTGGAGCTTAGGGAGCCTCTTTAATGACTTGGGTGTGA
GAAAAGTGTGGGCTTGGAGAAACATTAGAAAGCAATCAGAGTTAGGGTCTTGTGGTAAAAAAAAAAGGTTTTAATTAAAATACTTGTGTGTTTAAAATGTCAAATTACTA
TAAAGTGATTCTTAGTTAATCATTCTAGAATCATTTTTTTGAGAAGTTATTCTTCCATAAATGATTATAAACCAATTTGATTATTACACAAAAATATTTTTGGCCCTCCG
AAATCATTTGGAAGCTAATTCGAATCCATTCTAAGTCCGGACAAGTAAAAAGACCACTCAAGAATGCATGTGGTTAGTTTAGGAGTGAATGTCAAAACACTACCAAGTAA
ACAAACCAAGTAGAATCAGTTTTAGTACCAAGTAAACAAACCAAGTAGAATCAGTTTTAAGGATGTAATCTTCAATAAAATTAGAACTCTTAAAAAGAGTAGATGAACAA
TAAGTTGAATAGGAGTTTGGGTTGCCATATTTATAGTAGAGGTAGAAGTAAATAAATGGGATTGAAAGAAGAGAAGAGATGAGTTATGCCTAGAAGGGAGAGTGAACACT
TTTGCAAACATGTGTGAGCTGTGAGTTCTGACACCCACAACTTTTTTTGAGTTTCAGTGTCTGTCTTAGTTTGTCCAATTGCAGGTCTACTCCTTGATATTACCAATGAT
CTTATCCAAACCAACCAGTACTGGCTTGGCACACATCTAAATGTGTATGAGGAAGCATCTTCATCTTAACTTGGGTTTTTGTTTCCTCTCTTAGATTCACTTTTCTTTTT
GTTCTCTGTTGGCAACTGATGCATTGGAAAATTCACATATTACTGCTCCCTGCATAGTTGTGTTATGCTTTGCAGGTGTGTTCGGTTCGGTTGTGTTGTGCATAGGAGGT
ATTCTATTCGAATTCCTACAGGACCATTTTCTTTTCAATTAACATTAATAACTACTTTAGCTTTTTGTAAGAATTTCTAAGTCTACAAGTGAGGGGAGTGTGAAATATTG
ATAAATGATTATATTTACGGTAACCTATCAGTTTAAACTTTTGAGTTGATTGATGATGTAACATCCTATCAGAGTAGGAGGTTGTGTACTCGGAGCCTTGTAATGTCATT
TTCTCCCCATTTAATATTGATTAACATTTGTTGGGTTTTGACCTTTTGGTCAGAGATCGGTGTTAACTATCGTTGAGCTAAATTCATG
Protein sequence
MNDVYFANLVEILVTKDRRNSSTKLQSKAKMAKLETRHSQSPPLHSAAFYFFFFAFFLTTALAQNLPPPKFDGFPYGNRTYDFDTVLIEAFYDPVCPDSRDSWPPLKKAL
DYYGSRVRLVIHLLPLPYHDNAYAASRALHIVDLVNPSATFTLLEAFFGQQKQFYNAETHYLSRVSIVDKIVKFGAEVLGKSYEKALVTGFNDRNTDLLTRVSFKFSTSR
GVYGTPFFFINGFLAPDKGSAINYTGWRDLIDPLVGNKKRIGSPHLSL
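
The protein sequence above should correspond to a translation of the CDS sequence. As a minimal sketch of how that could be checked, the snippet below translates a nucleotide string with the standard genetic code, stopping at the first stop codon. It assumes the standard code applies to this nuclear gene, and `translate` is a hypothetical helper, not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    unexpected characters are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed in this record.
print(translate("ATGAATGATGTATATTTCGCT"))  # MNDVYFA
```

Pasting the full CDS text into `translate` and comparing the result with the protein sequence would confirm that the two entries agree.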