; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS006405 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS006405
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionForkhead-associated domain protein
Genome locationscaffold327:376137..377941
RNA-Seq ExpressionMS006405
SyntenyMS006405
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572449.1 hypothetical protein SDJN03_29177, partial [Cucurbita argyrosperma subsp. sororia]5.0e-10581.62Show/hide
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA +AK+ P  PL  PIRASS  STEQLREELSHLHSEAE+TR KAN+AR+RLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIY+KE+QLIGNIDSDL + TED SSPVRIASSEQ AAEDS ETD E KDV L E QDLQ   ED AI+ ND EQEA +CSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL

Query:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
        GSEDE+V  +KG SSYEDFMENLD QLN+IE EL+TVLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR R
Subjt:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR

XP_022147963.1 uncharacterized protein LOC111016762 [Momordica charantia]1.1e-13197.79Show/hide
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASS+QGAAEDSKETDF+SKDVKLTEYQ LQSKGEDHA+ITNDSEQEAPSCSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL

Query:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
        GSEDEIV SMKGTSSYEDFMENLDNQLN+IEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
Subjt:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR

XP_022952528.1 uncharacterized protein LOC111455192 [Cucurbita moschata]9.5e-10480.51Show/hide
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA +AK+ P  PL  PIRASS  STE+LREELSHLHSEA++TR KAN+AR+RLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKV+QALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIY+KE+QLIGNIDSDL + TED SSPVRIASSEQ AAEDS ETD E KDV L E QDLQ   ED AI+ ND EQEA +CSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL

Query:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
        GSEDE+V  +KG SSYEDFMENLD QLN+IE EL+TVLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR R
Subjt:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR

XP_023554111.1 uncharacterized protein LOC111811477 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-10380.88Show/hide
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA +AK+ P  PL  PIRASS  STEQLREELSHLHSEA+ TR KAN+AR+RLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIY+KE+QLIGNIDSDL + TED SSPVRIASSEQ AAEDS ETD   KDV L E QDLQ   ED AI+ ND EQEA +CSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL

Query:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
        GSEDE+V  +KG SSYEDFMENLD QLN+IE EL+TVLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR R
Subjt:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR

XP_038888021.1 uncharacterized protein LOC120077955 [Benincasa hispida]1.2e-10381.39Show/hide
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA IAK+ PL PL  PIRASS  STEQLREEL+HLHSEAE+TR KAN+ARLRLLRLSEAAEKLRRQAAISV+TGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQ-SKGEDHA-IITNDSEQEAPSCS
        SKSRIKL DELS KL+EAIYVKE+QLIGNI SDL + TED SSP+RIASSEQ AAEDS+ETDFESKDV L EY+DLQ S  EDHA II +D EQEAP CS
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQ-SKGEDHA-IITNDSEQEAPSCS

Query:  DLGSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
        DLGSE+++V SMKG SSYEDFMENLD QL++IE ELD VLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR R
Subjt:  DLGSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR

TrEMBL top hitse value%identityAlignment
A0A1S3BGP8 uncharacterized protein LOC1034896592.0e-9978.75Show/hide
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA IA+  P+ PL  PIRASS  ST+QLREEL+HLHSEAE TR KANSARLRLLRLSEAAEKLR+QAAISVRTGKED+ARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQ-SKGEDHAIITNDSEQEAPSCSD
        S SRIKL DELSAKLNEAIYVKE+QLIGNID DL + TED SSP+RIA+SEQ A +DSK+T FESKDV L E QD+  S  EDHA   ND EQE P CSD
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQ-SKGEDHAIITNDSEQEAPSCSD

Query:  LGSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
        LGSEDEIV SMKG SSYEDFMENLD QLN+IE ELD VLRASTVLL+GEDKQKN RVQQI+EL +SIR+IR R
Subjt:  LGSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR

A0A5D3CL95 Uncharacterized protein2.0e-9978.75Show/hide
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA IA+  P+ PL  PIRASS  ST+QLREEL+HLHSEAE TR KANSARLRLLRLSEAAEKLR+QAAISVRTGKED+ARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQ-SKGEDHAIITNDSEQEAPSCSD
        S SRIKL DELSAKLNEAIYVKE+QLIGNID DL + TED SSP+RIA+SEQ A +DSK+T FESKDV L E QD+  S  EDHA   ND EQE P CSD
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQ-SKGEDHAIITNDSEQEAPSCSD

Query:  LGSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
        LGSEDEIV SMKG SSYEDFMENLD QLN+IE ELD VLRASTVLL+GEDKQKN RVQQI+EL +SIR+IR R
Subjt:  LGSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR

A0A6J1D2R5 uncharacterized protein LOC1110167625.2e-13297.79Show/hide
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASS+QGAAEDSKETDF+SKDVKLTEYQ LQSKGEDHA+ITNDSEQEAPSCSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL

Query:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
        GSEDEIV SMKGTSSYEDFMENLDNQLN+IEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
Subjt:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR

A0A6J1GKV2 uncharacterized protein LOC1114551924.6e-10480.51Show/hide
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA +AK+ P  PL  PIRASS  STE+LREELSHLHSEA++TR KAN+AR+RLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKV+QALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIY+KE+QLIGNIDSDL + TED SSPVRIASSEQ AAEDS ETD E KDV L E QDLQ   ED AI+ ND EQEA +CSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL

Query:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
        GSEDE+V  +KG SSYEDFMENLD QLN+IE EL+TVLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR R
Subjt:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR

A0A6J1HZQ7 uncharacterized protein LOC1114683921.1e-10279.78Show/hide
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA +AK+ P  PL  PIRASS  STEQLREEL+HLHSEA++TR KAN+AR RLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIY+KE+QLIGNIDSDL + TED SSPVRIASSEQ AA+DS ETD + KDV L E QDLQ   ED AI+ ND EQEA +CSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDL

Query:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
        GSEDE+V  +K  SSYEDFMENLD QLN+IE EL+TVLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR R
Subjt:  GSEDEIVKSMKGTSSYEDFMENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G06510.1 unknown protein2.0e-5150.81Show/hide
Query:  SSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEKSKSRIKLFDELSAKLNEAIYVKEN
        S+T++++ LR +L  LH+EAE+TRAKANS RLRLLRLSEAAE LR QAA++VRTGKE+DARDLL QKKKVMQAL+K+K+RI+L D LS+KLNEAI VKE 
Subjt:  SSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEKSKSRIKLFDELSAKLNEAIYVKEN

Query:  QLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDLGSEDEIVKSMKGTSSYEDFMENLD
        QLIGNI  DL    E+TS  + I S +  + ED  E D    D +  +  +   +     + TN++  E  S   +         +K  SSYE F+ENLD
Subjt:  QLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDLGSEDEIVKSMKGTSSYEDFMENLD

Query:  NQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR
         +L+ IE EL TV+  ++++L  EDK KN +VQQ  E+L+ IR +R R
Subjt:  NQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATTGAATTCTGCTGCTGGAATAGCTAAGCTGCATCCTCTCCTGCCTCTATCGAACCCCATACGCGCGTCTTCCACCAGCAGCACGGAACAGCTGCGCGAGGAACT
CAGTCACCTTCATTCTGAAGCAGAGAATACAAGAGCCAAAGCAAATAGTGCAAGACTGAGACTTCTGAGATTGTCGGAGGCAGCTGAGAAGCTTCGGCGACAGGCAGCTA
TTAGCGTACGAACAGGGAAGGAAGATGACGCGAGGGATCTACTTTTCCAGAAGAAGAAGGTTATGCAAGCGTTGGAGAAGTCAAAGAGTCGCATTAAGCTGTTTGATGAA
CTGTCAGCAAAGCTTAACGAGGCAATATATGTAAAAGAGAATCAGCTAATTGGGAATATTGATTCGGATCTGGCAATTAGAACTGAAGATACTTCAAGTCCAGTTCGAAT
TGCCTCTTCGGAGCAGGGAGCTGCAGAAGATTCAAAAGAAACTGATTTCGAATCTAAAGATGTAAAGCTTACTGAATATCAAGATTTGCAATCTAAGGGAGAGGATCATG
CAATCATAACTAATGACAGTGAGCAAGAGGCCCCTTCATGCTCTGATTTAGGGAGTGAAGATGAAATAGTAAAGAGTATGAAGGGAACATCATCGTATGAGGACTTCATG
GAAAACCTGGACAACCAGCTAAACATGATTGAAGGTGAACTCGATACTGTTCTGAGGGCTTCAACAGTACTATTAGAAGGCGAGGACAAACAAAAAAATTCAAGGGTGCA
GCAAATAGTGGAACTTCTAGATAGCATCCGGGTTATCAGAAATAGG
mRNA sequenceShow/hide mRNA sequence
ATGATATTGAATTCTGCTGCTGGAATAGCTAAGCTGCATCCTCTCCTGCCTCTATCGAACCCCATACGCGCGTCTTCCACCAGCAGCACGGAACAGCTGCGCGAGGAACT
CAGTCACCTTCATTCTGAAGCAGAGAATACAAGAGCCAAAGCAAATAGTGCAAGACTGAGACTTCTGAGATTGTCGGAGGCAGCTGAGAAGCTTCGGCGACAGGCAGCTA
TTAGCGTACGAACAGGGAAGGAAGATGACGCGAGGGATCTACTTTTCCAGAAGAAGAAGGTTATGCAAGCGTTGGAGAAGTCAAAGAGTCGCATTAAGCTGTTTGATGAA
CTGTCAGCAAAGCTTAACGAGGCAATATATGTAAAAGAGAATCAGCTAATTGGGAATATTGATTCGGATCTGGCAATTAGAACTGAAGATACTTCAAGTCCAGTTCGAAT
TGCCTCTTCGGAGCAGGGAGCTGCAGAAGATTCAAAAGAAACTGATTTCGAATCTAAAGATGTAAAGCTTACTGAATATCAAGATTTGCAATCTAAGGGAGAGGATCATG
CAATCATAACTAATGACAGTGAGCAAGAGGCCCCTTCATGCTCTGATTTAGGGAGTGAAGATGAAATAGTAAAGAGTATGAAGGGAACATCATCGTATGAGGACTTCATG
GAAAACCTGGACAACCAGCTAAACATGATTGAAGGTGAACTCGATACTGTTCTGAGGGCTTCAACAGTACTATTAGAAGGCGAGGACAAACAAAAAAATTCAAGGGTGCA
GCAAATAGTGGAACTTCTAGATAGCATCCGGGTTATCAGAAATAGG
Protein sequenceShow/hide protein sequence
MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEKSKSRIKLFDE
LSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSEQGAAEDSKETDFESKDVKLTEYQDLQSKGEDHAIITNDSEQEAPSCSDLGSEDEIVKSMKGTSSYEDFM
ENLDNQLNMIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNR