CuGenDBv2

Moc04g00530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g00530
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Forkhead-associated domain protein
Genome location: chr4:394261..396249
RNA-Seq Expression: Moc04g00530
Synteny: Moc04g00530
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6572449.1 hypothetical protein SDJN03_29177, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.7e-109; % identity: 80.63)
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA +AK+ P  PL  PIRASS  STEQLREELSHLHSEAE+TR KAN+AR+RLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIY+KE+QLIGNIDSDL + TED SSPVRIASS+Q AAEDS ETD + KDV L E Q LQ   ED A++ ND EQEA +CSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL

Query:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
        GSEDE+V  +KG SSYEDFMENLD QLNIIE EL+TVLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR R+SSF+LANANIR
Subjt:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR

XP_022147963.1 uncharacterized protein LOC111016762 [Momordica charantia] (e value: 9.2e-142; % identity: 100)
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL

Query:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
        GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
Subjt:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR

XP_022952528.1 uncharacterized protein LOC111455192 [Cucurbita moschata] (e value: 5.1e-108; % identity: 79.58)
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA +AK+ P  PL  PIRASS  STE+LREELSHLHSEA++TR KAN+AR+RLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKV+QALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIY+KE+QLIGNIDSDL + TED SSPVRIASS+Q AAEDS ETD + KDV L E Q LQ   ED A++ ND EQEA +CSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL

Query:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
        GSEDE+V  +KG SSYEDFMENLD QLNIIE EL+TVLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR R+SSF+LANANIR
Subjt:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR

XP_023554111.1 uncharacterized protein LOC111811477 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 4.6e-109; % identity: 80.63)
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA +AK+ P  PL  PIRASS  STEQLREELSHLHSEA+ TR KAN+AR+RLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIY+KE+QLIGNIDSDL + TED SSPVRIASS+Q AAEDS ETD   KDV L E Q LQ   ED A++ ND EQEA +CSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL

Query:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
        GSEDE+VN +KG SSYEDFMENLD QLNIIE EL+TVLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR R+SSF+LANANIR
Subjt:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR

XP_038888021.1 uncharacterized protein LOC120077955 [Benincasa hispida] (e value: 1.0e-108; % identity: 81.12)
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA IAK+ PL PL  PIRASS  STEQLREEL+HLHSEAE+TR KAN+ARLRLLRLSEAAEKLRRQAAISV+TGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQ-SKGEDHA-MITNDSEQEAPSCS
        SKSRIKL DELS KL+EAIYVKE+QLIGNI SDL + TED SSP+RIASS+Q AAEDS+ETDF+SKDV L EY+ LQ S  EDHA +I +D EQEAP CS
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQ-SKGEDHA-MITNDSEQEAPSCS

Query:  DLGSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
        DLGSE+++VNSMKG SSYEDFMENLD QL+IIE ELD VLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR RVSSF+LANANIR
Subjt:  DLGSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K5N3 Uncharacterized protein (e value: 4.5e-102; % identity: 76.84)
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA IAK  P+ PL  PIRASS  ST+QLR+EL+HLHSEAE TR KANSARLRLLRLSEAAEKLR+QAAISVRTGKE++ARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQ-SKGEDHAMITNDSEQEAPSCSD
        S SRIKL DELSAKLNEAIYVKE+QLIGNID DL + TED SSP+RIA+S+Q A +DS+ET F++KDV L E Q +  S GEDHA   ND EQE P CSD
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQ-SKGEDHAMITNDSEQEAPSCSD

Query:  LGSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
        LGSEDE VNSMKG SSYEDFMENLD QLN IE ELD VLRASTVLL+ +DKQKN RVQQI+EL +SIR+IR RVSSF+LAN NIR
Subjt:  LGSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR

A0A1S3BGP8 uncharacterized protein LOC103489659 (e value: 1.7e-104; % identity: 78.6)
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA IA+  P+ PL  PIRASS  ST+QLREEL+HLHSEAE TR KANSARLRLLRLSEAAEKLR+QAAISVRTGKED+ARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQ-SKGEDHAMITNDSEQEAPSCSD
        S SRIKL DELSAKLNEAIYVKE+QLIGNID DL + TED SSP+RIA+S+Q A +DSK+T F+SKDV L E Q +  S  EDHA   ND EQE P CSD
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQ-SKGEDHAMITNDSEQEAPSCSD

Query:  LGSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
        LGSEDEIVNSMKG SSYEDFMENLD QLNIIE ELD VLRASTVLL+GEDKQKN RVQQI+EL +SIR+IR RVSSF+LAN NIR
Subjt:  LGSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR

A0A6J1D2R5 uncharacterized protein LOC111016762 (e value: 4.4e-142; % identity: 100)
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL

Query:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
        GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
Subjt:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR

A0A6J1GKV2 uncharacterized protein LOC111455192 (e value: 2.5e-108; % identity: 79.58)
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA +AK+ P  PL  PIRASS  STE+LREELSHLHSEA++TR KAN+AR+RLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKV+QALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIY+KE+QLIGNIDSDL + TED SSPVRIASS+Q AAEDS ETD + KDV L E Q LQ   ED A++ ND EQEA +CSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL

Query:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
        GSEDE+V  +KG SSYEDFMENLD QLNIIE EL+TVLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR R+SSF+LANANIR
Subjt:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR

A0A6J1HZQ7 uncharacterized protein LOC111468392 (e value: 5.5e-108; % identity: 79.58)
Query:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
        MILNSAA +AK+ P  PL  PIRASS  STEQLREEL+HLHSEA++TR KAN+AR RLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK
Subjt:  MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEK

Query:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL
        SKSRIKLFDELSAKLNEAIY+KE+QLIGNIDSDL + TED SSPVRIASS+Q AA+DS ETD + KDV L E Q LQ   ED A++ ND EQEA +CSDL
Subjt:  SKSRIKLFDELSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDL

Query:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
        GSEDE+VN +K  SSYEDFMENLD QLNIIE EL+TVLRASTVLL+GEDKQKN RVQQIVEL DSIR+IR R+SSF+LANANIR
Subjt:  GSEDEIVNSMKGTSSYEDFMENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G06510.1 unknown protein (e value: 3.5e-54; % identity: 51.54)
Query:  SSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEKSKSRIKLFDELSAKLNEAIYVKEN
        S+T++++ LR +L  LH+EAE+TRAKANS RLRLLRLSEAAE LR QAA++VRTGKE+DARDLL QKKKVMQAL+K+K+RI+L D LS+KLNEAI VKE 
Subjt:  SSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEKSKSRIKLFDELSAKLNEAIYVKEN

Query:  QLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDLGSEDEIVNS-MKGTSSYEDFMENL
        QLIGNI  DL    E+TS  + I S K  + ED  E D          + HL S+G        +  QE    ++   ED  + S +K  SSYE F+ENL
Subjt:  QLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDLGSEDEIVNS-MKGTSSYEDFMENL

Query:  DNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANI
        D +L+ IE EL TV+  ++++L  EDK KN +VQQ  E+L+ IR +R R+++     ANI
Subjt:  DNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANI


Sequences
CDS sequence
ATGATATTGAATTCTGCTGCTGGAATAGCTAAGCTGCATCCTCTCCTGCCTCTATCGAACCCCATACGCGCGTCTTCCACCAGCAGCACGGAACAGCTGCGCGAGGAACT
CAGTCACCTTCATTCTGAAGCAGAGAATACAAGAGCCAAAGCAAATAGTGCAAGACTGAGACTTCTGAGATTGTCGGAGGCAGCTGAGAAGCTTCGGCGACAGGCAGCTA
TTAGCGTACGAACAGGGAAGGAAGATGACGCGAGGGATCTACTTTTCCAGAAGAAGAAGGTTATGCAAGCGTTGGAGAAGTCAAAGAGTCGCATTAAGCTGTTTGATGAA
CTGTCAGCAAAGCTTAACGAGGCAATATATGTAAAAGAGAATCAGCTAATTGGGAATATTGATTCGGATCTGGCAATTAGAACTGAAGATACTTCAAGTCCAGTTCGAAT
TGCCTCTTCGAAGCAGGGAGCTGCAGAAGATTCAAAAGAAACTGATTTCAAATCTAAAGATGTAAAGCTTACTGAATATCAACATTTGCAATCTAAGGGAGAGGATCATG
CAATGATAACTAATGACAGTGAGCAAGAGGCCCCTTCATGCTCTGATTTAGGGAGTGAAGATGAAATAGTAAACAGTATGAAGGGAACATCGTCGTATGAGGACTTCATG
GAAAACCTGGACAACCAGCTAAACATAATTGAAGGTGAACTCGATACTGTTCTGAGGGCTTCAACAGTACTATTAGAAGGCGAGGACAAACAAAAAAATTCAAGGGTGCA
GCAAATAGTGGAACTTCTAGATAGCATCCGGGTTATCAGAAATAGAGTCTCAAGTTTCCAGTTGGCAAATGCGAACATCAGATGA
mRNA sequence
ATGATATTGAATTCTGCTGCTGGAATAGCTAAGCTGCATCCTCTCCTGCCTCTATCGAACCCCATACGCGCGTCTTCCACCAGCAGCACGGAACAGCTGCGCGAGGAACT
CAGTCACCTTCATTCTGAAGCAGAGAATACAAGAGCCAAAGCAAATAGTGCAAGACTGAGACTTCTGAGATTGTCGGAGGCAGCTGAGAAGCTTCGGCGACAGGCAGCTA
TTAGCGTACGAACAGGGAAGGAAGATGACGCGAGGGATCTACTTTTCCAGAAGAAGAAGGTTATGCAAGCGTTGGAGAAGTCAAAGAGTCGCATTAAGCTGTTTGATGAA
CTGTCAGCAAAGCTTAACGAGGCAATATATGTAAAAGAGAATCAGCTAATTGGGAATATTGATTCGGATCTGGCAATTAGAACTGAAGATACTTCAAGTCCAGTTCGAAT
TGCCTCTTCGAAGCAGGGAGCTGCAGAAGATTCAAAAGAAACTGATTTCAAATCTAAAGATGTAAAGCTTACTGAATATCAACATTTGCAATCTAAGGGAGAGGATCATG
CAATGATAACTAATGACAGTGAGCAAGAGGCCCCTTCATGCTCTGATTTAGGGAGTGAAGATGAAATAGTAAACAGTATGAAGGGAACATCGTCGTATGAGGACTTCATG
GAAAACCTGGACAACCAGCTAAACATAATTGAAGGTGAACTCGATACTGTTCTGAGGGCTTCAACAGTACTATTAGAAGGCGAGGACAAACAAAAAAATTCAAGGGTGCA
GCAAATAGTGGAACTTCTAGATAGCATCCGGGTTATCAGAAATAGAGTCTCAAGTTTCCAGTTGGCAAATGCGAACATCAGATGA
Protein sequence
MILNSAAGIAKLHPLLPLSNPIRASSTSSTEQLREELSHLHSEAENTRAKANSARLRLLRLSEAAEKLRRQAAISVRTGKEDDARDLLFQKKKVMQALEKSKSRIKLFDE
LSAKLNEAIYVKENQLIGNIDSDLAIRTEDTSSPVRIASSKQGAAEDSKETDFKSKDVKLTEYQHLQSKGEDHAMITNDSEQEAPSCSDLGSEDEIVNSMKGTSSYEDFM
ENLDNQLNIIEGELDTVLRASTVLLEGEDKQKNSRVQQIVELLDSIRVIRNRVSSFQLANANIR
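
The CDS and protein sequences listed for this gene can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies; the names CODON_TABLE and translate are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code is the right one for this CDS).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines from the record can be pasted in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS listed above.
print(translate("ATGATATTGAATTCTGCTGCTGGAATAGCTAAG"))
```

Translating the first 33 nucleotides of the CDS gives MILNSAAGIAK, which matches the start of the protein sequence; passing the full CDS to translate allows the same comparison for the whole record.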