CuGenDBv2

Lag0033032 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0033032
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ferredoxin
Genome location: chr11:40155872..40161939
RNA-Seq Expression: Lag0033032
Synteny: Lag0033032
Gene Ontology terms:
  GO:0022900 - electron transport chain (biological process)
  GO:0009507 - chloroplast (cellular component)
  GO:0009055 - electron transfer activity (molecular function)
  GO:0046872 - metal ion binding (molecular function)
  GO:0051537 - 2 iron, 2 sulfur cluster binding (molecular function)
InterPro domains:
  IPR001041 - 2Fe-2S ferredoxin-type iron-sulfur binding domain
  IPR012675 - Beta-grasp domain superfamily
  IPR036010 - 2Fe-2S ferredoxin-like superfamily
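
The genome location in this record follows a `chrom:start..end` pattern. A minimal Python sketch for splitting it and computing the span is given here; it assumes the coordinates are 1-based and inclusive, which the record does not state, and `parse_location` is a hypothetical helper name rather than part of any database API.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1. The record itself does not say this.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("chr11:40155872..40161939")
print(chrom, start, end, length)
```

Under that assumption the gene spans 6068 bp on chr11; with a 0-based, half-open convention the figure would be 6067 instead.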


Homology
GenBank top hits (e value, % identity, alignment)
KAG6571846.1 Ferredoxin C 2, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.3e-131; % identity: 72.19)
Query:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETGPRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLD
        MRFF+KIAGILG SKDDAHDVK+ED+DVDSDNQA           RRGFSVPVQVA+NRPQPGP+L+PSSS DGGVQGLTWYAKRLRVDEDGDVAEQFLD
Subjt:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETGPRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLD

Query:  EVLPEMPTSTTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRLLL----------------------------------------------
        EVLPE+  STTD+ K  PRFQINNRNRPAKVE QVILQEGKLQQCIEH+GRLLL                                              
Subjt:  EVLPEMPTSTTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRLLL----------------------------------------------

Query:  ------------PESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFA
                    P S R+F+S NRN RFSD+LKCRRKTTS ELQAAVDVA  +ENGS SIPTHKV VHDRERGVVHEFVVPEDQYILHTAEAQSISLPFA
Subjt:  ------------PESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFA

Query:  CRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
        CRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
Subjt:  CRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE

KAG6606242.1 AP-2 complex subunit mu, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.8e-139; % identity: 76.44)
Query:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETGPRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLD
        MRF +KIAG LGFSKDDAHDVK+ED++V+S NQAP RVH+QETGPRRGFSVPVQVA+N PQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLD
Subjt:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETGPRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLD

Query:  EVLPEMPTS-TTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRL-----------------------------------------------
        EVLPE PTS TTD+ KP+PRFQINNRNR AKV+NQVILQEGKLQQCIEH+  +                                               
Subjt:  EVLPEMPTS-TTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRL-----------------------------------------------

Query:  --LLPESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTS
          L PESSR FLS NRNRRFSDLLKCRRKT STELQA VDVAG +++GSASIPTHKV VHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTS
Subjt:  --LLPESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTS

Query:  CAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
        CAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
Subjt:  CAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE

XP_022155937.1 uncharacterized protein LOC111022934 [Momordica charantia] (e value: 8.0e-70; % identity: 94.41)
Query:  ESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRI
        E SRRFLSPNRNRRFSDLLKCRRKTTSTELQAA+DVAG ++NGSASIPTHKV VHDR+RGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRI
Subjt:  ESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRI

Query:  KSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
        KSG+IRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
Subjt:  KSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE

XP_022930965.1 uncharacterized protein LOC111437299 [Cucurbita moschata] (e value: 3.0e-69; % identity: 87.9)
Query:  LQQCIEHKGRLLLPESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPF
        L  C   +   L PESSR FLS NRNRRFSDLLKCRRKT STELQA VDVAG +++GSASIPTHKV VHDRERGVVHEFVVPEDQYILHTAEAQSISLPF
Subjt:  LQQCIEHKGRLLLPESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPF

Query:  ACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
        ACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
Subjt:  ACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE

XP_023533597.1 uncharacterized protein LOC111795416 [Cucurbita pepo subsp. pepo] (e value: 1.8e-69; % identity: 87.74)
Query:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETGPRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLD
        MRF +KIAG LGFSKDDAHDVK+ED++V+S NQAP RVH+QETGPRRGFSVPVQVA+N PQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLD
Subjt:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETGPRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLD

Query:  EVLPEMPTS-TTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRLLL
        EVLPEMPTS TTD+ KP+PRFQINNRNR AKV+NQVILQEGKLQQCIEH+GRLLL
Subjt:  EVLPEMPTS-TTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRLLL

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CRI6 uncharacterized protein LOC103503527 (e value: 1.5e-69; % identity: 86.45)
Query:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETG-PRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFL
        MRFF+KIAGILGFSKDD+HDVKNEDDDVDSD  APDRVH+Q TG PRRGFSVPVQVA+NR  PGPIL+PSSSGDGGVQGLTWYAK LR+DEDGDVAEQFL
Subjt:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETG-PRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFL

Query:  DEVLPEMPTSTTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRLLL
        +EV+PE+ TSTTD+ KPFPRFQINNRNRPAKVENQVIL+EGKLQQCIEH+GRLLL
Subjt:  DEVLPEMPTSTTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRLLL

A0A6J1DQQ0 Ferredoxin (e value: 3.9e-70; % identity: 94.41)
Query:  ESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRI
        E SRRFLSPNRNRRFSDLLKCRRKTTSTELQAA+DVAG ++NGSASIPTHKV VHDR+RGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRI
Subjt:  ESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRI

Query:  KSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
        KSG+IRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
Subjt:  KSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE

A0A6J1DQR1 uncharacterized protein LOC111022940 (e value: 5.6e-69; % identity: 86.27)
Query:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETG-PRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFL
        MRFF++IAGILGF+KDD+HD+KNED+DVDSDNQ  +RVH+QETG PRRGFSVPVQVA+NRPQPGPILVPSSSGDGGVQGL WYA RLRVDEDGDVAEQFL
Subjt:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETG-PRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFL

Query:  DEVLPEMPTSTTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRL
        DEVLPEMPTSTTD+ K FPRFQ+N+RNRPAKVENQVILQEGKLQQCI+H+GRL
Subjt:  DEVLPEMPTSTTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRL

A0A6J1ESD7 Ferredoxin (e value: 1.5e-69; % identity: 87.9)
Query:  LQQCIEHKGRLLLPESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPF
        L  C   +   L PESSR FLS NRNRRFSDLLKCRRKT STELQA VDVAG +++GSASIPTHKV VHDRERGVVHEFVVPEDQYILHTAEAQSISLPF
Subjt:  LQQCIEHKGRLLLPESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPF

Query:  ACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
        ACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
Subjt:  ACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE

A0A6J1KA07 uncharacterized protein LOC111491455 (e value: 4.3e-69; % identity: 87.1)
Query:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETGPRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLD
        MRF +KIAG LGFSKD+AHDVKNED++V+S NQAP RVH+QETGPRRGFSVPVQVA+N PQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLD
Subjt:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETGPRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLD

Query:  EVLPEMPTS-TTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRLLL
        EVLPE PTS TTD+ KP+PRFQINNRNR AKV+NQVILQEGKLQQCIEH+GRLLL
Subjt:  EVLPEMPTS-TTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRLLL

SwissProt top hits (e value, % identity, alignment)
P00253 Ferredoxin (e value: 2.0e-15; % identity: 40)
Query:  THKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
        T KV + +   G  HE  VP+D+YIL  AE +   LPF+CR G C++CA ++ SG + Q +   +  +    GY L CV +PTSDV ++T  E++
Subjt:  THKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE

P08451 Ferredoxin-2 (e value: 1.4e-16; % identity: 44.32)
Query:  RGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDEKLA
        +G    F    DQ +L +A+A  + LP +C  G CT+CA RI SG++ QP+A+G+  E   +GY LLCV +P SD+++ET  EDE  A
Subjt:  RGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDEKLA

P0A3C7 Ferredoxin-1 (e value: 9.0e-16; % identity: 41.05)
Query:  THKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
        T KV + +   G  HE  VP+D+YIL  AE Q   LPF+CR G C++CA ++ SG + Q +   +  +    GY L CV +PTSDV ++T  E++
Subjt:  THKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE

P0A3C8 Ferredoxin-1 (e value: 9.0e-16; % identity: 41.05)
Query:  THKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE
        T KV + +   G  HE  VP+D+YIL  AE Q   LPF+CR G C++CA ++ SG + Q +   +  +    GY L CV +PTSDV ++T  E++
Subjt:  THKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE

Q9C7Y4 Ferredoxin C 2, chloroplastic (e value: 3.3e-50; % identity: 71.21)
Query:  NRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEAL
        NRR+    +    T + E +  V+V+ PS+ GS  +P+HKV VHDR+RGVVHEF VPEDQYILH+AE+Q+ISLPFACRHGCCTSCAVR+KSG++RQP+AL
Subjt:  NRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEAL

Query:  GISAELKSKGYALLCVGFPTSDVEVETQDEDE
        GISAELKS+GYALLCVGFPTSD+EVETQDEDE
Subjt:  GISAELKSKGYALLCVGFPTSDVEVETQDEDE

Arabidopsis top hits (e value, % identity, alignment)
AT1G10960.1 ferredoxin 1 (e value: 5.1e-14; % identity: 45.95)
Query:  EDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDED
        ED Y+L  AE   + LP++CR G C+SCA ++ SG I Q +   +  E  S+GY L CV +PTSDV +ET  E+
Subjt:  EDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDED

AT1G32550.1 2Fe-2S ferredoxin-like superfamily protein (e value: 2.3e-51; % identity: 71.21)
Query:  NRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEAL
        NRR+    +    T + E +  V+V+ PS+ GS  +P+HKV VHDR+RGVVHEF VPEDQYILH+AE+Q+ISLPFACRHGCCTSCAVR+KSG++RQP+AL
Subjt:  NRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEAL

Query:  GISAELKSKGYALLCVGFPTSDVEVETQDEDE
        GISAELKS+GYALLCVGFPTSD+EVETQDEDE
Subjt:  GISAELKSKGYALLCVGFPTSDVEVETQDEDE

AT1G32550.2 2Fe-2S ferredoxin-like superfamily protein (e value: 1.5e-45; % identity: 62.16)
Query:  NRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEAL
        NRR+    +    T + E +  V+V+ PS+ GS  +P+HKV VHDR+RGVVHEF   EDQYILH+AE+Q+ISLPFACRHGCCTSCAVR+KSG++RQP+AL
Subjt:  NRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEFVVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEAL

Query:  GISAELKSK----------------GYALLCVGFPTSDVEVETQDEDE
        GISAELKS+                GYALLCVGFPTSD+EVETQDEDE
Subjt:  GISAELKSK----------------GYALLCVGFPTSDVEVETQDEDE

AT4G17960.1 unknown protein (e value: 7.8e-39; % identity: 52.53)
Query:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETG-PRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFL
        M FFKK+AG+ GF ++    +KNE+DD+D      ++   +ETG PR+GF VPVQVA+ R Q GP+L P ++GDGG+QGL WY KRLRVDEDGDVA++FL
Subjt:  MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETG-PRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFL

Query:  DEVLPEMPTSTTDNQ---KPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRLLL
        +E   E  T+  D+    K  PRFQI  + +P KV   V+  +GKLQQCIEH+GRL +
Subjt:  DEVLPEMPTSTTDNQ---KPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRLLL

AT5G46620.1 unknown protein (e value: 8.3e-25; % identity: 43.04)
Query:  MRFFKKIAGILGFSKDDAHD----VKNEDDDVDSDN-----------QAPDRVHLQETGPRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKR
        M F KK+ GILGF  +D       V++ED D   +N           +  ++   +ETGPR+GF VPVQVA+ R  PGPIL P ++ DGGVQGL WY+ R
Subjt:  MRFFKKIAGILGFSKDDAHD----VKNEDDDVDSDN-----------QAPDRVHLQETGPRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKR

Query:  LRVDEDGDVAEQFLDEVLPEMPTSTTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQ
        L++DEDGDVA++FL++           N K  PR     + + AKV   VI  +GKLQ
Subjt:  LRVDEDGDVAEQFLDEVLPEMPTSTTDNQKPFPRFQINNRNRPAKVENQVILQEGKLQ
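
In the alignments above, the unlabeled line between Query and Subjt is the match line: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch of how a percent identity can be derived from that line, the Python below counts the letters in one match line; `midline_identity` is a hypothetical helper name, and the function assumes the string covers the full aligned width, including any trailing spaces.

```python
def midline_identity(midline):
    """Return (identical, aligned, percent) for one BLAST-style match line.

    Letters count as identical residues; '+' and spaces do not.
    Assumption: the string spans every aligned column of the block.
    """
    aligned = len(midline)
    identical = sum(1 for ch in midline if ch.isalpha())
    percent = 100.0 * identical / aligned if aligned else 0.0
    return identical, aligned, percent


# Match line of the final alignment block of the XP_022155937.1 hit.
midline = "KSG+IRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDE"
identical, aligned, percent = midline_identity(midline)
print(f"{identical}/{aligned} identical ({percent:.2f}%)")
```

This covers a single block only. The % identity reported for a hit refers to the whole alignment, so the counts from every block of that hit would need to be summed before dividing.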


Sequences
CDS sequence
ATGCGGTTCTTCAAGAAGATTGCAGGAATTCTAGGGTTTTCCAAGGACGACGCTCACGACGTCAAGAACGAAGATGATGATGTTGATTCCGATAATCAGGCTCCCGATCG
CGTCCATTTGCAGGAAACTGGTCCTCGGAGGGGCTTCAGCGTCCCTGTTCAGGTCGCTCTCAATCGTCCTCAACCTGGCCCTATTCTTGTTCCCTCTAGTTCCGGCGATG
GTGGAGTTCAGGGTTTGACATGGTACGCTAAGCGTCTCCGGGTTGATGAAGATGGAGATGTAGCTGAACAGTTCCTGGACGAGGTCTTACCTGAGATGCCAACAAGTACG
ACAGACAATCAAAAGCCATTTCCACGATTTCAGATAAATAACAGAAATAGGCCAGCTAAAGTAGAGAACCAGGTTATCTTGCAGGAGGGTAAACTTCAACAGTGTATTGA
ACATAAAGGTAGATTGCTATTGCCGGAATCCTCCCGGCGATTCCTCTCACCTAATCGGAACCGTCGCTTTTCCGACCTCTTGAAATGTCGCCGAAAAACGACCTCCACTG
AGCTCCAGGCAGCTGTCGACGTCGCCGGCCCGTCCGAAAATGGTTCTGCTTCGATTCCGACTCATAAGGTTATCGTTCACGATAGAGAGAGAGGCGTTGTTCATGAATTC
GTTGTTCCTGAGGATCAATACATATTGCACACTGCCGAAGCTCAGAGCATATCTCTTCCTTTTGCTTGCAGGCACGGTTGTTGCACTAGTTGTGCTGTTCGAATAAAATC
GGGCCAAATTAGACAGCCTGAAGCCCTTGGAATATCTGCTGAGTTGAAATCAAAGGGGTATGCACTTCTTTGCGTAGGTTTTCCAACCTCAGATGTTGAAGTAGAAACGC
AAGATGAGGATGAGAAGCTTGCTAAGTCTATTTACCCCTTTGAGCCTAATTCTCGTGTTGGCCAATCTTGA
mRNA sequence
ATGCGGTTCTTCAAGAAGATTGCAGGAATTCTAGGGTTTTCCAAGGACGACGCTCACGACGTCAAGAACGAAGATGATGATGTTGATTCCGATAATCAGGCTCCCGATCG
CGTCCATTTGCAGGAAACTGGTCCTCGGAGGGGCTTCAGCGTCCCTGTTCAGGTCGCTCTCAATCGTCCTCAACCTGGCCCTATTCTTGTTCCCTCTAGTTCCGGCGATG
GTGGAGTTCAGGGTTTGACATGGTACGCTAAGCGTCTCCGGGTTGATGAAGATGGAGATGTAGCTGAACAGTTCCTGGACGAGGTCTTACCTGAGATGCCAACAAGTACG
ACAGACAATCAAAAGCCATTTCCACGATTTCAGATAAATAACAGAAATAGGCCAGCTAAAGTAGAGAACCAGGTTATCTTGCAGGAGGGTAAACTTCAACAGTGTATTGA
ACATAAAGGTAGATTGCTATTGCCGGAATCCTCCCGGCGATTCCTCTCACCTAATCGGAACCGTCGCTTTTCCGACCTCTTGAAATGTCGCCGAAAAACGACCTCCACTG
AGCTCCAGGCAGCTGTCGACGTCGCCGGCCCGTCCGAAAATGGTTCTGCTTCGATTCCGACTCATAAGGTTATCGTTCACGATAGAGAGAGAGGCGTTGTTCATGAATTC
GTTGTTCCTGAGGATCAATACATATTGCACACTGCCGAAGCTCAGAGCATATCTCTTCCTTTTGCTTGCAGGCACGGTTGTTGCACTAGTTGTGCTGTTCGAATAAAATC
GGGCCAAATTAGACAGCCTGAAGCCCTTGGAATATCTGCTGAGTTGAAATCAAAGGGGTATGCACTTCTTTGCGTAGGTTTTCCAACCTCAGATGTTGAAGTAGAAACGC
AAGATGAGGATGAGAAGCTTGCTAAGTCTATTTACCCCTTTGAGCCTAATTCTCGTGTTGGCCAATCTTGA
Protein sequence
MRFFKKIAGILGFSKDDAHDVKNEDDDVDSDNQAPDRVHLQETGPRRGFSVPVQVALNRPQPGPILVPSSSGDGGVQGLTWYAKRLRVDEDGDVAEQFLDEVLPEMPTST
TDNQKPFPRFQINNRNRPAKVENQVILQEGKLQQCIEHKGRLLLPESSRRFLSPNRNRRFSDLLKCRRKTTSTELQAAVDVAGPSENGSASIPTHKVIVHDRERGVVHEF
VVPEDQYILHTAEAQSISLPFACRHGCCTSCAVRIKSGQIRQPEALGISAELKSKGYALLCVGFPTSDVEVETQDEDEKLAKSIYPFEPNSRVGQS
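
The protein sequence corresponds to the CDS read codon by codon. The following Python sketch illustrates that relationship on the first and last few codons of the CDS; it assumes the standard genetic code (NCBI translation table 1), and `translate` is a hypothetical helper written for illustration, not the pipeline used to produce this record.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Trailing bases that do not fill a codon are ignored; codons with
    characters other than A, C, G, T raise KeyError.
    """
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGCGGTTCTTCAAGAAGATTGCAGGA"))  # first 27 bases of the CDS
print(translate("GTTGGCCAATCTTGA"))              # last 15 bases of the CDS
```

The two calls yield `MRFFKKIAG` and `VGQS`, which match the start and end of the protein sequence listed above; passing the full CDS (with line breaks removed) would allow the whole protein to be compared.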