; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g05150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g05150
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:3961020..3961898
RNA-Seq ExpressionMoc09g05150
SyntenyMoc09g05150
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575681.1 Zinc finger A20 and AN1 domain-containing stress-associated protein 8, partial [Cucurbita argyrosperma subsp. sororia]7.6e-8366.32Show/hide
Query:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT
        MAAIVTRRLSSKFLRP+P   SSTF ISQ P E F  I S       P+  RRTF  P++ L  S T +    L   RS  RS +FN +SI         
Subjt:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT

Query:  APIQSKSPNFDFRVRGTTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAV
            S +PN D R+RG   S    IP  RNP+       QKYGFSST ENEN +KPA+  HQDIEGPTVERDLSALAGETR V+E MMKNVYSLSKAMA+
Subjt:  APIQSKSPNFDFRVRGTTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAV

Query:  LGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITG
        LGLVQLGIG WISYATR SP TEVSIQSFV+FGFPFSLAFILRQSLKPM+FFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VS LC+TG
Subjt:  LGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITG

XP_022145979.1 uncharacterized protein LOC111015297 [Momordica charantia]1.3e-154100Show/hide
Query:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISSPRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFTAPIQSKS
        MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISSPRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFTAPIQSKS
Subjt:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISSPRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFTAPIQSKS

Query:  PNFDFRVRGTTGSDTCSIPIPRNPNLNYQKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQLGIGGW
        PNFDFRVRGTTGSDTCSIPIPRNPNLNYQKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQLGIGGW
Subjt:  PNFDFRVRGTTGSDTCSIPIPRNPNLNYQKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQLGIGGW

Query:  ISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALISR
        ISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALISR
Subjt:  ISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALISR

XP_022954073.1 uncharacterized protein LOC111456447 [Cucurbita moschata]2.1e-8867.33Show/hide
Query:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT
        MAAIVTRRLSSKFLRP+P   SSTF ISQNP E F  I S       P+  RRTF  P++ L NS T   +  L   RS  RS +FN S           
Subjt:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT

Query:  APIQSKSPNFDFRVRGTTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAV
              +PN D R+RG   S    IP  RNP+       QKYGFSST ENEN +KPA+  HQDIEGPTVERDLSALAGETR V+E MMKNVYSLSKAMA+
Subjt:  APIQSKSPNFDFRVRGTTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAV

Query:  LGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFAL
        LGLVQLGIG WISYATR SP TEVSIQSFV+FGFPFSLAFILRQSLKPM+FFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VS LC+TGLSVGVLFAL
Subjt:  LGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFAL

Query:  ISR
        +SR
Subjt:  ISR

XP_022991920.1 uncharacterized protein LOC111488415 [Cucurbita maxima]1.3e-8767.43Show/hide
Query:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT
        MAAIVTRRLSSK+LRP PSS+S    ISQNP E F  I S       P+  RRTF  P + L NS T   +  L   RS  RS +FN +SI         
Subjt:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT

Query:  APIQSKSPNFDFRVRG-TTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMA
            S +PN D R+RG    S    IP  RNP+       QKYGFSST ENEN +KPA+  HQDIEGPTVERDLSALAGETR V+E MMKNVYSLSKAMA
Subjt:  APIQSKSPNFDFRVRG-TTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMA

Query:  VLGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFA
        +LGLVQLGIG WISYATR SPITEVSIQSFV+FGFPFSLAFILRQSLKPM+FFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VS LC+TGLSVGVLFA
Subjt:  VLGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFA

Query:  LISR
        L+SR
Subjt:  LISR

XP_023549384.1 uncharacterized protein LOC111807746 [Cucurbita pepo subsp. pepo]2.7e-8867Show/hide
Query:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT
        MAAIVTRRLSSKFLRP+P   SSTF ISQNP E F  I S       P+  R+TF  P++ L NS T   +  L   R   RS +FN + I         
Subjt:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT

Query:  APIQSKSPNFDFRVRGTTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAV
            + SPN D R+RG   S    IP  RNP+       QKYGFSST ENEN +KPA+  HQDIEGPTVERDLSALAGETR V+E MMKNVYSLSKAMA+
Subjt:  APIQSKSPNFDFRVRGTTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAV

Query:  LGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFAL
        LGLVQLGIG WISYATR SP TEVSIQSFV+FGFPFSLAFILRQSLKPM+FFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VS LC+TGLSVGVLFAL
Subjt:  LGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFAL

Query:  ISR
        +SR
Subjt:  ISR

TrEMBL top hitse value%identityAlignment
A0A1S3CFJ7 uncharacterized protein LOC1034999119.7e-7662.42Show/hide
Query:  MAAIVTRRLSSKFLRP-LPSSSSSTFPISQNPIETFPHISSPRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFTAPIQSK
        MAAIVTRRLSS   RP LP    STF ISQNP + F  I SP       ++PS      RT+    +L+          FN  SI+K I P+     Q+K
Subjt:  MAAIVTRRLSSKFLRP-LPSSSSSTFPISQNPIETFPHISSPRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFTAPIQSK

Query:  SPNFDFRVRGTTGSDTCSIPIPRNPNLNY-----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQ
        SP F++ +    GS + SI    NPN  +     +K  FS+T E E+ QK  +  HQDIEGPTVERDLSALA ETR V+E MMKNVY LSKAMAVLGLVQ
Subjt:  SPNFDFRVRGTTGSDTCSIPIPRNPNLNY-----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQ

Query:  LGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALISR
        LG+G WISY TRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPM+FFKKMEEQGRLQILTL+LQI KNLN LFVRVR+VSLLC+TGLSVG+LFAL+SR
Subjt:  LGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALISR

A0A5A7UUD1 Uncharacterized protein9.7e-7662.42Show/hide
Query:  MAAIVTRRLSSKFLRP-LPSSSSSTFPISQNPIETFPHISSPRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFTAPIQSK
        MAAIVTRRLSS   RP LP    STF ISQNP + F  I SP       ++PS      RT+    +L+          FN  SI+K I P+     Q+K
Subjt:  MAAIVTRRLSSKFLRP-LPSSSSSTFPISQNPIETFPHISSPRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFTAPIQSK

Query:  SPNFDFRVRGTTGSDTCSIPIPRNPNLNY-----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQ
        SP F++ +    GS + SI    NPN  +     +K  FS+T E E+ QK  +  HQDIEGPTVERDLSALA ETR V+E MMKNVY LSKAMAVLGLVQ
Subjt:  SPNFDFRVRGTTGSDTCSIPIPRNPNLNY-----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQ

Query:  LGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALISR
        LG+G WISY TRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPM+FFKKMEEQGRLQILTL+LQI KNLN LFVRVR+VSLLC+TGLSVG+LFAL+SR
Subjt:  LGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALISR

A0A6J1CWU5 uncharacterized protein LOC1110152976.2e-155100Show/hide
Query:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISSPRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFTAPIQSKS
        MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISSPRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFTAPIQSKS
Subjt:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISSPRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFTAPIQSKS

Query:  PNFDFRVRGTTGSDTCSIPIPRNPNLNYQKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQLGIGGW
        PNFDFRVRGTTGSDTCSIPIPRNPNLNYQKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQLGIGGW
Subjt:  PNFDFRVRGTTGSDTCSIPIPRNPNLNYQKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQLGIGGW

Query:  ISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALISR
        ISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALISR
Subjt:  ISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALISR

A0A6J1GQ34 uncharacterized protein LOC1114564471.0e-8867.33Show/hide
Query:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT
        MAAIVTRRLSSKFLRP+P   SSTF ISQNP E F  I S       P+  RRTF  P++ L NS T   +  L   RS  RS +FN S           
Subjt:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT

Query:  APIQSKSPNFDFRVRGTTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAV
              +PN D R+RG   S    IP  RNP+       QKYGFSST ENEN +KPA+  HQDIEGPTVERDLSALAGETR V+E MMKNVYSLSKAMA+
Subjt:  APIQSKSPNFDFRVRGTTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAV

Query:  LGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFAL
        LGLVQLGIG WISYATR SP TEVSIQSFV+FGFPFSLAFILRQSLKPM+FFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VS LC+TGLSVGVLFAL
Subjt:  LGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFAL

Query:  ISR
        +SR
Subjt:  ISR

A0A6J1JU98 uncharacterized protein LOC1114884156.5e-8867.43Show/hide
Query:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT
        MAAIVTRRLSSK+LRP PSS+S    ISQNP E F  I S       P+  RRTF  P + L NS T   +  L   RS  RS +FN +SI         
Subjt:  MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISS-------PRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFT

Query:  APIQSKSPNFDFRVRG-TTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMA
            S +PN D R+RG    S    IP  RNP+       QKYGFSST ENEN +KPA+  HQDIEGPTVERDLSALAGETR V+E MMKNVYSLSKAMA
Subjt:  APIQSKSPNFDFRVRG-TTGSDTCSIPIPRNPNLNY----QKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMA

Query:  VLGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFA
        +LGLVQLGIG WISYATR SPITEVSIQSFV+FGFPFSLAFILRQSLKPM+FFKKMEEQGRLQILTLTLQIAKNLN LFVRVR VS LC+TGLSVGVLFA
Subjt:  VLGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFA

Query:  LISR
        L+SR
Subjt:  LISR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G12650.1 unknown protein6.5e-4862.87Show/hide
Query:  FSSTSENENAQKPAE--------LMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFP
        FS+ S     +KP E        + HQ+IEGPTVERDLSAL  ETR VLEGMMKN+YSLS AM  LGL QL +G  I YATR  P+ E++IQS +AFGFP
Subjt:  FSSTSENENAQKPAE--------LMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQLGIGGWISYATRGSPITEVSIQSFVAFGFP

Query:  FSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALIS
        F++A ++R+SLKPM FFKKMEE GRLQILTLTLQ+AKNLN+LFVR R VS+LC+  L  G LF L+S
Subjt:  FSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCATTGTTACGCGAAGGTTAAGCTCCAAATTTCTCAGACCGCTTCCATCTTCGTCTTCTTCCACATTTCCCATCTCCCAAAACCCTATCGAAACCTTTCCCCA
CATCTCTTCTCCACGTTTTAATCGACGAACATTTACAAACCCTTCTATTGGCCTCTTCAATTCCCGCACAGCTTCTGAAAATCCCTCCCTCAACACTGCCCGATCTTGCT
GCAGATCCACCGATTTCAATCCCAGCTCCATTACCAAAGTAATCGCTCCCAATTTTACAGCGCCGATTCAGAGTAAGAGCCCTAATTTTGATTTTCGGGTTCGGGGCACA
ACCGGGTCGGATACTTGTTCGATTCCGATTCCGAGAAACCCTAATTTGAACTATCAGAAATATGGATTTTCTTCAACCTCGGAGAACGAGAACGCGCAGAAACCGGCCGA
ATTGATGCACCAAGACATCGAAGGGCCCACTGTGGAGCGCGATCTGTCGGCGCTGGCCGGCGAAACCAGAGGAGTTCTCGAAGGGATGATGAAGAACGTGTACAGCTTAA
GCAAAGCCATGGCGGTTCTAGGTCTGGTTCAACTCGGGATCGGGGGTTGGATTTCGTACGCCACTCGCGGATCCCCGATTACAGAGGTTTCGATCCAGAGCTTCGTGGCG
TTCGGGTTTCCTTTCTCGTTGGCGTTCATTCTGCGGCAGTCACTGAAGCCGATGCTGTTCTTCAAGAAAATGGAAGAACAAGGTAGGTTGCAGATTCTAACTCTGACTCT
TCAGATTGCTAAGAATTTGAATGTCCTGTTTGTTCGAGTGCGAAGCGTTTCTTTGTTGTGCATAACGGGATTGTCTGTTGGAGTTTTGTTTGCCTTGATTTCGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCCATTGTTACGCGAAGGTTAAGCTCCAAATTTCTCAGACCGCTTCCATCTTCGTCTTCTTCCACATTTCCCATCTCCCAAAACCCTATCGAAACCTTTCCCCA
CATCTCTTCTCCACGTTTTAATCGACGAACATTTACAAACCCTTCTATTGGCCTCTTCAATTCCCGCACAGCTTCTGAAAATCCCTCCCTCAACACTGCCCGATCTTGCT
GCAGATCCACCGATTTCAATCCCAGCTCCATTACCAAAGTAATCGCTCCCAATTTTACAGCGCCGATTCAGAGTAAGAGCCCTAATTTTGATTTTCGGGTTCGGGGCACA
ACCGGGTCGGATACTTGTTCGATTCCGATTCCGAGAAACCCTAATTTGAACTATCAGAAATATGGATTTTCTTCAACCTCGGAGAACGAGAACGCGCAGAAACCGGCCGA
ATTGATGCACCAAGACATCGAAGGGCCCACTGTGGAGCGCGATCTGTCGGCGCTGGCCGGCGAAACCAGAGGAGTTCTCGAAGGGATGATGAAGAACGTGTACAGCTTAA
GCAAAGCCATGGCGGTTCTAGGTCTGGTTCAACTCGGGATCGGGGGTTGGATTTCGTACGCCACTCGCGGATCCCCGATTACAGAGGTTTCGATCCAGAGCTTCGTGGCG
TTCGGGTTTCCTTTCTCGTTGGCGTTCATTCTGCGGCAGTCACTGAAGCCGATGCTGTTCTTCAAGAAAATGGAAGAACAAGGTAGGTTGCAGATTCTAACTCTGACTCT
TCAGATTGCTAAGAATTTGAATGTCCTGTTTGTTCGAGTGCGAAGCGTTTCTTTGTTGTGCATAACGGGATTGTCTGTTGGAGTTTTGTTTGCCTTGATTTCGAGATGA
Protein sequenceShow/hide protein sequence
MAAIVTRRLSSKFLRPLPSSSSSTFPISQNPIETFPHISSPRFNRRTFTNPSIGLFNSRTASENPSLNTARSCCRSTDFNPSSITKVIAPNFTAPIQSKSPNFDFRVRGT
TGSDTCSIPIPRNPNLNYQKYGFSSTSENENAQKPAELMHQDIEGPTVERDLSALAGETRGVLEGMMKNVYSLSKAMAVLGLVQLGIGGWISYATRGSPITEVSIQSFVA
FGFPFSLAFILRQSLKPMLFFKKMEEQGRLQILTLTLQIAKNLNVLFVRVRSVSLLCITGLSVGVLFALISR