; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g27770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g27770
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionVARLMGL domain-containing protein
Genome locationchr5:20212667..20216130
RNA-Seq ExpressionMoc05g27770
SyntenyMoc05g27770
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017598.1 hypothetical protein SDJN02_19464 [Cucurbita argyrosperma subsp. argyrosperma]3.4e-9060.7Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLS NN SRRNKSITFVHN+NQE L +A SE+K   I   P+  S  +  IS  + P+
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE

Query:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD
        + ERSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ +S   ESFR SSP   GK IE         
Subjt:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD

Query:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC
         M WEQQQ PSPVSVLEE+SRPR F N H C +S    +PSAN G        Q +QRKK  E D IF +SKFER ++   VVGN+   K EK  ESP C
Subjt:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC

Query:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
         S++AM+DS+EGVCKDI+WG   EVGRIGLALQH I G+L+EELVKD     TF Y+SLPFEAC+RRLC
Subjt:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

XP_022935193.1 uncharacterized protein LOC111442147 [Cucurbita moschata]2.0e-9060.43Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI--P
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLS NN SRRNKSITFVHN+NQE L +A SE+K   I   P+  S+ +  IS  +  P
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI--P

Query:  ELERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD
          ERSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ +S   ESFR SSP   GK IE         
Subjt:  ELERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD

Query:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC
         M WEQQQ PSPVSVLEE+SRPR F N H C +S    +PSAN G        Q +QRKK  E D IF +SKFER ++   VVGN+   K EK  ESP C
Subjt:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC

Query:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
          ++AM+DS+EGVCKDI+WG   EVGRIGLALQH I G+L+EELVKD     TF Y+SLPFEAC+RRLC
Subjt:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

XP_022983018.1 uncharacterized protein LOC111481687 [Cucurbita maxima]2.0e-9061.25Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLS NN SRRNKSITFVHN+NQE L +A SE+K   I   P+  S+ +  IS  + P+
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE

Query:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD
        + ERSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ ES   ESFR SSP   GK IE         
Subjt:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD

Query:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC
         M WEQ Q PSPVSVLEE+SRPR F N H C +S    +PSAN G        Q +QRKK  E D IF +SKFER ++   VVGN   LK EK  ESP C
Subjt:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC

Query:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
          ++AM+DSVEGVCKDI+WG   EVGRIGLALQH I G+LIEELVKD     TF Y+SLPFEAC+RRLC
Subjt:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

XP_023528398.1 uncharacterized protein LOC111791339 [Cucurbita pepo subsp. pepo]3.7e-8960Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS---NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLS   NN SRRNKSITFVHN+NQE L +A SE+K   I   P+  S+ +  IS  + 
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS---NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-

Query:  PEL-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSY
        P++ ERSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ +S   ESFR SSP   GK IE       
Subjt:  PEL-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSY

Query:  RDSMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGITM----QQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPF
           M WEQQQ PSPVSVLEE+SRPR F N H C +S    +PSAN G       QQ+QRKK  E D IF +SKFER ++   VVGN+   K EK  ESP 
Subjt:  RDSMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGITM----QQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPF

Query:  CSSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
        C  ++AM+DS+EGVCKDI+WG   EVGRIG+ALQH I G+L+EELVKD     TF  +SLPFEAC+RRLC
Subjt:  CSSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

XP_038905750.1 uncharacterized protein LOC120091709 [Benincasa hispida]1.2e-8459.79Show/hide
Query:  MKRQNSFLSSSSQIEISSDN-LRR------SKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRI-SGQIPEL
        MKRQNSFLSSSSQ+EISSDN LRR      S+SFGCVS LLHFLS NN SRRNKSITFVHN+  EL +AIS+   S I   P++ S+ +  I S   P++
Subjt:  MKRQNSFLSSSSQIEISSDN-LRR------SKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRI-SGQIPEL

Query:  -ERSESLISTE--ENFRGARGPIVRLMGLESCAA----TAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDS
         ERSES +ST   E FRGARGPIVRLMGLES  A     AA EKQRQ+M+ALEKCEQDLK LK+FI++FES   ESFRSSSPAG GK IEL         
Subjt:  -ERSESLISTE--ENFRGARGPIVRLMGLESCAA----TAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDS

Query:  MNWEQQQQPSPV--SVLEEISRPRRFGNVHGCHSLFSDRPSANSG----ITMQQIQRKK-AGEEDIF-KLSKFERI-----EVVVGNFNFLKGEKPVE-S
        M  +Q+++ SPV  SV+EE+SR R F N     ++F  RPSANSG      +QQ+QRKK  G+  +F  LSKF+       E+V+GN+   K EK  E S
Subjt:  MNWEQQQQPSPV--SVLEEISRPRRFGNVHGCHSLFSDRPSANSG----ITMQQIQRKK-AGEEDIF-KLSKFERI-----EVVVGNFNFLKGEKPVE-S

Query:  PFCSSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFN-YSSLPFEACKRRLC
        P C SK AMRDSVE V K+I+WGQN+E+GRIGLALQ+ ICG+LIEELVKDL +  T   YSSLPFEACKRRLC
Subjt:  PFCSSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFN-YSSLPFEACKRRLC

TrEMBL top hitse value%identityAlignment
A0A0A0LF86 Uncharacterized protein2.2e-5051.88Show/hide
Query:  STEENFRGARGPIVRLMGLESCAATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDSMNWEQQQQPSPVSV
        +T E FRGARGPIVRLMGLES   TA  EKQRQV++ALEKCE+DLKALK+FID+FES   ESFRSSSPAG GK IEL         M  +Q+++ +PV+ 
Subjt:  STEENFRGARGPIVRLMGLESCAATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDSMNWEQQQQPSPVSV

Query:  LEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-----IFKLSKFERI-----EVVVGNFNFL-KGEKPVESPFCSS----KK
         EE+S P  F N  G  +S+   RPS N G T     +QQ+QRKK  ++      +  +SKF+       E+V+G +    KG   +    C S    K 
Subjt:  LEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-----IFKLSKFERI-----EVVVGNFNFL-KGEKPVESPFCSS----KK

Query:  AMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNY--SSLPFEACKRRL
         MR+SVE V +DI WGQ +E+GRIGL LQ+ ICG+LIEELVKDL F  TF Y  +SLPF+ACKR L
Subjt:  AMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNY--SSLPFEACKRRL

A0A1S3B6W0 uncharacterized protein LOC1034868183.7e-8256.42Show/hide
Query:  MKRQNSFL--SSSSQIEISSDNL------RRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNA-----SSDSRRISGQ
        MKRQNSFL  SSSSQ++ISSDNL      RRS+SFGCVS LLHFLSN+  RRNKSITFVHN+  EL D IS +K+SP       A     SSDS RI+  
Subjt:  MKRQNSFL--SSSSQIEISSDNL------RRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNA-----SSDSRRISGQ

Query:  IPELERSESLISTE--ENFRGARGPIVRLMGLESCAATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDSM
            ERS+S +ST   E FRGARGPIVRLMGLES  +TA  EKQRQVM+ALEKCE+DLKALK+FID+FES   ESFRS SPAG GK IEL         M
Subjt:  IPELERSESLISTE--ENFRGARGPIVRLMGLESCAATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDSM

Query:  NWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSG----ITMQQIQRKKAGEED-----IFKLSKFERI-----EVVVGNFNFLKGEKPVES
          +QQ++ SPV+  EE+S P    N HG  +S+   RPSANSG      +QQ+QRKK  +++     +  +SKF+R      E+V+GN+   +  K +  
Subjt:  NWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSG----ITMQQIQRKKAGEED-----IFKLSKFERI-----EVVVGNFNFLKGEKPVES

Query:  PFC-SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFN-YSSLPFEACKRRLC
          C S+K AMR+SVE V KDI WGQ +E+GRIGL LQ+ ICG+LIEELVKDL F  TF  Y+SLPF+ACKR LC
Subjt:  PFC-SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFN-YSSLPFEACKRRLC

A0A6J1DLL1 uncharacterized protein LOC1110215694.5e-56100Show/hide
Query:  MKRQNSFLSSSSQIEISSDNLRRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRISGQIPELERSESLIST
        MKRQNSFLSSSSQIEISSDNLRRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRISGQIPELERSESLIST
Subjt:  MKRQNSFLSSSSQIEISSDNLRRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRISGQIPELERSESLIST

Query:  EENFRGARGPIVRLMGLESCA
        EENFRGARGPIVRLMGLESCA
Subjt:  EENFRGARGPIVRLMGLESCA

A0A6J1F9V5 uncharacterized protein LOC1114421479.6e-9160.43Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI--P
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLS NN SRRNKSITFVHN+NQE L +A SE+K   I   P+  S+ +  IS  +  P
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI--P

Query:  ELERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD
          ERSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ +S   ESFR SSP   GK IE         
Subjt:  ELERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD

Query:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC
         M WEQQQ PSPVSVLEE+SRPR F N H C +S    +PSAN G        Q +QRKK  E D IF +SKFER ++   VVGN+   K EK  ESP C
Subjt:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC

Query:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
          ++AM+DS+EGVCKDI+WG   EVGRIGLALQH I G+L+EELVKD     TF Y+SLPFEAC+RRLC
Subjt:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

A0A6J1J6K2 uncharacterized protein LOC1114816879.6e-9161.25Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLS NN SRRNKSITFVHN+NQE L +A SE+K   I   P+  S+ +  IS  + P+
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLS-NNQSRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE

Query:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD
        + ERSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ ES   ESFR SSP   GK IE         
Subjt:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD

Query:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC
         M WEQ Q PSPVSVLEE+SRPR F N H C +S    +PSAN G        Q +QRKK  E D IF +SKFER ++   VVGN   LK EK  ESP C
Subjt:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGC-HSLFSDRPSANSGIT-----MQQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC

Query:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
          ++AM+DSVEGVCKDI+WG   EVGRIGLALQH I G+LIEELVKD     TF Y+SLPFEAC+RRLC
Subjt:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G56810.1 unknown protein1.1e-0938.71Show/hide
Query:  KKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFC-------------STFNY--------------SSLPFEACKRRL
        ++++ +SV  VC D+A GQ REV +IGLAL  HIC +LI E V++L F              S+  Y              +SLPF+AC+RRL
Subjt:  KKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFC-------------STFNY--------------SSLPFEACKRRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGACAGAACAGTTTCTTATCATCGTCTTCTCAAATCGAAATTTCTTCCGATAACCTCCGCCGCTCCAAATCCTTCGGCTGCGTTTCCGGCCTCCTCCATTTCCT
CTCCAACAACCAGAGCCGCCGCAACAAATCCATCACATTCGTACACAACAATAATCAAGAACTTCACGATGCAATTTCCGAGACGAAATCGTCTCCAATCCGCATTGTTC
CTCAAAATGCCTCATCCGATTCGCGGCGGATCTCCGGCCAGATTCCGGAGCTTGAGCGATCCGAATCCCTAATTTCGACGGAGGAGAACTTCCGCGGAGCGAGAGGTCCG
ATTGTGCGACTTATGGGACTGGAAAGTTGCGCGGCAACCGCCGCGACGGAGAAGCAGAGGCAGGTAATGGATGCTTTGGAGAAATGCGAACAGGACCTGAAGGCGCTGAA
GGACTTCATCGACTCGTTTGAGTCGGCGCCGGCGGAGAGTTTCCGATCGTCGTCTCCGGCCGGTGCCGGAAAAGGAATTGAACTGTTTTCCGGCGGAAGTTATCGGGATT
CGATGAATTGGGAGCAGCAGCAGCAGCCGAGTCCGGTATCGGTGCTTGAGGAGATTAGTCGGCCGCGGCGTTTTGGTAACGTCCATGGCTGCCATAGCCTTTTCTCCGAC
CGGCCATCTGCAAATTCTGGAATAACAATGCAACAAATTCAAAGGAAGAAGGCAGGAGAAGAGGACATCTTCAAACTAAGCAAATTTGAGAGGATTGAAGTAGTAGTTGG
TAACTTCAACTTCCTCAAAGGTGAAAAACCAGTTGAATCCCCATTTTGTAGTAGCAAAAAGGCAATGAGAGACAGCGTAGAAGGAGTCTGCAAAGACATTGCTTGGGGTC
AAAACAGGGAAGTGGGAAGAATAGGACTGGCTTTACAACATCACATTTGTGGGGAATTGATTGAAGAGTTGGTAAAAGATTTGAAATTTTGTTCTACTTTTAATTATAGT
TCACTACCATTCGAGGCTTGCAAGAGAAGACTATGTTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGACAGAACAGTTTCTTATCATCGTCTTCTCAAATCGAAATTTCTTCCGATAACCTCCGCCGCTCCAAATCCTTCGGCTGCGTTTCCGGCCTCCTCCATTTCCT
CTCCAACAACCAGAGCCGCCGCAACAAATCCATCACATTCGTACACAACAATAATCAAGAACTTCACGATGCAATTTCCGAGACGAAATCGTCTCCAATCCGCATTGTTC
CTCAAAATGCCTCATCCGATTCGCGGCGGATCTCCGGCCAGATTCCGGAGCTTGAGCGATCCGAATCCCTAATTTCGACGGAGGAGAACTTCCGCGGAGCGAGAGGTCCG
ATTGTGCGACTTATGGGACTGGAAAGTTGCGCGGCAACCGCCGCGACGGAGAAGCAGAGGCAGGTAATGGATGCTTTGGAGAAATGCGAACAGGACCTGAAGGCGCTGAA
GGACTTCATCGACTCGTTTGAGTCGGCGCCGGCGGAGAGTTTCCGATCGTCGTCTCCGGCCGGTGCCGGAAAAGGAATTGAACTGTTTTCCGGCGGAAGTTATCGGGATT
CGATGAATTGGGAGCAGCAGCAGCAGCCGAGTCCGGTATCGGTGCTTGAGGAGATTAGTCGGCCGCGGCGTTTTGGTAACGTCCATGGCTGCCATAGCCTTTTCTCCGAC
CGGCCATCTGCAAATTCTGGAATAACAATGCAACAAATTCAAAGGAAGAAGGCAGGAGAAGAGGACATCTTCAAACTAAGCAAATTTGAGAGGATTGAAGTAGTAGTTGG
TAACTTCAACTTCCTCAAAGGTGAAAAACCAGTTGAATCCCCATTTTGTAGTAGCAAAAAGGCAATGAGAGACAGCGTAGAAGGAGTCTGCAAAGACATTGCTTGGGGTC
AAAACAGGGAAGTGGGAAGAATAGGACTGGCTTTACAACATCACATTTGTGGGGAATTGATTGAAGAGTTGGTAAAAGATTTGAAATTTTGTTCTACTTTTAATTATAGT
TCACTACCATTCGAGGCTTGCAAGAGAAGACTATGTTTGTAG
Protein sequenceShow/hide protein sequence
MKRQNSFLSSSSQIEISSDNLRRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRISGQIPELERSESLISTEENFRGARGP
IVRLMGLESCAATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDSMNWEQQQQPSPVSVLEEISRPRRFGNVHGCHSLFSD
RPSANSGITMQQIQRKKAGEEDIFKLSKFERIEVVVGNFNFLKGEKPVESPFCSSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYS
SLPFEACKRRLCL