; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g1085 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g1085
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionVARLMGL domain-containing protein
Genome locationMC05:14745780..14749503
RNA-Seq ExpressionMC05g1085
SyntenyMC05g1085
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017598.1 hypothetical protein SDJN02_19464 [Cucurbita argyrosperma subsp. argyrosperma]1.84e-11360.7Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLSNN  SRRNKSITFVHN+NQE L +A SE+K   I   P+  S  +  IS  + P+
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE

Query:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD
        + ERSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ +S   ESFR SSP   GK IE         
Subjt:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD

Query:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITMQQ-----IQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC
         M WEQQQ PSPVSVLEE+SRPR F N H C+ S    +PSAN G    Q     +QRKK  E D IF +SKFER ++   VVGN+   K EK  ESP C
Subjt:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITMQQ-----IQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC

Query:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
        S ++AM+DS+EGVCKDI+WG   EVGRIGLALQH I G+L+EELVKD     TF Y+SLPFEAC+RRLC
Subjt:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

XP_022935193.1 uncharacterized protein LOC111442147 [Cucurbita moschata]9.16e-11460.43Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI--P
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLSNN  SRRNKSITFVHN+NQE L +A SE+K   I   P+  S+ +  IS  +  P
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI--P

Query:  ELERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD
          ERSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ +S   ESFR SSP   GK IE         
Subjt:  ELERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD

Query:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITMQQ-----IQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC
         M WEQQQ PSPVSVLEE+SRPR F N H C+ S    +PSAN G    Q     +QRKK  E D IF +SKFER ++   VVGN+   K EK  ESP C
Subjt:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITMQQ-----IQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC

Query:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
          ++AM+DS+EGVCKDI+WG   EVGRIGLALQH I G+L+EELVKD     TF Y+SLPFEAC+RRLC
Subjt:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

XP_022983018.1 uncharacterized protein LOC111481687 [Cucurbita maxima]9.16e-11461.25Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLSNN  SRRNKSITFVHN+NQE L +A SE+K   I   P+  S+ +  IS  + P+
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE

Query:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD
        + ERSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ ES   ESFR SSP   GK IE         
Subjt:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD

Query:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITMQQ-----IQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC
         M WEQ Q PSPVSVLEE+SRPR F N H C+ S    +PSAN G    Q     +QRKK  E D IF +SKFER ++   VVGN   LK EK  ESP C
Subjt:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITMQQ-----IQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC

Query:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
          ++AM+DSVEGVCKDI+WG   EVGRIGLALQH I G+LIEELVKD     TF Y+SLPFEAC+RRLC
Subjt:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

XP_023528398.1 uncharacterized protein LOC111791339 [Cucurbita pepo subsp. pepo]4.39e-11260Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ---SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLSNN    SRRNKSITFVHN+NQE L +A SE+K   I   P+  S+ +  IS  + 
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ---SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-

Query:  PELE-RSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSY
        P++E RSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ +S   ESFR SSP   GK IE       
Subjt:  PELE-RSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSY

Query:  RDSMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITM----QQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPF
           M WEQQQ PSPVSVLEE+SRPR F N H C+ S    +PSAN G       QQ+QRKK  E D IF +SKFER ++   VVGN+   K EK  ESP 
Subjt:  RDSMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITM----QQIQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPF

Query:  CSSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
        C  ++AM+DS+EGVCKDI+WG   EVGRIG+ALQH I G+L+EELVKD     TF  +SLPFEAC+RRLC
Subjt:  CSSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

XP_038905750.1 uncharacterized protein LOC120091709 [Benincasa hispida]2.74e-10659.79Show/hide
Query:  MKRQNSFLSSSSQIEISSDN-LRRS------KSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRISGQ-IPEL
        MKRQNSFLSSSSQ+EISSDN LRRS      +SFGCVS LLHFLSNN  SRRNKSITFVHN+  EL +AIS+   S I   P++ S+ +  IS    P++
Subjt:  MKRQNSFLSSSSQIEISSDN-LRRS------KSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRISGQ-IPEL

Query:  -ERSESLISTE--ENFRGARGPIVRLMGLESCAAT----AATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDS
         ERSES +ST   E FRGARGPIVRLMGLES  A     AA EKQRQ+M+ALEKCEQDLK LK+FI++FES   ESFRSSSPAG GK IEL         
Subjt:  -ERSESLISTE--ENFRGARGPIVRLMGLESCAAT----AATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDS

Query:  MNWEQQQQPSPV--SVLEEISRPRRFGNVHGCHSLFSDRPSANSGIT----MQQIQRKKA-GEEDIFK-LSKFERI-----EVVVGNFNFLKGEKPVE-S
        M  +Q+++ SPV  SV+EE+SR R F N     ++F  RPSANSG      +QQ+QRKK  G+  +F  LSKF+       E+V+GN+   K EK  E S
Subjt:  MNWEQQQQPSPV--SVLEEISRPRRFGNVHGCHSLFSDRPSANSGIT----MQQIQRKKA-GEEDIFK-LSKFERI-----EVVVGNFNFLKGEKPVE-S

Query:  PFCSSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNY-SSLPFEACKRRLC
        P C SK AMRDSVE V K+I+WGQN+E+GRIGLALQ+ ICG+LIEELVKDL +  T  Y SSLPFEACKRRLC
Subjt:  PFCSSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNY-SSLPFEACKRRLC

TrEMBL top hitse value%identityAlignment
A0A1S3B6W0 uncharacterized protein LOC1034868181.72e-10256.42Show/hide
Query:  MKRQNSFLSSSS--QIEISSDNL------RRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNA-----SSDSRRISGQ
        MKRQNSFLSSSS  Q++ISSDNL      RRS+SFGCVS LLHFLSN+  RRNKSITFVHN+  EL D IS +K+SP       A     SSDS RI+  
Subjt:  MKRQNSFLSSSS--QIEISSDNL------RRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNA-----SSDSRRISGQ

Query:  IPELERSESLISTE--ENFRGARGPIVRLMGLESCAATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDSM
            ERS+S +ST   E FRGARGPIVRLMGLES  +TA  EKQRQVM+ALEKCE+DLKALK+FID+FES   ESFRS SPAG GK IEL         M
Subjt:  IPELERSESLISTE--ENFRGARGPIVRLMGLESCAATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDSM

Query:  NWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGIT----MQQIQRKKAGEED-----IFKLSKFERI-----EVVVGNFNFLKGEKPVES
          +QQ++ SPV+  EE+S P    N HG + S+   RPSANSG      +QQ+QRKK  +++     +  +SKF+R      E+V+GN+   +  K +  
Subjt:  NWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGIT----MQQIQRKKAGEED-----IFKLSKFERI-----EVVVGNFNFLKGEKPVES

Query:  PFCSSKK-AMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNY-SSLPFEACKRRLC
          C S K AMR+SVE V KDI WGQ +E+GRIGL LQ+ ICG+LIEELVKDL F  TF Y +SLPF+ACKR LC
Subjt:  PFCSSKK-AMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNY-SSLPFEACKRRLC

A0A5A7TPF4 Uncharacterized protein3.83e-6360.08Show/hide
Query:  MKRQNSFLSSSS--QIEISSDNL------RRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNA-----SSDSRRISGQ
        MKRQNSFLSSSS  Q++ISSDNL      RRS+SFGCVS LLHFLSN+  RRNKSITFVHN+  EL D IS +K+SP       A     SSDS RI+  
Subjt:  MKRQNSFLSSSS--QIEISSDNL------RRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNA-----SSDSRRISGQ

Query:  IPELERSESLISTE--ENFRGARGPIVRLMGLESCAATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDSM
            ERS+S +ST   E FRGARGPIVRLMGLES  +TA  EKQRQVM+ALEKCE+DLKALK+FID+FES   ESFRS SPAG GK IEL         M
Subjt:  IPELERSESLISTE--ENFRGARGPIVRLMGLESCAATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDSM

Query:  NWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSG
          +QQ++ SPV+  EE+S P    N HG + S+   RPSANSG
Subjt:  NWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSG

A0A6J1DLL1 uncharacterized protein LOC1110215698.03e-72100Show/hide
Query:  MKRQNSFLSSSSQIEISSDNLRRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRISGQIPELERSESLIST
        MKRQNSFLSSSSQIEISSDNLRRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRISGQIPELERSESLIST
Subjt:  MKRQNSFLSSSSQIEISSDNLRRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRISGQIPELERSESLIST

Query:  EENFRGARGPIVRLMGLESCA
        EENFRGARGPIVRLMGLESCA
Subjt:  EENFRGARGPIVRLMGLESCA

A0A6J1F9V5 uncharacterized protein LOC1114421474.44e-11460.43Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI--P
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLSNN  SRRNKSITFVHN+NQE L +A SE+K   I   P+  S+ +  IS  +  P
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI--P

Query:  ELERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD
          ERSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ +S   ESFR SSP   GK IE         
Subjt:  ELERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD

Query:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITMQQ-----IQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC
         M WEQQQ PSPVSVLEE+SRPR F N H C+ S    +PSAN G    Q     +QRKK  E D IF +SKFER ++   VVGN+   K EK  ESP C
Subjt:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITMQQ-----IQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC

Query:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
          ++AM+DS+EGVCKDI+WG   EVGRIGLALQH I G+L+EELVKD     TF Y+SLPFEAC+RRLC
Subjt:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

A0A6J1J6K2 uncharacterized protein LOC1114816874.44e-11461.25Show/hide
Query:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE
        MKRQNSFLSSSSQ+EISSD+L       RRSKSFGCVS LLHFLSNN  SRRNKSITFVHN+NQE L +A SE+K   I   P+  S+ +  IS  + P+
Subjt:  MKRQNSFLSSSSQIEISSDNL-------RRSKSFGCVSGLLHFLSNNQ-SRRNKSITFVHNNNQE-LHDAISETKSSPIRIVPQNASSDSRRISGQI-PE

Query:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD
        + ERSES IST   E F GARGPIVRLMGLES     +   A EKQR+VM+ALEKCEQDLKALK+FID+ ES   ESFR SSP   GK IE         
Subjt:  L-ERSESLISTE--ENFRGARGPIVRLMGLESC----AATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRD

Query:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITMQQ-----IQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC
         M WEQ Q PSPVSVLEE+SRPR F N H C+ S    +PSAN G    Q     +QRKK  E D IF +SKFER ++   VVGN   LK EK  ESP C
Subjt:  SMNWEQQQQPSPVSVLEEISRPRRFGNVHGCH-SLFSDRPSANSGITMQQ-----IQRKKAGEED-IFKLSKFERIEV---VVGNFNFLKGEKPVESPFC

Query:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC
          ++AM+DSVEGVCKDI+WG   EVGRIGLALQH I G+LIEELVKD     TF Y+SLPFEAC+RRLC
Subjt:  SSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYSSLPFEACKRRLC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G56810.1 unknown protein1.1e-0938.71Show/hide
Query:  KKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFC-------------STFNY--------------SSLPFEACKRRL
        ++++ +SV  VC D+A GQ REV +IGLAL  HIC +LI E V++L F              S+  Y              +SLPF+AC+RRL
Subjt:  KKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFC-------------STFNY--------------SSLPFEACKRRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGACAGAACAGTTTCTTATCATCGTCTTCTCAAATCGAAATTTCTTCCGATAACCTCCGCCGCTCCAAATCCTTCGGCTGCGTTTCCGGCCTCCTCCATTTCCT
CTCCAACAACCAGAGCCGCCGCAACAAATCCATCACATTCGTACACAACAATAATCAAGAACTTCACGATGCAATTTCCGAGACGAAATCGTCTCCAATCCGCATTGTTC
CTCAAAATGCCTCATCCGATTCGCGGCGGATCTCCGGCCAGATTCCGGAGCTTGAGCGATCCGAATCCCTAATTTCGACGGAGGAGAACTTCCGCGGAGCGAGAGGTCCG
ATTGTGCGACTTATGGGACTGGAAAGTTGCGCGGCAACCGCCGCGACGGAGAAGCAGAGGCAGGTAATGGATGCTTTGGAGAAATGCGAACAGGACCTGAAGGCGCTGAA
GGACTTCATCGACTCGTTTGAGTCGGCGCCGGCGGAGAGTTTCCGATCGTCGTCTCCGGCCGGTGCCGGAAAAGGAATTGAACTGTTTTCCGGCGGAAGTTATCGGGATT
CGATGAATTGGGAGCAGCAGCAGCAGCCGAGTCCGGTATCGGTGCTTGAGGAGATTAGTCGGCCGCGGCGTTTTGGTAACGTCCATGGCTGCCATAGCCTTTTCTCCGAC
CGGCCATCTGCAAATTCTGGAATAACAATGCAACAAATTCAAAGGAAGAAGGCAGGAGAAGAGGACATCTTCAAACTAAGCAAATTTGAGAGGATTGAAGTAGTAGTTGG
TAACTTCAACTTCCTCAAAGGTGAAAAACCAGTTGAATCCCCATTTTGTAGTAGCAAAAAGGCAATGAGAGACAGCGTAGAAGGAGTCTGCAAAGACATTGCTTGGGGTC
AAAACAGGGAAGTGGGAAGAATAGGACTGGCTTTACAACATCACATTTGTGGGGAATTGATTGAAGAGTTGGTAAAAGATTTGAAATTTTGTTCTACTTTTAATTATAGT
TCACTACCATTCGAGGCTTGCAAGAGAAGACTATGTTTGTAG
mRNA sequenceShow/hide mRNA sequence
AAGGAAAAGAAAAAAAAATCCCTCCATTTATGATGCCGTGAAGGTCTAATATGGGAGAGAAAATCTTTGGGCAAAAGGGGTTTTAGAGTTTTGAAGATCAATGAAGAGAC
AGAACAGTTTCTTATCATCGTCTTCTCAAATCGAAATTTCTTCCGATAACCTCCGCCGCTCCAAATCCTTCGGCTGCGTTTCCGGCCTCCTCCATTTCCTCTCCAACAAC
CAGAGCCGCCGCAACAAATCCATCACATTCGTACACAACAATAATCAAGAACTTCACGATGCAATTTCCGAGACGAAATCGTCTCCAATCCGCATTGTTCCTCAAAATGC
CTCATCCGATTCGCGGCGGATCTCCGGCCAGATTCCGGAGCTTGAGCGATCCGAATCCCTAATTTCGACGGAGGAGAACTTCCGCGGAGCGAGAGGTCCGATTGTGCGAC
TTATGGGACTGGAAAGTTGCGCGGCAACCGCCGCGACGGAGAAGCAGAGGCAGGTAATGGATGCTTTGGAGAAATGCGAACAGGACCTGAAGGCGCTGAAGGACTTCATC
GACTCGTTTGAGTCGGCGCCGGCGGAGAGTTTCCGATCGTCGTCTCCGGCCGGTGCCGGAAAAGGAATTGAACTGTTTTCCGGCGGAAGTTATCGGGATTCGATGAATTG
GGAGCAGCAGCAGCAGCCGAGTCCGGTATCGGTGCTTGAGGAGATTAGTCGGCCGCGGCGTTTTGGTAACGTCCATGGCTGCCATAGCCTTTTCTCCGACCGGCCATCTG
CAAATTCTGGAATAACAATGCAACAAATTCAAAGGAAGAAGGCAGGAGAAGAGGACATCTTCAAACTAAGCAAATTTGAGAGGATTGAAGTAGTAGTTGGTAACTTCAAC
TTCCTCAAAGGTGAAAAACCAGTTGAATCCCCATTTTGTAGTAGCAAAAAGGCAATGAGAGACAGCGTAGAAGGAGTCTGCAAAGACATTGCTTGGGGTCAAAACAGGGA
AGTGGGAAGAATAGGACTGGCTTTACAACATCACATTTGTGGGGAATTGATTGAAGAGTTGGTAAAAGATTTGAAATTTTGTTCTACTTTTAATTATAGTTCACTACCAT
TCGAGGCTTGCAAGAGAAGACTATGTTTGTAGTGTCTTTTTGGCTTGTCAATATAGCATAAACTTATCCATTCTTTGATTTCATAGGTGATAAGTATTTGATATATATAT
ATTTTCTTTTTTTTTGAGTTGAATATATAGGAGTGGAGATTTGAACTTCCAACCTTTTGATTGAGAATATATATCTTAACC
Protein sequenceShow/hide protein sequence
MKRQNSFLSSSSQIEISSDNLRRSKSFGCVSGLLHFLSNNQSRRNKSITFVHNNNQELHDAISETKSSPIRIVPQNASSDSRRISGQIPELERSESLISTEENFRGARGP
IVRLMGLESCAATAATEKQRQVMDALEKCEQDLKALKDFIDSFESAPAESFRSSSPAGAGKGIELFSGGSYRDSMNWEQQQQPSPVSVLEEISRPRRFGNVHGCHSLFSD
RPSANSGITMQQIQRKKAGEEDIFKLSKFERIEVVVGNFNFLKGEKPVESPFCSSKKAMRDSVEGVCKDIAWGQNREVGRIGLALQHHICGELIEELVKDLKFCSTFNYS
SLPFEACKRRLCL