; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g01830 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g01830
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:1373329..1383034
RNA-Seq ExpressionMoc03g01830
SyntenyMoc03g01830
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]1.4e-8557.23Show/hide
Query:  SKANRSRGAGFRSEDQAVRDEVHGHRSHHLGPVEEERPGGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEF
        S+  ++ GA  RSE++  R+++H  R HHLGPV++  P G +DEEYTHQ GDL EHLN+KRSSSLRKG+SPSCSHRNSNQQAESSYNPITPEGVITREEF
Subjt:  SKANRSRGAGFRSEDQAVRDEVHGHRSHHLGPVEEERPGGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEF

Query:  DQLKSKFDAQ-------------------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFRR-------QQMQSSVA--------
        DQLKSKFDAQ                               A IP KFKTPTMKPYDGSKDPKDYVEVFE    F+        +  Q ++         
Subjt:  DQLKSKFDAQ-------------------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFRR-------QQMQSSVA--------

Query:  --PSRSRSPAAR---------ACGIEDCRPGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAK
          P+RS S  ++         +    D +      T RQKEGETLREYVTRFQ+EQLKVAHCSD S MCYFLT LA+ETLTVKL +EAPATF +VL+KAK
Subjt:  --PSRSRSPAAR---------ACGIEDCRPGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAK

Query:  KFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGSSSSGS
        K IDGQELLRTKT +PEK+IDQ + +++K K DSK++DKG SS  S
Subjt:  KFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGSSSSGS

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.9e-8051.23Show/hide
Query:  MGQPANSTNTVDRRALAANDGHQREVGAEVVEGQVHEGLGTEPLRRSARITTPVLPPAHPKPSKANRSRGAGFRSEDQAVRDEVHGHRSHHLGPVEEERP
        M QPANSTNT DRRALAAN GHQREVGAEVVEGQ HE LGTEPL RSARITTPVLPPAHPKPSK                                    
Subjt:  MGQPANSTNTVDRRALAANDGHQREVGAEVVEGQVHEGLGTEPLRRSARITTPVLPPAHPKPSKANRSRGAGFRSEDQAVRDEVHGHRSHHLGPVEEERP

Query:  GGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ----------------------------
                                                   AESSYNPITP GVITREEFDQLKSKFDAQ                            
Subjt:  GGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ----------------------------

Query:  ---APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------RQQMQSSVAPSRSRSPAARACGIEDCRP-----------GRSRPTP----R
           A IPPKFKTPTMKPYDGSKDPKDYVEVFE+   F+           +  +  S      R PA         R             R  PT     R
Subjt:  ---APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------RQQMQSSVAPSRSRSPAARACGIEDCRP-----------GRSRPTP----R

Query:  QKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKD
        QKEGETLREYVTRF +EQLKVAHCSDDS MCYFLTGLA+ETLTVKL +EAPATFA+VL+K KK IDGQELLRTKTG+PEK IDQ +  ++K K DSKS+D
Subjt:  QKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKD

Query:  KGSSSSGS
        KG SSS S
Subjt:  KGSSSSGS

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia]2.4e-7759.02Show/hide
Query:  KDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ-------------------------------
        +DEEYTHQ GDL EHLN+KRSSSLRKG+SPS SHRNSNQQAESSYNPITP+ VITREEFDQLKSKFDAQ                               
Subjt:  KDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ-------------------------------

Query:  APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------------RQQMQSSVAPSRSRSPAAR---------ACGIEDCRPGRSRPTPRQKE
        API PKFKTPTMKPYDGSK+PKDYV+VFE    F+                   ++     P+RS S  ++         +    D +      T RQK+
Subjt:  APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------------RQQMQSSVAPSRSRSPAAR---------ACGIEDCRPGRSRPTPRQKE

Query:  GETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGS
        GETLREYVTRFQ+EQLKVAHCSDDS MCYFLTGLA++TLTVKLG+EAPATFA+VL+KAKK IDGQELLRTKTG+PEK+IDQ+++ ++K K  SKS+DKG 
Subjt:  GETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGS

Query:  SSSGS
        SSS S
Subjt:  SSSGS

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]8.8e-7253.69Show/hide
Query:  GAGFRSEDQAVRDEVHGHRSHHLGPVEEERPGGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKF
        GA  RS DQ V ++VH     H  PV+EE           H  GDL +HLN+KR+SS R  R+ +  H+NSNQQAESSYNPI PEGVITREEF+QLKSKF
Subjt:  GAGFRSEDQAVRDEVHGHRSHHLGPVEEERPGGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKF

Query:  DAQ-------------------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFRR-------QQMQSSVA----------PSRSR
        DAQ                               A IPPKFKTPTMK YDGSKDPKDYVEVFE    F+        +  Q ++           P+RS 
Subjt:  DAQ-------------------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFRR-------QQMQSSVA----------PSRSR

Query:  SPAARA---------CGIEDCRPGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQE
        S  ++              D +      T RQKEG+TL+EY+TRFQ+EQLKV HCSDDS MCYFLTGLA+ET TVKLG+EA ATFA+VL+  KKFIDGQE
Subjt:  SPAARA---------CGIEDCRPGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQE

Query:  LLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGSSSSGS
        LLRTKT +PEKQIDQKK +Q+K K DSKSKDKGSSSS S
Subjt:  LLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGSSSSGS

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]1.9e-9856.06Show/hide
Query:  MGQPANSTNTVDRRALAANDGHQREVGAEVVEGQVHEGLGTEPLRRSARITTPVLPPAHPKPSKANRSRG------------AGFRSEDQAVRDEVHGHR
        M QP +STNT DRRAL ANDGHQREVGAEVVEGQ+HEGLGTEP  RSARITTP L PAHPKP KANR RG            A  R    A++ E+   R
Subjt:  MGQPANSTNTVDRRALAANDGHQREVGAEVVEGQVHEGLGTEPLRRSARITTPVLPPAHPKPSKANRSRG------------AGFRSEDQAVRDEVHGHR

Query:  SHHLGPVE------EERPGGDKDEEYT--HQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ--------
        +  L   E      +    G + E+     + GDL +HL++KRSSSLRKGRSPSCSH+NSNQQAESSYNP+ PEGVITREEFDQLKSKFDAQ        
Subjt:  SHHLGPVE------EERPGGDKDEEYT--HQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ--------

Query:  -----------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------RQQMQSSVAPSRSRSPAARACGIEDCR----
                               A IP KFKTPTMKPYDGSKDPKDYVEVFE    F+           +  + SS      R PA         R    
Subjt:  -----------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------RQQMQSSVAPSRSRSPAARACGIEDCR----

Query:  -----------PGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEK
                         T RQKE ETLREYVT FQ+EQLKVAH SDDS +CYFLT L +ETLTVKLG+EAPATFA+VL+KAKK IDGQEL RTKTG+ EK
Subjt:  -----------PGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEK

Query:  QIDQKKLNQEKMKVDSKSKDK
        QIDQKK +QEK K +SKSKDK
Subjt:  QIDQKKLNQEKMKVDSKSKDK

TrEMBL top hitse value%identityAlignment
A0A6J1DDW5 uncharacterized protein LOC1110196346.7e-8657.23Show/hide
Query:  SKANRSRGAGFRSEDQAVRDEVHGHRSHHLGPVEEERPGGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEF
        S+  ++ GA  RSE++  R+++H  R HHLGPV++  P G +DEEYTHQ GDL EHLN+KRSSSLRKG+SPSCSHRNSNQQAESSYNPITPEGVITREEF
Subjt:  SKANRSRGAGFRSEDQAVRDEVHGHRSHHLGPVEEERPGGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEF

Query:  DQLKSKFDAQ-------------------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFRR-------QQMQSSVA--------
        DQLKSKFDAQ                               A IP KFKTPTMKPYDGSKDPKDYVEVFE    F+        +  Q ++         
Subjt:  DQLKSKFDAQ-------------------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFRR-------QQMQSSVA--------

Query:  --PSRSRSPAAR---------ACGIEDCRPGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAK
          P+RS S  ++         +    D +      T RQKEGETLREYVTRFQ+EQLKVAHCSD S MCYFLT LA+ETLTVKL +EAPATF +VL+KAK
Subjt:  --PSRSRSPAAR---------ACGIEDCRPGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAK

Query:  KFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGSSSSGS
        K IDGQELLRTKT +PEK+IDQ + +++K K DSK++DKG SS  S
Subjt:  KFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGSSSSGS

A0A6J1DHB3 uncharacterized protein LOC1110204791.9e-8051.23Show/hide
Query:  MGQPANSTNTVDRRALAANDGHQREVGAEVVEGQVHEGLGTEPLRRSARITTPVLPPAHPKPSKANRSRGAGFRSEDQAVRDEVHGHRSHHLGPVEEERP
        M QPANSTNT DRRALAAN GHQREVGAEVVEGQ HE LGTEPL RSARITTPVLPPAHPKPSK                                    
Subjt:  MGQPANSTNTVDRRALAANDGHQREVGAEVVEGQVHEGLGTEPLRRSARITTPVLPPAHPKPSKANRSRGAGFRSEDQAVRDEVHGHRSHHLGPVEEERP

Query:  GGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ----------------------------
                                                   AESSYNPITP GVITREEFDQLKSKFDAQ                            
Subjt:  GGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ----------------------------

Query:  ---APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------RQQMQSSVAPSRSRSPAARACGIEDCRP-----------GRSRPTP----R
           A IPPKFKTPTMKPYDGSKDPKDYVEVFE+   F+           +  +  S      R PA         R             R  PT     R
Subjt:  ---APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------RQQMQSSVAPSRSRSPAARACGIEDCRP-----------GRSRPTP----R

Query:  QKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKD
        QKEGETLREYVTRF +EQLKVAHCSDDS MCYFLTGLA+ETLTVKL +EAPATFA+VL+K KK IDGQELLRTKTG+PEK IDQ +  ++K K DSKS+D
Subjt:  QKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKD

Query:  KGSSSSGS
        KG SSS S
Subjt:  KGSSSSGS

A0A6J1DM55 uncharacterized protein LOC1110222671.2e-7759.02Show/hide
Query:  KDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ-------------------------------
        +DEEYTHQ GDL EHLN+KRSSSLRKG+SPS SHRNSNQQAESSYNPITP+ VITREEFDQLKSKFDAQ                               
Subjt:  KDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ-------------------------------

Query:  APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------------RQQMQSSVAPSRSRSPAAR---------ACGIEDCRPGRSRPTPRQKE
        API PKFKTPTMKPYDGSK+PKDYV+VFE    F+                   ++     P+RS S  ++         +    D +      T RQK+
Subjt:  APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------------RQQMQSSVAPSRSRSPAAR---------ACGIEDCRPGRSRPTPRQKE

Query:  GETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGS
        GETLREYVTRFQ+EQLKVAHCSDDS MCYFLTGLA++TLTVKLG+EAPATFA+VL+KAKK IDGQELLRTKTG+PEK+IDQ+++ ++K K  SKS+DKG 
Subjt:  GETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGS

Query:  SSSGS
        SSS S
Subjt:  SSSGS

A0A6J1DPN4 uncharacterized protein LOC1110230604.2e-7253.69Show/hide
Query:  GAGFRSEDQAVRDEVHGHRSHHLGPVEEERPGGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKF
        GA  RS DQ V ++VH     H  PV+EE           H  GDL +HLN+KR+SS R  R+ +  H+NSNQQAESSYNPI PEGVITREEF+QLKSKF
Subjt:  GAGFRSEDQAVRDEVHGHRSHHLGPVEEERPGGDKDEEYTHQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKF

Query:  DAQ-------------------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFRR-------QQMQSSVA----------PSRSR
        DAQ                               A IPPKFKTPTMK YDGSKDPKDYVEVFE    F+        +  Q ++           P+RS 
Subjt:  DAQ-------------------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFRR-------QQMQSSVA----------PSRSR

Query:  SPAARA---------CGIEDCRPGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQE
        S  ++              D +      T RQKEG+TL+EY+TRFQ+EQLKV HCSDDS MCYFLTGLA+ET TVKLG+EA ATFA+VL+  KKFIDGQE
Subjt:  SPAARA---------CGIEDCRPGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQE

Query:  LLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGSSSSGS
        LLRTKT +PEKQIDQKK +Q+K K DSKSKDKGSSSS S
Subjt:  LLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGSSSSGS

A0A6J1DZJ1 uncharacterized protein LOC1110257389.1e-9956.06Show/hide
Query:  MGQPANSTNTVDRRALAANDGHQREVGAEVVEGQVHEGLGTEPLRRSARITTPVLPPAHPKPSKANRSRG------------AGFRSEDQAVRDEVHGHR
        M QP +STNT DRRAL ANDGHQREVGAEVVEGQ+HEGLGTEP  RSARITTP L PAHPKP KANR RG            A  R    A++ E+   R
Subjt:  MGQPANSTNTVDRRALAANDGHQREVGAEVVEGQVHEGLGTEPLRRSARITTPVLPPAHPKPSKANRSRG------------AGFRSEDQAVRDEVHGHR

Query:  SHHLGPVE------EERPGGDKDEEYT--HQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ--------
        +  L   E      +    G + E+     + GDL +HL++KRSSSLRKGRSPSCSH+NSNQQAESSYNP+ PEGVITREEFDQLKSKFDAQ        
Subjt:  SHHLGPVE------EERPGGDKDEEYT--HQSGDLPEHLNKKRSSSLRKGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQ--------

Query:  -----------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------RQQMQSSVAPSRSRSPAARACGIEDCR----
                               A IP KFKTPTMKPYDGSKDPKDYVEVFE    F+           +  + SS      R PA         R    
Subjt:  -----------------------APIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFR-----------RQQMQSSVAPSRSRSPAARACGIEDCR----

Query:  -----------PGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEK
                         T RQKE ETLREYVT FQ+EQLKVAH SDDS +CYFLT L +ETLTVKLG+EAPATFA+VL+KAKK IDGQEL RTKTG+ EK
Subjt:  -----------PGRSRPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEK

Query:  QIDQKKLNQEKMKVDSKSKDK
        QIDQKK +QEK K +SKSKDK
Subjt:  QIDQKKLNQEKMKVDSKSKDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAGCGGTCGACGTCATCTCCTGAAGGGTCTCTTTCAAGATAAGGATTTGGAGTCATCATCAATGATCTACAAGGCATTGGTTGTTGGAGGATCTCCAATCACGTC
GTCGGAGGTCGTCGGTGTCTGTCATCGCAGACTGAACTCGCTGGAGCGGGTTGTGTCAGAATGTCACTGTAATGTTGCTGCCACGGACATGAACTGTCGCCGGTCAAAAC
CTACAATGTCGTCGCCGGAGTCTGGGCATCGGTCGGAGGTTGCTACGCTAGTGGGAGTTAGATCGCTGGAGGTTGTTGCTGTGGGTGCCGTTCGTGGAGGTTATTGGAAG
TGTGAGGATAGTGGGGGTAAAAATATCTTGGACACGTGGAGGATGCTTACTGGTGTCCAAGGGTTAAAGAATCCCAGGTCCACCTCGAACCTACCGAAGAGGTCGAGGTC
ACCCATTGGAAAAAGGCTAAGTCCACTCTTGTGTTCAGGTCGAGCCGGGGACCGGGTTCGAGCTTGGTTCGTAAAGTATCGTTGTGCAGACCTTTGGATAAACATTTGGC
GTCGTCTGTGGGAAAAGGACGATCTAAGTCATCCCGATCTAAAAGCACGAACAAAAATGGGTCAGCCAGCAAACTCAACCAACACGGTGGACCGAAGAGCTCTGGCTGCC
AATGATGGCCACCAGAGGGAGGTCGGGGCAGAGGTGGTAGAGGGACAGGTTCATGAAGGCCTGGGGACCGAGCCTCTCCGCAGGTCGGCACGCATCACCACGCCCGTTCT
GCCACCAGCACATCCAAAGCCATCCAAGGCCAATCGCAGCCGAGGCGCTGGGTTTCGATCTGAGGACCAAGCGGTACGCGACGAGGTACACGGGCACAGAAGTCATCACC
TCGGCCCAGTCGAGGAAGAGCGCCCTGGAGGAGATAAGGACGAAGAGTACACTCATCAAAGTGGCGATCTTCCCGAACACCTTAACAAAAAGAGAAGCTCGTCCCTCCGA
AAGGGACGATCTCCGTCCTGCTCGCACAGGAACTCCAACCAGCAGGCCGAGTCCTCCTATAACCCCATAACTCCCGAGGGAGTGATCACAAGGGAGGAGTTCGACCAGCT
AAAGAGCAAGTTTGATGCTCAGGCTCCGATCCCTCCAAAATTCAAGACTCCCACCATGAAGCCTTATGACGGGTCTAAGGACCCAAAAGACTATGTTGAGGTCTTCGAAG
CCTCATGGATTTTCAGGCGGCAACAGATGCAATCAAGTGTCGCGCCTTCCAGATCGCGCTCACCGGCAGCGCGCGCCTGTGGTATCGAAGATTGCCGGCCAGGTCGATCT
CGACCTACTCCCAGGCAGAAAGAAGGAGAGACGTTGAGAGAATATGTCACAAGGTTCCAGGACGAGCAGTTAAAGGTCGCACACTGCTCTGATGACTCAGTCATGTGCTA
CTTTCTCACCGGCTTGGCCAATGAGACCCTCACTGTGAAGCTCGGAGATGAGGCTCCAGCAACCTTCGCCAAAGTTTTGAAAAAGGCAAAGAAATTCATTGATGGACAAG
AGCTTCTTCGAACAAAGACTGGCCAACCTGAGAAGCAGATCGACCAGAAAAAGCTAAACCAAGAGAAGATGAAGGTTGATTCCAAGTCAAAGGACAAGGGATCGTCCTCT
TCCGGTAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAGCGGTCGACGTCATCTCCTGAAGGGTCTCTTTCAAGATAAGGATTTGGAGTCATCATCAATGATCTACAAGGCATTGGTTGTTGGAGGATCTCCAATCACGTC
GTCGGAGGTCGTCGGTGTCTGTCATCGCAGACTGAACTCGCTGGAGCGGGTTGTGTCAGAATGTCACTGTAATGTTGCTGCCACGGACATGAACTGTCGCCGGTCAAAAC
CTACAATGTCGTCGCCGGAGTCTGGGCATCGGTCGGAGGTTGCTACGCTAGTGGGAGTTAGATCGCTGGAGGTTGTTGCTGTGGGTGCCGTTCGTGGAGGTTATTGGAAG
TGTGAGGATAGTGGGGGTAAAAATATCTTGGACACGTGGAGGATGCTTACTGGTGTCCAAGGGTTAAAGAATCCCAGGTCCACCTCGAACCTACCGAAGAGGTCGAGGTC
ACCCATTGGAAAAAGGCTAAGTCCACTCTTGTGTTCAGGTCGAGCCGGGGACCGGGTTCGAGCTTGGTTCGTAAAGTATCGTTGTGCAGACCTTTGGATAAACATTTGGC
GTCGTCTGTGGGAAAAGGACGATCTAAGTCATCCCGATCTAAAAGCACGAACAAAAATGGGTCAGCCAGCAAACTCAACCAACACGGTGGACCGAAGAGCTCTGGCTGCC
AATGATGGCCACCAGAGGGAGGTCGGGGCAGAGGTGGTAGAGGGACAGGTTCATGAAGGCCTGGGGACCGAGCCTCTCCGCAGGTCGGCACGCATCACCACGCCCGTTCT
GCCACCAGCACATCCAAAGCCATCCAAGGCCAATCGCAGCCGAGGCGCTGGGTTTCGATCTGAGGACCAAGCGGTACGCGACGAGGTACACGGGCACAGAAGTCATCACC
TCGGCCCAGTCGAGGAAGAGCGCCCTGGAGGAGATAAGGACGAAGAGTACACTCATCAAAGTGGCGATCTTCCCGAACACCTTAACAAAAAGAGAAGCTCGTCCCTCCGA
AAGGGACGATCTCCGTCCTGCTCGCACAGGAACTCCAACCAGCAGGCCGAGTCCTCCTATAACCCCATAACTCCCGAGGGAGTGATCACAAGGGAGGAGTTCGACCAGCT
AAAGAGCAAGTTTGATGCTCAGGCTCCGATCCCTCCAAAATTCAAGACTCCCACCATGAAGCCTTATGACGGGTCTAAGGACCCAAAAGACTATGTTGAGGTCTTCGAAG
CCTCATGGATTTTCAGGCGGCAACAGATGCAATCAAGTGTCGCGCCTTCCAGATCGCGCTCACCGGCAGCGCGCGCCTGTGGTATCGAAGATTGCCGGCCAGGTCGATCT
CGACCTACTCCCAGGCAGAAAGAAGGAGAGACGTTGAGAGAATATGTCACAAGGTTCCAGGACGAGCAGTTAAAGGTCGCACACTGCTCTGATGACTCAGTCATGTGCTA
CTTTCTCACCGGCTTGGCCAATGAGACCCTCACTGTGAAGCTCGGAGATGAGGCTCCAGCAACCTTCGCCAAAGTTTTGAAAAAGGCAAAGAAATTCATTGATGGACAAG
AGCTTCTTCGAACAAAGACTGGCCAACCTGAGAAGCAGATCGACCAGAAAAAGCTAAACCAAGAGAAGATGAAGGTTGATTCCAAGTCAAAGGACAAGGGATCGTCCTCT
TCCGGTAGCTGA
Protein sequenceShow/hide protein sequence
MQSGRRHLLKGLFQDKDLESSSMIYKALVVGGSPITSSEVVGVCHRRLNSLERVVSECHCNVAATDMNCRRSKPTMSSPESGHRSEVATLVGVRSLEVVAVGAVRGGYWK
CEDSGGKNILDTWRMLTGVQGLKNPRSTSNLPKRSRSPIGKRLSPLLCSGRAGDRVRAWFVKYRCADLWINIWRRLWEKDDLSHPDLKARTKMGQPANSTNTVDRRALAA
NDGHQREVGAEVVEGQVHEGLGTEPLRRSARITTPVLPPAHPKPSKANRSRGAGFRSEDQAVRDEVHGHRSHHLGPVEEERPGGDKDEEYTHQSGDLPEHLNKKRSSSLR
KGRSPSCSHRNSNQQAESSYNPITPEGVITREEFDQLKSKFDAQAPIPPKFKTPTMKPYDGSKDPKDYVEVFEASWIFRRQQMQSSVAPSRSRSPAARACGIEDCRPGRS
RPTPRQKEGETLREYVTRFQDEQLKVAHCSDDSVMCYFLTGLANETLTVKLGDEAPATFAKVLKKAKKFIDGQELLRTKTGQPEKQIDQKKLNQEKMKVDSKSKDKGSSS
SGS