; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g17440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g17440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr5:12951519..12956865
RNA-Seq ExpressionMoc05g17440
SyntenyMoc05g17440
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]4.9e-5027.24Show/hide
Query:  LTFDPEIERTVKRIRREQRLRKEK-ETQKEKEVEEEETIEMNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSS
        L +DPEIE+T KR+RREQRL+K++ + QKEKE E    +  N                             +PNPI +AD RD  MR+Y      +LNSS
Subjt:  LTFDPEIERTVKRIRREQRLRKEK-ETQKEKEVEEEETIEMNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSS

Query:  ------------------------------------------------------------------SSQDGARTWLNALEPNSINTWAELTEKFLAKYHT
                                                                          S    A  WLNA   ++I T +++ +KFL KY  
Subjt:  ------------------------------------------------------------------SSQDGARTWLNALEPNSINTWAELTEKFLAKYHT

Query:  LTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------ASNGSLLEKSVNEIVDIFNKMTDINDQ--GKIGRSL
         TRNAD+RE+I+SFRQK+NEAV  AWE FK+L+R CP+ G+P                      A+NG    KS NEIV+I +++++ NDQ   +  R+ 
Subjt:  LTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------ASNGSLLEKSVNEIVDIFNKMTDINDQ--GKIGRSL

Query:  PKKQVSAGVFELDTVASMQAQMAAMNQMLKQLEKETKTVVSA-----------IPE--------------------------------------------
         K+   A V  LD + SMQ Q+  + QMLK +EK       A           I E                                            
Subjt:  PKKQVSAGVFELDTVASMQAQMAAMNQMLKQLEKETKTVVSA-----------IPE--------------------------------------------

Query:  ---------------PSHPQQYNQQRGQ-NTTQQSGSNASL---------EAMMKEFMTRTDATI-----------------RSLEMQVGQIANDRKSRP
                       P  PQQYNQQ+      QQ+ SN  +         +A MKE MTRTDATI                 R+LEMQ+GQ+AN+ ++RP
Subjt:  ---------------PSHPQQYNQQRGQ-NTTQQSGSNASL---------EAMMKEFMTRTDATI-----------------RSLEMQVGQIANDRKSRP

Query:  RG--PSLLDGGNDAVTPVHAFTSNPQ--QEEKAEPVISEEKGKKPDKCK---------------------------------------QVVTSTTPQVDI
        +G  PS  +     V+P  +   + Q   ++  EP +S     +   C+                                       Q+ T      DI
Subjt:  RG--PSLLDGGNDAVTPVHAFTSNPQ--QEEKAEPVISEEKGKKPDKCK---------------------------------------QVVTSTTPQVDI

Query:  ISRRKKLGEHETVALTKCSSDALGNPLPVKCNDP------------------------------------------------------------------
        I+R+KKLGE+ETVALT+CSS+   +  P K  DP                                                                  
Subjt:  ISRRKKLGEHETVALTKCSSDALGNPLPVKCNDP------------------------------------------------------------------

Query:  -------------------DLEVSIIFRRPFLATGDTVFNIRKEEITMKINDEQVTFNVLDSMRLPDEVEECSTIE----AIMEELQQMMVEDLEADLEV
                           D +V II  RPFLATG+T+ +++K E+TM+++D++VTFN+LD+M+ PD+ EEC  I         EL  ++  ++EA+LE 
Subjt:  -------------------DLEVSIIFRRPFLATGDTVFNIRKEEITMKINDEQVTFNVLDSMRLPDEVEECSTIE----AIMEELQQMMVEDLEADLEV

Query:  VEKE
         EKE
Subjt:  VEKE

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]5.4e-4960.63Show/hide
Query:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSSSSQ---DGARTWLNALEPNSINTWAELTEKFLAKYHTLTR
        MNRN QDPPP QNPPVNGDMAGEGAANR GEIPN ILLADNRDV MRNYVT AFHNLNS  +      A+  L  +  + + T        + ++  LT 
Subjt:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSSSSQ---DGARTWLNALEPNSINTWAELTEKFLAKYHTLTR

Query:  NADLREDIVSFRQKKN----EAVQEAWEHFKELLRRCPSHGL-PASNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQM
        N D    + SF +  N      V E     K  L R     L  A+NGSLLEKSVNEIVDI NKM DINDQG+ GRSL KKQVSAG+FELDTVA MQAQM
Subjt:  NADLREDIVSFRQKKN----EAVQEAWEHFKELLRRCPSHGL-PASNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQM

Query:  AAMNQMLKQ--LEKETKTVVS
        AAMNQMLKQ  +EKETKTV S
Subjt:  AAMNQMLKQ--LEKETKTVVS

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]4.6e-8049.63Show/hide
Query:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSSSSQDGARTWLNALEPNS-INTWAELTEKFLAKYHT-----
        MN NPQDPP   NPPV+GD AGEGAANR GE+PNPILL DNRDV +RNYVT+AFHNLNS    DG     +  +P S + ++ E+   F     +     
Subjt:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSSSSQDGARTWLNALEPNS-INTWAELTEKFLAKYHT-----

Query:  LTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------ASNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPK
        L  NADLREDIVSFRQK+NEAVQE WE FKELLRRC SHGLP                      A+N SL EKS++EI+DI NKMTD NDQG+IGRSLPK
Subjt:  LTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------ASNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPK

Query:  KQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVVSAIPEP------------------------------------------------------
        KQVSA VFELDTVASMQAQMA +NQMLKQL  EKETKT  SA+ EP                                                      
Subjt:  KQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVVSAIPEP------------------------------------------------------

Query:  -------------------------------------SHPQQYNQQRGQNTTQQSGSNASLEAMMKEFMTRTDAT-----------IRSLEMQVGQIAND
                                             S PQQYNQQR QNTTQQ GSN SLEAM KEFMTR++AT           IR LEMQVGQIAND
Subjt:  -------------------------------------SHPQQYNQQRGQNTTQQSGSNASLEAMMKEFMTRTDAT-----------IRSLEMQVGQIAND

Query:  RKSRPRG
        +KSRP+G
Subjt:  RKSRPRG

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]7.0e-8962.87Show/hide
Query:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSS----------------------------------------
        MNRN QDPPP QNPPVNGDMAGE AANRVGEIPN ILLADNRDV MRNYVT+AFHNLNS                                         
Subjt:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSS----------------------------------------

Query:  --------------------------SSQDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHG
                                  S +DGARTW+NALEPNSINTWAELT+KFLAKYHTLT+NADLREDIVSFRQK+NEAVQEAWE FKELLRRCPSHG
Subjt:  --------------------------SSQDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHG

Query:  LPA----------------------SNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVV
        LP+                      +NGSLLEKSVNEIVD+ NKMTDINDQG++GRSLPKKQVS G+FELDTVASMQAQMAAMNQMLKQL  EKETKTV 
Subjt:  LPA----------------------SNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVV

Query:  SAIPEPS
        SAIPE S
Subjt:  SAIPEPS

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]3.4e-5164.84Show/hide
Query:  SSQDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------AS
        S +DGA TW+N LE N I TWAELT+KFLAKYHTLTRNADL+EDIVSFRQ+++EAVQEAWE FKELL+RC SHGLP                      A+
Subjt:  SSQDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------AS

Query:  NGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVVSA-IPEPS
        N SLLEKSVNEI+DI NKM DINDQ ++GRSLPKKQ SAG+FELDTV S+QAQ++AM+QMLKQL  +K  K   S  I EPS
Subjt:  NGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVVSA-IPEPS

TrEMBL top hitse value%identityAlignment
A0A6J1CPJ3 uncharacterized protein LOC1110129475.3e-5027.24Show/hide
Query:  LTFDPEIERTVKRIRREQRLRKEK-ETQKEKEVEEEETIEMNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSS
        L +DPEIE+T KR+RREQRL+K++ + QKEKE E    +  N                             +PNPI +AD RD  MR+Y      +LNSS
Subjt:  LTFDPEIERTVKRIRREQRLRKEK-ETQKEKEVEEEETIEMNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSS

Query:  ------------------------------------------------------------------SSQDGARTWLNALEPNSINTWAELTEKFLAKYHT
                                                                          S    A  WLNA   ++I T +++ +KFL KY  
Subjt:  ------------------------------------------------------------------SSQDGARTWLNALEPNSINTWAELTEKFLAKYHT

Query:  LTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------ASNGSLLEKSVNEIVDIFNKMTDINDQ--GKIGRSL
         TRNAD+RE+I+SFRQK+NEAV  AWE FK+L+R CP+ G+P                      A+NG    KS NEIV+I +++++ NDQ   +  R+ 
Subjt:  LTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------ASNGSLLEKSVNEIVDIFNKMTDINDQ--GKIGRSL

Query:  PKKQVSAGVFELDTVASMQAQMAAMNQMLKQLEKETKTVVSA-----------IPE--------------------------------------------
         K+   A V  LD + SMQ Q+  + QMLK +EK       A           I E                                            
Subjt:  PKKQVSAGVFELDTVASMQAQMAAMNQMLKQLEKETKTVVSA-----------IPE--------------------------------------------

Query:  ---------------PSHPQQYNQQRGQ-NTTQQSGSNASL---------EAMMKEFMTRTDATI-----------------RSLEMQVGQIANDRKSRP
                       P  PQQYNQQ+      QQ+ SN  +         +A MKE MTRTDATI                 R+LEMQ+GQ+AN+ ++RP
Subjt:  ---------------PSHPQQYNQQRGQ-NTTQQSGSNASL---------EAMMKEFMTRTDATI-----------------RSLEMQVGQIANDRKSRP

Query:  RG--PSLLDGGNDAVTPVHAFTSNPQ--QEEKAEPVISEEKGKKPDKCK---------------------------------------QVVTSTTPQVDI
        +G  PS  +     V+P  +   + Q   ++  EP +S     +   C+                                       Q+ T      DI
Subjt:  RG--PSLLDGGNDAVTPVHAFTSNPQ--QEEKAEPVISEEKGKKPDKCK---------------------------------------QVVTSTTPQVDI

Query:  ISRRKKLGEHETVALTKCSSDALGNPLPVKCNDP------------------------------------------------------------------
        I+R+KKLGE+ETVALT+CSS+   +  P K  DP                                                                  
Subjt:  ISRRKKLGEHETVALTKCSSDALGNPLPVKCNDP------------------------------------------------------------------

Query:  -------------------DLEVSIIFRRPFLATGDTVFNIRKEEITMKINDEQVTFNVLDSMRLPDEVEECSTIE----AIMEELQQMMVEDLEADLEV
                           D +V II  RPFLATG+T+ +++K E+TM+++D++VTFN+LD+M+ PD+ EEC  I         EL  ++  ++EA+LE 
Subjt:  -------------------DLEVSIIFRRPFLATGDTVFNIRKEEITMKINDEQVTFNVLDSMRLPDEVEECSTIE----AIMEELQQMMVEDLEADLEV

Query:  VEKE
         EKE
Subjt:  VEKE

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220072.6e-4960.63Show/hide
Query:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSSSSQ---DGARTWLNALEPNSINTWAELTEKFLAKYHTLTR
        MNRN QDPPP QNPPVNGDMAGEGAANR GEIPN ILLADNRDV MRNYVT AFHNLNS  +      A+  L  +  + + T        + ++  LT 
Subjt:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSSSSQ---DGARTWLNALEPNSINTWAELTEKFLAKYHTLTR

Query:  NADLREDIVSFRQKKN----EAVQEAWEHFKELLRRCPSHGL-PASNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQM
        N D    + SF +  N      V E     K  L R     L  A+NGSLLEKSVNEIVDI NKM DINDQG+ GRSL KKQVSAG+FELDTVA MQAQM
Subjt:  NADLREDIVSFRQKKN----EAVQEAWEHFKELLRRCPSHGL-PASNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQM

Query:  AAMNQMLKQ--LEKETKTVVS
        AAMNQMLKQ  +EKETKTV S
Subjt:  AAMNQMLKQ--LEKETKTVVS

A0A6J1DYY9 uncharacterized protein LOC1110255571.3e-5165.38Show/hide
Query:  SSQDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------AS
        S +DGA TWLN LE N I TWAELT+KFLAKYHTLTRNADL+EDIVSFRQ+++EAVQEAWE FKELL+RC SHGLP                      A+
Subjt:  SSQDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------AS

Query:  NGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVVSA-IPEPS
        N SLLEKSVNEI+DI NKM DINDQ ++GRSLPKKQ SAG+FELDTV S+QAQ++AM+QMLKQL  +K  K   S  I EPS
Subjt:  NGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVVSA-IPEPS

A0A6J1DZ19 uncharacterized protein LOC1110248242.2e-8049.63Show/hide
Query:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSSSSQDGARTWLNALEPNS-INTWAELTEKFLAKYHT-----
        MN NPQDPP   NPPV+GD AGEGAANR GE+PNPILL DNRDV +RNYVT+AFHNLNS    DG     +  +P S + ++ E+   F     +     
Subjt:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSSSSQDGARTWLNALEPNS-INTWAELTEKFLAKYHT-----

Query:  LTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------ASNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPK
        L  NADLREDIVSFRQK+NEAVQE WE FKELLRRC SHGLP                      A+N SL EKS++EI+DI NKMTD NDQG+IGRSLPK
Subjt:  LTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLP----------------------ASNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPK

Query:  KQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVVSAIPEP------------------------------------------------------
        KQVSA VFELDTVASMQAQMA +NQMLKQL  EKETKT  SA+ EP                                                      
Subjt:  KQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVVSAIPEP------------------------------------------------------

Query:  -------------------------------------SHPQQYNQQRGQNTTQQSGSNASLEAMMKEFMTRTDAT-----------IRSLEMQVGQIAND
                                             S PQQYNQQR QNTTQQ GSN SLEAM KEFMTR++AT           IR LEMQVGQIAND
Subjt:  -------------------------------------SHPQQYNQQRGQNTTQQSGSNASLEAMMKEFMTRTDAT-----------IRSLEMQVGQIAND

Query:  RKSRPRG
        +KSRP+G
Subjt:  RKSRPRG

A0A6J1E251 uncharacterized protein LOC1110253023.4e-8962.87Show/hide
Query:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSS----------------------------------------
        MNRN QDPPP QNPPVNGDMAGE AANRVGEIPN ILLADNRDV MRNYVT+AFHNLNS                                         
Subjt:  MNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNPILLADNRDVVMRNYVTYAFHNLNSS----------------------------------------

Query:  --------------------------SSQDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHG
                                  S +DGARTW+NALEPNSINTWAELT+KFLAKYHTLT+NADLREDIVSFRQK+NEAVQEAWE FKELLRRCPSHG
Subjt:  --------------------------SSQDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHG

Query:  LPA----------------------SNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVV
        LP+                      +NGSLLEKSVNEIVD+ NKMTDINDQG++GRSLPKKQVS G+FELDTVASMQAQMAAMNQMLKQL  EKETKTV 
Subjt:  LPA----------------------SNGSLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQMAAMNQMLKQL--EKETKTVV

Query:  SAIPEPS
        SAIPE S
Subjt:  SAIPEPS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGTCTCGCTACTGTTTACCATAACTATGATGAAGTCCATAAGAGGCTTCCTTGGGTTCTTCAGCCTGAATTCTCATCGGGGGATGTTTTTCAATGCTCCG
ATTTCGAACAAGAACTTGAAGGATCACCGTTTCTTCGTTCGCGGGCAGTGGCTAGTTGGTGACAGTCTGATGCGTTCTAAAAAAATTCTTGCAATTCCTTGGGCC
CAAAGAGTCCCCACTCTGATCATAACGAATGAAAAGTTGGCTGAACATCACCTGGTCGGGGATCCTGATGGCTCAATCATAGGCAACGATGAGCATCCTACTCGT
AGAGTGGCTCCAATAGGTAGGACGTTGAATCTTCTCCTTCCAACGTTCGACTATCGTTGCTTCCGGTTTGTTTGTTCATATCCTACGAAGTCACCATCCGTCTCA
CCCACAAGACGTGAACTCGTAATTGTATCTCTTCTTCAGATTGGCCAAGTTTCTCCGGACGGTATGGCCATTGTCGAAAGCGCTACCTCGACCAATTCCGCCACC
CTCGTTCCCTCGCTCAAGCGACGTTGTCTGGTGAGGTCGGGCGACGCTCCACCTTCGTCTGACCCTCCTCCTGTGGTGGAGCCAGTCTTCCGAATTTTAGAGATC
GTTGAGTCTAGCTCGAGCGATGCGACTTTTCGGGCGCCTTCTACTAACAAGGAGTTGCAGCTGATGGTGAGGGACGTTCCAAGTCCAAGTAGAAGAGCGACAGGT
TCTAGGGATACAGTTCTGGATGGGGGGGCCATTGATGATCCCACTTCTGAGAACTTTCGATCAGTGGTCCCTTCCACATTGTGCGGGAGTACAGATATGGGGTTT
TTGATGCATGGAAAGGACAGAGCTACTTATCAGCGTATCGAGTTAGATCGTGTCCAATCCCGAGTAGCTTTTGCTGAGGCCCGTATTAGAGAGCTGGAGAAGGAT
GTTACCCTAGCCTCTGAACAAACCAAAAACGGTAGTGAAAGAGTAGAACTCAAATCCCAAGAAAAGCCAGGAATTGCGCCTGGGTCTGGTTGGTTGACCTTTGAT
CCAGAGATTGAGAGAACCGTTAAGAGGATAAGGCGTGAGCAGAGGTTGAGAAAAGAGAAAGAAACTCAAAAAGAGAAAGAAGTTGAAGAAGAAGAGACCATCGAG
ATGAATAGAAATCCACAAGATCCTCCACCTTCACAAAATCCACCTGTGAATGGAGATATGGCCGGTGAAGGAGCAGCAAACCGAGTAGGAGAAATTCCCAATCCA
ATCCTTCTAGCAGACAACCGAGATGTAGTCATGCGGAATTATGTCACTTATGCGTTCCACAACCTAAATTCGAGCTCAAGTCAAGATGGTGCAAGGACTTGGCTA
AACGCACTAGAACCAAATTCTATCAACACATGGGCGGAACTAACGGAGAAATTTTTGGCAAAGTACCACACTTTGACCAGGAATGCAGACCTTCGAGAGGACATT
GTGTCTTTTAGACAGAAGAAGAACGAAGCAGTTCAAGAAGCTTGGGAGCATTTTAAGGAATTACTGAGAAGATGCCCGAGCCATGGATTGCCTGCATCCAATGGC
TCGTTGTTAGAGAAATCGGTAAATGAGATCGTTGATATCTTCAATAAGATGACGGATATTAATGACCAAGGCAAAATAGGAAGGTCATTACCAAAGAAGCAAGTA
TCAGCCGGAGTCTTTGAGTTAGACACAGTAGCGTCAATGCAAGCCCAAATGGCGGCTATGAACCAGATGTTAAAGCAGTTGGAGAAGGAAACCAAAACCGTCGTT
TCAGCGATACCTGAACCCTCCCATCCTCAACAATACAATCAGCAAAGAGGTCAAAATACTACTCAGCAAAGTGGTAGCAATGCAAGTTTGGAGGCCATGATGAAA
GAGTTCATGACAAGAACTGATGCTACAATAAGAAGCTTGGAGATGCAAGTGGGGCAGATAGCAAATGACCGAAAATCTAGACCCCGAGGACCCTCACTTCTAGAT
GGAGGAAATGATGCAGTTACACCTGTTCATGCATTCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACCTGTAATTTCAGAAGAAAAAGGTAAGAAGCCGGAT
AAATGTAAGCAAGTAGTGACCAGCACTACTCCACAGGTAGACATCATATCTAGGCGTAAGAAGTTAGGTGAGCATGAGACGGTAGCCTTAACAAAGTGTAGTAGT
GATGCTCTAGGGAATCCATTGCCTGTTAAATGTAATGACCCAGACCTTGAGGTGTCGATCATTTTTAGGAGGCCATTTTTAGCAACTGGAGATACGGTATTCAAC
ATTAGGAAAGAAGAGATCACAATGAAGATCAATGATGAGCAGGTAACCTTCAATGTCCTTGATTCGATGCGGCTCCCGGATGAAGTCGAGGAGTGCTCTACAATA
GAGGCAATCATGGAGGAACTCCAGCAAATGATGGTGGAAGACTTAGAAGCAGATTTGGAGGTCGTAGAAAAAGAAGCCTGGCACAATTTTGCCCCAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGTCTCGCTACTGTTTACCATAACTATGATGAAGTCCATAAGAGGCTTCCTTGGGTTCTTCAGCCTGAATTCTCATCGGGGGATGTTTTTCAATGCTCCG
ATTTCGAACAAGAACTTGAAGGATCACCGTTTCTTCGTTCGCGGGCAGTGGCTAGTTGGTGACAGTCTGATGCGTTCTAAAAAAATTCTTGCAATTCCTTGGGCC
CAAAGAGTCCCCACTCTGATCATAACGAATGAAAAGTTGGCTGAACATCACCTGGTCGGGGATCCTGATGGCTCAATCATAGGCAACGATGAGCATCCTACTCGT
AGAGTGGCTCCAATAGGTAGGACGTTGAATCTTCTCCTTCCAACGTTCGACTATCGTTGCTTCCGGTTTGTTTGTTCATATCCTACGAAGTCACCATCCGTCTCA
CCCACAAGACGTGAACTCGTAATTGTATCTCTTCTTCAGATTGGCCAAGTTTCTCCGGACGGTATGGCCATTGTCGAAAGCGCTACCTCGACCAATTCCGCCACC
CTCGTTCCCTCGCTCAAGCGACGTTGTCTGGTGAGGTCGGGCGACGCTCCACCTTCGTCTGACCCTCCTCCTGTGGTGGAGCCAGTCTTCCGAATTTTAGAGATC
GTTGAGTCTAGCTCGAGCGATGCGACTTTTCGGGCGCCTTCTACTAACAAGGAGTTGCAGCTGATGGTGAGGGACGTTCCAAGTCCAAGTAGAAGAGCGACAGGT
TCTAGGGATACAGTTCTGGATGGGGGGGCCATTGATGATCCCACTTCTGAGAACTTTCGATCAGTGGTCCCTTCCACATTGTGCGGGAGTACAGATATGGGGTTT
TTGATGCATGGAAAGGACAGAGCTACTTATCAGCGTATCGAGTTAGATCGTGTCCAATCCCGAGTAGCTTTTGCTGAGGCCCGTATTAGAGAGCTGGAGAAGGAT
GTTACCCTAGCCTCTGAACAAACCAAAAACGGTAGTGAAAGAGTAGAACTCAAATCCCAAGAAAAGCCAGGAATTGCGCCTGGGTCTGGTTGGTTGACCTTTGAT
CCAGAGATTGAGAGAACCGTTAAGAGGATAAGGCGTGAGCAGAGGTTGAGAAAAGAGAAAGAAACTCAAAAAGAGAAAGAAGTTGAAGAAGAAGAGACCATCGAG
ATGAATAGAAATCCACAAGATCCTCCACCTTCACAAAATCCACCTGTGAATGGAGATATGGCCGGTGAAGGAGCAGCAAACCGAGTAGGAGAAATTCCCAATCCA
ATCCTTCTAGCAGACAACCGAGATGTAGTCATGCGGAATTATGTCACTTATGCGTTCCACAACCTAAATTCGAGCTCAAGTCAAGATGGTGCAAGGACTTGGCTA
AACGCACTAGAACCAAATTCTATCAACACATGGGCGGAACTAACGGAGAAATTTTTGGCAAAGTACCACACTTTGACCAGGAATGCAGACCTTCGAGAGGACATT
GTGTCTTTTAGACAGAAGAAGAACGAAGCAGTTCAAGAAGCTTGGGAGCATTTTAAGGAATTACTGAGAAGATGCCCGAGCCATGGATTGCCTGCATCCAATGGC
TCGTTGTTAGAGAAATCGGTAAATGAGATCGTTGATATCTTCAATAAGATGACGGATATTAATGACCAAGGCAAAATAGGAAGGTCATTACCAAAGAAGCAAGTA
TCAGCCGGAGTCTTTGAGTTAGACACAGTAGCGTCAATGCAAGCCCAAATGGCGGCTATGAACCAGATGTTAAAGCAGTTGGAGAAGGAAACCAAAACCGTCGTT
TCAGCGATACCTGAACCCTCCCATCCTCAACAATACAATCAGCAAAGAGGTCAAAATACTACTCAGCAAAGTGGTAGCAATGCAAGTTTGGAGGCCATGATGAAA
GAGTTCATGACAAGAACTGATGCTACAATAAGAAGCTTGGAGATGCAAGTGGGGCAGATAGCAAATGACCGAAAATCTAGACCCCGAGGACCCTCACTTCTAGAT
GGAGGAAATGATGCAGTTACACCTGTTCATGCATTCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACCTGTAATTTCAGAAGAAAAAGGTAAGAAGCCGGAT
AAATGTAAGCAAGTAGTGACCAGCACTACTCCACAGGTAGACATCATATCTAGGCGTAAGAAGTTAGGTGAGCATGAGACGGTAGCCTTAACAAAGTGTAGTAGT
GATGCTCTAGGGAATCCATTGCCTGTTAAATGTAATGACCCAGACCTTGAGGTGTCGATCATTTTTAGGAGGCCATTTTTAGCAACTGGAGATACGGTATTCAAC
ATTAGGAAAGAAGAGATCACAATGAAGATCAATGATGAGCAGGTAACCTTCAATGTCCTTGATTCGATGCGGCTCCCGGATGAAGTCGAGGAGTGCTCTACAATA
GAGGCAATCATGGAGGAACTCCAGCAAATGATGGTGGAAGACTTAGAAGCAGATTTGGAGGTCGTAGAAAAAGAAGCCTGGCACAATTTTGCCCCAATTTGA
Protein sequenceShow/hide protein sequence
MTVSLLFTITMMKSIRGFLGFFSLNSHRGMFFNAPISNKNLKDHRFFVRGQWLVGDSLMRSKKILAIPWAQRVPTLIITNEKLAEHHLVGDPDGSIIGNDEHPTR
RVAPIGRTLNLLLPTFDYRCFRFVCSYPTKSPSVSPTRRELVIVSLLQIGQVSPDGMAIVESATSTNSATLVPSLKRRCLVRSGDAPPSSDPPPVVEPVFRILEI
VESSSSDATFRAPSTNKELQLMVRDVPSPSRRATGSRDTVLDGGAIDDPTSENFRSVVPSTLCGSTDMGFLMHGKDRATYQRIELDRVQSRVAFAEARIRELEKD
VTLASEQTKNGSERVELKSQEKPGIAPGSGWLTFDPEIERTVKRIRREQRLRKEKETQKEKEVEEEETIEMNRNPQDPPPSQNPPVNGDMAGEGAANRVGEIPNP
ILLADNRDVVMRNYVTYAFHNLNSSSSQDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNADLREDIVSFRQKKNEAVQEAWEHFKELLRRCPSHGLPASNG
SLLEKSVNEIVDIFNKMTDINDQGKIGRSLPKKQVSAGVFELDTVASMQAQMAAMNQMLKQLEKETKTVVSAIPEPSHPQQYNQQRGQNTTQQSGSNASLEAMMK
EFMTRTDATIRSLEMQVGQIANDRKSRPRGPSLLDGGNDAVTPVHAFTSNPQQEEKAEPVISEEKGKKPDKCKQVVTSTTPQVDIISRRKKLGEHETVALTKCSS
DALGNPLPVKCNDPDLEVSIIFRRPFLATGDTVFNIRKEEITMKINDEQVTFNVLDSMRLPDEVEECSTIEAIMEELQQMMVEDLEADLEVVEKEAWHNFAPI