; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0009879 (gene) of Chayote v1 genome

Gene IDSed0009879
OrganismSechium edule (Chayote v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationLG01:1261280..1262254
RNA-Seq ExpressionSed0009879
SyntenySed0009879
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN62167.1 hypothetical protein VITISV_007470 [Vitis vinifera]7.6e-8451.23Show/hide
Query:  RKLRRSPPPRRNVDYATDPSPSQSFYASNEDDYD------------------------ASESI----NFQPTNSAESAAET-----GSSYINIAPLPVFR
        RK  R     +  D  T+ SPSQS Y  +E++ D                        A ESI    +F+P++S  S++ +      SSYINIAPLP+FR
Subjt:  RKLRRSPPPRRNVDYATDPSPSQSFYASNEDDYD------------------------ASESI----NFQPTNSAESAAET-----GSSYINIAPLPVFR

Query:  GGADECPAMHLSRFAKVCRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQ
        G +DECP  HLSRF KVCRANN ++V+ +MRIFPVTL+GEAALWYDLNIEPY  +SWEE+KS F++A+ R+ LTD+L+SELM INQ  EE+VRSYFLRLQ
Subjt:  GGADECPAMHLSRFAKVCRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQ

Query:  LILKKWPMGYELSDGLLKGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLW-KSREEKKAV
         ILK+WP  + L DGLL+GIF+DGLR++F++W+IPQKPSSLNEALRLAF +E+V+S+R     R   CGFC G H E  CE+RERMR LW KS+++ +  
Subjt:  LILKKWPMGYELSDGLLKGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLW-KSREEKKAV

Query:  D---------EATAELTRSVSARSRNEAAEED-------GGKKKSQCQCGKHQCGMKKLDRSFSMVS
                  E   E   SV   SR+    E+       G KKKSQCQCGKHQC  KKL+R+ S+++
Subjt:  D---------EATAELTRSVSARSRNEAAEED-------GGKKKSQCQCGKHQCGMKKLDRSFSMVS

EEF44287.1 conserved hypothetical protein [Ricinus communis]4.6e-8148.31Show/hide
Query:  ARKLRRSPPPRRNVDYATDPSPSQSFYASNEDDYDASESINFQP---------------TNSAESAAETGSSYINIAPLPVFRGGADECPAMHLSRFAKV
        A+  R+S       DY+   SPSQS Y SN+DD +  +    QP               ++S+ S ++  +SYIN+APLPVF G ++ECP  HLSRF KV
Subjt:  ARKLRRSPPPRRNVDYATDPSPSQSFYASNEDDYDASESINFQP---------------TNSAESAAETGSSYINIAPLPVFRGGADECPAMHLSRFAKV

Query:  CRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMGYELSDGLL
        CRANNA++ D MMRIFPVTLE EAALWYDLNI+PYP +SW+E+   F+EA+ RI+L DQL+S+LM +NQ  +E+VRSYF+RLQ ILK+WP  + LSD +L
Subjt:  CRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMGYELSDGLL

Query:  KGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLWKSREEKKAV-------DEATAELTRSV
        K IF+DGL   FK+W+IP KP+SLNEALRLAF FEQV+S+R   + + ++CGFCEG H+E  C VRE+MR L+++ ++K  +        EA  E+  + 
Subjt:  KGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLWKSREEKKAV-------DEATAELTRSV

Query:  SARSRNEAAEEDGGKK-------------KSQCQCGKHQCGMKKLDRSFSMVSKSS
          +   E  E D G               KS CQC KH C MKK +RS S+ +++S
Subjt:  SARSRNEAAEEDGGKK-------------KSQCQCGKHQCGMKKLDRSFSMVSKSS

EXB78111.1 hypothetical protein L484_004813 [Morus notabilis]4.9e-8352.97Show/hide
Query:  PRRNVDYATDPSPSQSFYASN--EDDYDA-SESINFQPTNSAES-----------AAETG-SSYINIAPLPVFRGGADECPAMHLSRFAKVCRANNAAAV
        P  + D   D S + +  A+N   D + + SE IN +  + + S            ++TG +SY+NIA  P+FRGG++ECP  HLSRFAKVCRANN +++
Subjt:  PRRNVDYATDPSPSQSFYASN--EDDYDA-SESINFQPTNSAES-----------AAETG-SSYINIAPLPVFRGGADECPAMHLSRFAKVCRANNAAAV

Query:  DTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMGYELSDGLLKGIFMDGLR
        D MM+IFPVTLE EAALWYDLN+EPY  +SWEE+KS F  A+ +IELT+QL+S+LMTINQ D E+VRSYFLRLQ ILKKWP  + LSD LLKG+F+DGLR
Subjt:  DTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMGYELSDGLLKGIFMDGLR

Query:  EEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLW----------KSREEKKAVD--EATAELTRSVS---
         +F+EWM PQKP SLN+ALRLAF FEQV+S+R   RN  ++CGFC G H+E  CEVRERMR LW          K   E+  ++  E   EL RSVS   
Subjt:  EEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLW----------KSREEKKAVD--EATAELTRSVS---

Query:  ARS-----RNEAAEEDG---------GKKKSQCQCGKHQCGMKKLDRSFSMVS
        +RS     +N+  EEDG          KK+SQCQCGKHQC  K ++R+ S VS
Subjt:  ARS-----RNEAAEEDG---------GKKKSQCQCGKHQCGMKKLDRSFSMVS

KAF3973300.1 hypothetical protein CMV_003263 [Castanea mollissima]7.4e-7957.14Show/hide
Query:  SSYINIAPLPVFRGGADECPAMHLSRFAKVCRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQR
        +SYINIAP P+F G  +ECP  H+SRFAKVC ANN +  D MM IFPVTLE EAALWYDLNI+PYP ++WEE+KS F+ A+ +I++ DQL+SELM INQ 
Subjt:  SSYINIAPLPVFRGGADECPAMHLSRFAKVCRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQR

Query:  DEENVRSYFLRLQLILKKWPMGYELSDGLLKGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMR
        DEE+VRSYFLRLQ ILK+WP  + + DGLLKG+F+DGLREEF++W+ PQKP SL+EALRLAF FEQV+S+R   +   L+CGFC+G H+E  CEVRERMR
Subjt:  DEENVRSYFLRLQLILKKWPMGYELSDGLLKGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMR

Query:  LLWKSREEK----------KAVDEATAELTRSV-----SARSRNEAAEEDG--GKKKSQCQCGKHQCGMKKLDRSFSMVS
         LW+  +EK          ++ D+   EL RSV     S+  +N   EE G    KK+Q Q  K+Q  MKKL+R+ S++S
Subjt:  LLWKSREEK----------KAVDEATAELTRSV-----SARSRNEAAEEDG--GKKKSQCQCGKHQCGMKKLDRSFSMVS

KAG6604769.1 hypothetical protein SDJN03_02086, partial [Cucurbita argyrosperma subsp. sororia]5.2e-12570.87Show/hide
Query:  MARKLRRSPPPRRNVDYAT--DPSPSQSFYASNEDDYDASESINFQ-------------------PTN-SAESAAETGSSYINIAPLPVFRGGADECPAM
        MA KLRRSPPP R  +YAT  D S SQS  ASNEDDYDASES NFQ                   PTN  + +AA T   YINIAPLPVF GG+DECPA 
Subjt:  MARKLRRSPPPRRNVDYAT--DPSPSQSFYASNEDDYDASESINFQ-------------------PTN-SAESAAETGSSYINIAPLPVFRGGADECPAM

Query:  HLSRFAKVCRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMG
        HLSRFAKVCRANNAA+V+ MMRIFPVTL+GEA LWYDLNIEPYPP+SWEELKS F++A+++IEL +QL+SELMTI+QR EENVRSYFLRLQLILKKWP G
Subjt:  HLSRFAKVCRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMG

Query:  YELSDGLLKGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLWKSREEKKAVDEA------T
         ELSDG LK IFMDGLREEFKEWMIPQKP SLNEALRLAFG EQV  +RTS   RFLRCGFCEG H+E+VCEVRERMR LWKSRE+K   D A      T
Subjt:  YELSDGLLKGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLWKSREEKKAVDEA------T

Query:  AELTRSVSARSRNEA-AEEDGG-----KKKSQCQCGKHQCGMKKLDRSFSMVSKSSK
        AEL RSVSA SRNEA   +DGG     KKK QCQC KHQCGMKKLDR+ SM+SK+SK
Subjt:  AELTRSVSARSRNEA-AEEDGG-----KKKSQCQCGKHQCGMKKLDRSFSMVSKSSK

TrEMBL top hitse value%identityAlignment
A0A6A4LHT8 Retrotrans_gag domain-containing protein (Fragment)4.7e-7949.85Show/hide
Query:  KLRRSPPPRRNVDYATDPSPSQSFYASNEDDYDASESINFQPT--------NSAESAAETGSSYINIAPLPVFRGGADECPAMHLSRFAKVCRANNAAAV
        KL RSPP      Y  DPS  ++     E++  +  S N Q T        NS +   +T  SY+NIAP PVFRG   ECPA HL+RF+KVCRANN ++V
Subjt:  KLRRSPPPRRNVDYATDPSPSQSFYASNEDDYDASESINFQPT--------NSAESAAETGSSYINIAPLPVFRGGADECPAMHLSRFAKVCRANNAAAV

Query:  DTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMGYELSDGLLKGIFMDGLR
        + MM+IFPVTLE EAALWYDLN+EPYP ++WEE+KSLF +A+   +   +L+ EL+ +NQ   E+VRSYFLRLQ IL +WP G+ + D L+KGIF+DGLR
Subjt:  DTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMGYELSDGLLKGIFMDGLR

Query:  EEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLWKSREEKKAV------DEATAELTRSVSARSRNEAAE
         +FK+W++PQKP SL +ALRLAF +EQVR +R+ D     +CGFC+G H+E  CEVR RMR  W  RE +KA          T EL  +  +R   +  E
Subjt:  EEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLWKSREEKKAV------DEATAELTRSVSARSRNEAAE

Query:  EDG------GKKKSQCQCGKHQCGMKKLDRSFSMVSKSS
        E+       G+KKS C+C KHQCG K+L R+ S+V+K+S
Subjt:  EDG------GKKKSQCQCGKHQCGMKKLDRSFSMVSKSS

A0A7N2R9A7 Retrotrans_gag domain-containing protein4.1e-8360.36Show/hide
Query:  SSYINIAPLPVFRGGADECPAMHLSRFAKVCRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQR
        +SY+NIAP+P+F G  +ECP  H+SRFAKVC ANN +  D MMRIFPVTLE EAALWYDLNIEPYP ++WEE+KS F+ A+ +IE+ DQL+SELM INQ 
Subjt:  SSYINIAPLPVFRGGADECPAMHLSRFAKVCRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQR

Query:  DEENVRSYFLRLQLILKKWPMGYELSDGLLKGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMR
        DEE+VRSYFLRLQ ILK+WP  + +SDGLLKG+F+DGLREEF+ W+IPQKP SL+EALRLAFGFEQV+S+R   +   L+CGFC+G H+E  CEVRERMR
Subjt:  DEENVRSYFLRLQLILKKWPMGYELSDGLLKGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMR

Query:  LLWK-SREEKKAV---------DEATAELTRSV-----SARSRNEAAEEDG--GKKKSQCQCGKHQCGMKKLDRSFSMVS
         LW+ S+E+++AV         DE   EL RSV     S+  +N   EE G    KK+Q Q GK+Q  MKKL+R+ S++S
Subjt:  LLWK-SREEKKAV---------DEATAELTRSV-----SARSRNEAAEEDG--GKKKSQCQCGKHQCGMKKLDRSFSMVS

A5C7E6 Retrotrans_gag domain-containing protein6.3e-8451.23Show/hide
Query:  RKLRRSPPPRRNVDYATDPSPSQSFYASNEDDYD------------------------ASESI----NFQPTNSAESAAET-----GSSYINIAPLPVFR
        RK  R     +  D  T+ SPSQS Y  +E++ D                        A ESI    +F+P++S  S++ +      SSYINIAPLP+FR
Subjt:  RKLRRSPPPRRNVDYATDPSPSQSFYASNEDDYD------------------------ASESI----NFQPTNSAESAAET-----GSSYINIAPLPVFR

Query:  GGADECPAMHLSRFAKVCRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQ
        G +DECP  HLSRF KVCRANN ++V+ +MRIFPVTL+GEAALWYDLNIEPY  +SWEE+KS F++A+ R  LTD+L+SELM INQ  EE+VRSYFLRLQ
Subjt:  GGADECPAMHLSRFAKVCRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQ

Query:  LILKKWPMGYELSDGLLKGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLW-KSREEKKAV
         ILK+WP  + L DGLL+GIF+DGLR++F++W+IPQKPSSLNEALRLAF +E+V+S+R     R   CGFC G H E  CE+RERMR LW KS+++ +  
Subjt:  LILKKWPMGYELSDGLLKGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLW-KSREEKKAV

Query:  D---------EATAELTRSVSARSRNEAAEED-------GGKKKSQCQCGKHQCGMKKLDRSFSMVS
                  E   E   SV   SR+    E+       G KKKSQCQCGKHQC  KKL+R+ S+++
Subjt:  D---------EATAELTRSVSARSRNEAAEED-------GGKKKSQCQCGKHQCGMKKLDRSFSMVS

B9RWN5 Retrotrans_gag domain-containing protein2.2e-8148.31Show/hide
Query:  ARKLRRSPPPRRNVDYATDPSPSQSFYASNEDDYDASESINFQP---------------TNSAESAAETGSSYINIAPLPVFRGGADECPAMHLSRFAKV
        A+  R+S       DY+   SPSQS Y SN+DD +  +    QP               ++S+ S ++  +SYIN+APLPVF G ++ECP  HLSRF KV
Subjt:  ARKLRRSPPPRRNVDYATDPSPSQSFYASNEDDYDASESINFQP---------------TNSAESAAETGSSYINIAPLPVFRGGADECPAMHLSRFAKV

Query:  CRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMGYELSDGLL
        CRANNA++ D MMRIFPVTLE EAALWYDLNI+PYP +SW+E+   F+EA+ RI+L DQL+S+LM +NQ  +E+VRSYF+RLQ ILK+WP  + LSD +L
Subjt:  CRANNAAAVDTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMGYELSDGLL

Query:  KGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLWKSREEKKAV-------DEATAELTRSV
        K IF+DGL   FK+W+IP KP+SLNEALRLAF FEQV+S+R   + + ++CGFCEG H+E  C VRE+MR L+++ ++K  +        EA  E+  + 
Subjt:  KGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLWKSREEKKAV-------DEATAELTRSV

Query:  SARSRNEAAEEDGGKK-------------KSQCQCGKHQCGMKKLDRSFSMVSKSS
          +   E  E D G               KS CQC KH C MKK +RS S+ +++S
Subjt:  SARSRNEAAEEDGGKK-------------KSQCQCGKHQCGMKKLDRSFSMVSKSS

W9R9S0 Retrotrans_gag domain-containing protein2.4e-8352.97Show/hide
Query:  PRRNVDYATDPSPSQSFYASN--EDDYDA-SESINFQPTNSAES-----------AAETG-SSYINIAPLPVFRGGADECPAMHLSRFAKVCRANNAAAV
        P  + D   D S + +  A+N   D + + SE IN +  + + S            ++TG +SY+NIA  P+FRGG++ECP  HLSRFAKVCRANN +++
Subjt:  PRRNVDYATDPSPSQSFYASN--EDDYDA-SESINFQPTNSAES-----------AAETG-SSYINIAPLPVFRGGADECPAMHLSRFAKVCRANNAAAV

Query:  DTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMGYELSDGLLKGIFMDGLR
        D MM+IFPVTLE EAALWYDLN+EPY  +SWEE+KS F  A+ +IELT+QL+S+LMTINQ D E+VRSYFLRLQ ILKKWP  + LSD LLKG+F+DGLR
Subjt:  DTMMRIFPVTLEGEAALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMGYELSDGLLKGIFMDGLR

Query:  EEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLW----------KSREEKKAVD--EATAELTRSVS---
         +F+EWM PQKP SLN+ALRLAF FEQV+S+R   RN  ++CGFC G H+E  CEVRERMR LW          K   E+  ++  E   EL RSVS   
Subjt:  EEFKEWMIPQKPSSLNEALRLAFGFEQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLW----------KSREEKKAVD--EATAELTRSVS---

Query:  ARS-----RNEAAEEDG---------GKKKSQCQCGKHQCGMKKLDRSFSMVS
        +RS     +N+  EEDG          KK+SQCQCGKHQC  K ++R+ S VS
Subjt:  ARS-----RNEAAEEDG---------GKKKSQCQCGKHQCGMKKLDRSFSMVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCGAAAACTCCGGCGCTCACCGCCGCCGCGACGAAATGTCGACTACGCCACCGATCCTTCACCATCTCAATCTTTCTACGCATCAAACGAAGACGACTACGACGC
ATCGGAATCAATTAACTTCCAACCCACAAACTCTGCAGAATCCGCCGCCGAAACCGGTTCAAGTTACATCAACATTGCACCGTTGCCGGTTTTCCGCGGCGGGGCGGACG
AGTGTCCGGCAATGCATTTAAGCAGATTCGCAAAAGTTTGCCGTGCGAATAACGCCGCCGCCGTGGATACGATGATGAGGATATTTCCGGTGACGTTGGAGGGCGAGGCT
GCGCTTTGGTACGACTTGAACATCGAGCCGTACCCTCCGGTTTCATGGGAAGAATTGAAGTCTTTGTTCATGGAGGCGTTTAGTAGAATTGAATTGACTGATCAGTTGCA
ATCGGAGCTTATGACGATCAATCAACGGGATGAAGAGAATGTTCGTTCTTATTTTCTGAGGCTGCAATTGATTTTGAAGAAATGGCCGATGGGTTACGAACTTTCTGATG
GATTGTTGAAAGGGATTTTCATGGATGGATTGAGAGAAGAGTTTAAGGAATGGATGATTCCACAGAAACCCAGCTCTCTGAATGAGGCATTGAGACTTGCATTTGGGTTT
GAACAAGTTCGAAGCGTCCGTACATCGGATCGAAATCGGTTTCTCCGGTGCGGGTTTTGTGAGGGGCCGCATCAGGAAGTGGTTTGTGAGGTTAGGGAGAGAATGAGACT
GTTATGGAAGAGTAGGGAAGAGAAGAAAGCTGTTGATGAGGCCACGGCGGAGCTTACGAGATCGGTTTCGGCGAGAAGTAGAAATGAGGCGGCTGAAGAGGATGGTGGGA
AGAAGAAGAGTCAATGTCAGTGTGGGAAGCATCAGTGTGGGATGAAGAAATTGGATCGAAGTTTTAGCATGGTATCTAAAAGTTCTAAAGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCGAAAACTCCGGCGCTCACCGCCGCCGCGACGAAATGTCGACTACGCCACCGATCCTTCACCATCTCAATCTTTCTACGCATCAAACGAAGACGACTACGACGC
ATCGGAATCAATTAACTTCCAACCCACAAACTCTGCAGAATCCGCCGCCGAAACCGGTTCAAGTTACATCAACATTGCACCGTTGCCGGTTTTCCGCGGCGGGGCGGACG
AGTGTCCGGCAATGCATTTAAGCAGATTCGCAAAAGTTTGCCGTGCGAATAACGCCGCCGCCGTGGATACGATGATGAGGATATTTCCGGTGACGTTGGAGGGCGAGGCT
GCGCTTTGGTACGACTTGAACATCGAGCCGTACCCTCCGGTTTCATGGGAAGAATTGAAGTCTTTGTTCATGGAGGCGTTTAGTAGAATTGAATTGACTGATCAGTTGCA
ATCGGAGCTTATGACGATCAATCAACGGGATGAAGAGAATGTTCGTTCTTATTTTCTGAGGCTGCAATTGATTTTGAAGAAATGGCCGATGGGTTACGAACTTTCTGATG
GATTGTTGAAAGGGATTTTCATGGATGGATTGAGAGAAGAGTTTAAGGAATGGATGATTCCACAGAAACCCAGCTCTCTGAATGAGGCATTGAGACTTGCATTTGGGTTT
GAACAAGTTCGAAGCGTCCGTACATCGGATCGAAATCGGTTTCTCCGGTGCGGGTTTTGTGAGGGGCCGCATCAGGAAGTGGTTTGTGAGGTTAGGGAGAGAATGAGACT
GTTATGGAAGAGTAGGGAAGAGAAGAAAGCTGTTGATGAGGCCACGGCGGAGCTTACGAGATCGGTTTCGGCGAGAAGTAGAAATGAGGCGGCTGAAGAGGATGGTGGGA
AGAAGAAGAGTCAATGTCAGTGTGGGAAGCATCAGTGTGGGATGAAGAAATTGGATCGAAGTTTTAGCATGGTATCTAAAAGTTCTAAAGGCTAA
Protein sequenceShow/hide protein sequence
MARKLRRSPPPRRNVDYATDPSPSQSFYASNEDDYDASESINFQPTNSAESAAETGSSYINIAPLPVFRGGADECPAMHLSRFAKVCRANNAAAVDTMMRIFPVTLEGEA
ALWYDLNIEPYPPVSWEELKSLFMEAFSRIELTDQLQSELMTINQRDEENVRSYFLRLQLILKKWPMGYELSDGLLKGIFMDGLREEFKEWMIPQKPSSLNEALRLAFGF
EQVRSVRTSDRNRFLRCGFCEGPHQEVVCEVRERMRLLWKSREEKKAVDEATAELTRSVSARSRNEAAEEDGGKKKSQCQCGKHQCGMKKLDRSFSMVSKSSKG