; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026070 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026070
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr10:28650693..28652952
RNA-Seq ExpressionLag0026070
SyntenyLag0026070
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY86564.1 hypothetical protein Acr_05g0002030 [Actinidia rufa]8.7e-5533.61Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDRRNDPVQHLDTYRSWMGFHGASEATKC------------------------------------LF
        +++L+ Q DPPF + +++  +  KFKLPT    Y+ + DP+ HLD+Y+S M   G S+   C                                      
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDRRNDPVQHLDTYRSWMGFHGASEATKC------------------------------------LF

Query:  GCKG-PKKAAVQFVDYQAEVEGEPEWVYHMLQQRSCAGEDQPRTYAEFVSRAQKYINTEELMKSKRAEREAQRVTTIDKSRRKEERGKRPREEDGDWSHL
         C+   K A+  F  +Q E E   ++V    Q      +   +T +   S+A KYI  EEL ++KR  R  +     +   R+ +  +  R +  D    
Subjt:  GCKG-PKKAAVQFVDYQAEVEGEPEWVYHMLQQRSCAGEDQPRTYAEFVSRAQKYINTEELMKSKRAEREAQRVTTIDKSRRKEERGKRPREEDGDWSHL

Query:  RYSSGRSHLDQREGRGRPEFREGIRTSIVTSIGIMDTTRNCIQLRDEIESLIKEGYLKEFVKGE------ERK--GSRPT-------REGRDGGGRIESK
        R ++ R     R    RPEF      S+ T +          QLR++I  LIK GYL++++         ERK   +RPT         G   GG   S 
Subjt:  RYSSGRSHLDQREGRGRPEFREGIRTSIVTSIGIMDTTRNCIQLRDEIESLIKEGYLKEFVKGE------ERK--GSRPT-------REGRDGGGRIESK

Query:  RKAAAREARIELEEQRVYSVQ--VSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFG
        RK  AR A   +EE+ VY++    ++  P + F+  +  G+H PH+DALVVS +IAN  V R+L+D GSSADIL    FE MK+G  +L     PL+GFG
Subjt:  RKAAAREARIELEEQRVYSVQ--VSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFG

Query:  GERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALR
        G    P G I LP+T G      T   +F+VVDC S YN+ILGRP L  +KA+ STYH  MKF    G+G V G+QK +R+C+ +A++
Subjt:  GERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALR

GFZ11327.1 hypothetical protein Acr_22g0007250 [Actinidia rufa]6.2e-5332.49Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDRRNDPVQHLDTYRSWMGFHGASEATKC-LFGC--KGP----------------------------
        +++L+ Q DPPF + +++  +  KFKLPT    Y+ + DP+ HLD+Y+S M   G S+   C  F    KGP                            
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDRRNDPVQHLDTYRSWMGFHGASEATKC-LFGC--KGP----------------------------

Query:  ------KKAAVQFVDYQAEVEGEPEWVYHMLQQ-------------------------RSCAGEDQPRTYAEFVSRAQKYINTEELMKSKRAEREAQRVT
              K A+  F  +Q E E   ++V    Q                               ++   T +   S+A KYI  EEL ++KR  R  +   
Subjt:  ------KKAAVQFVDYQAEVEGEPEWVYHMLQQ-------------------------RSCAGEDQPRTYAEFVSRAQKYINTEELMKSKRAEREAQRVT

Query:  TIDKSRRKEERGKRPREEDGDWSHLRYSSGRSHLDQREGRGRPEFR-EGIRTSIVTSIGIMDTTRNCIQLRDEIESLIKEGYLKEFV------KGEERK-
          +   R+ +  +  R +  D    R ++ R     R    RPEF    + T +    G    T +C QLR++I   IK GYL++++         ERK 
Subjt:  TIDKSRRKEERGKRPREEDGDWSHLRYSSGRSHLDQREGRGRPEFR-EGIRTSIVTSIGIMDTTRNCIQLRDEIESLIKEGYLKEFV------KGEERK-

Query:  -GSRPT-------REGRDGGGRIESKRKAAAREARIELEEQRVYSVQ--VSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADIL
          +RPT         G   GG   S RK  AR A   +EE+ VY++    ++  P + F+  +  G+H PH+DALVVS +IAN  V R+L+D GSSADIL
Subjt:  -GSRPT-------REGRDGGGRIESKRKAAAREARIELEEQRVYSVQ--VSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADIL

Query:  STKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWG
            FE MK+G  +L     PL+GFGG    P G I LP+T G      T   +F+VVDC S YN+ILGRP L   K + STYH  MKF    G+G V G
Subjt:  STKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWG

Query:  EQKASRECYFTALR
        +QK +R+C+ +A++
Subjt:  EQKASRECYFTALR

XP_022155186.1 uncharacterized protein LOC111022321 [Momordica charantia]1.4e-5256.78Show/hide
Query:  ESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGF
        E KRKA  REAR   ++  VY   ++N   ++EF+E EA+ +  PHNDALV++L IAN +VHR+LVDGGSSADI+S   ++AM LG+  L++S APLVGF
Subjt:  ESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGF

Query:  GGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERTGQHG
        GGERV+ EG IELPVT+G G   VTKMV+FLVV+  S+YNAILGRP +H LKA+ STYH+  KF    GVG + GEQ+ SRECY T++R  ++RT   G
Subjt:  GGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERTGQHG

XP_022156748.1 uncharacterized protein LOC111023587 [Momordica charantia]5.6e-5449.59Show/hide
Query:  TRNCIQLRDEIESLIKEGYLKEFVKGEERKGSRPTREGRDGGGRIESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQPHNDALVVSLM
        T++C  L++E+E LI+ GYLKE                     RI  K+KA  REAR   E+  VY   +++   ++EF+E EA+ +   HNDALV++L 
Subjt:  TRNCIQLRDEIESLIKEGYLKEFVKGEERKGSRPTREGRDGGGRIESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQPHNDALVVSLM

Query:  IANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVA
        IAN +VHR+LVDGGSSADI+S   ++AM L +  L++S APLVGFGGERV+ EG IELPVT+G G   VTKMV+FLVV+  S+YN ILGR  +H LK + 
Subjt:  IANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVA

Query:  STYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERTGQHG
        STYH+ MKF    GV  + GEQ+ SRECY+T++RG ++RT   G
Subjt:  STYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERTGQHG

XP_022158093.1 uncharacterized protein LOC111024662 [Momordica charantia]8.1e-5346.61Show/hide
Query:  TRNCIQLRDEIESLIKEGYLKEFVK------GEERKGSRPTRE-----GRDGGGRIESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQ
        T++C  L++E+E LI+ GYLKE+V+        E     P +E     G      +  KRKA+ +EAR      +VY    +     ++F+E E + +  
Subjt:  TRNCIQLRDEIESLIKEGYLKEFVK------GEERKGSRPTRE-----GRDGGGRIESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQ

Query:  PHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILG
        PHNDALV++L I NT+VHR+LVDGGSS  I+S   ++AM LG+  L++++APLVGFGGERV+ +  I+LPVT+G G   +TK+V FLVVD  S+YNAILG
Subjt:  PHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILG

Query:  RPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERT
        RP +H LKA+ STYHK +KF  + G+  V GEQ+ S ECY+T+LRG    T
Subjt:  RPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERT

TrEMBL top hitse value%identityAlignment
A0A6J1DQY2 uncharacterized protein LOC1110223216.7e-5356.78Show/hide
Query:  ESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGF
        E KRKA  REAR   ++  VY   ++N   ++EF+E EA+ +  PHNDALV++L IAN +VHR+LVDGGSSADI+S   ++AM LG+  L++S APLVGF
Subjt:  ESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGF

Query:  GGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERTGQHG
        GGERV+ EG IELPVT+G G   VTKMV+FLVV+  S+YNAILGRP +H LKA+ STYH+  KF    GVG + GEQ+ SRECY T++R  ++RT   G
Subjt:  GGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERTGQHG

A0A6J1DRG9 uncharacterized protein LOC1110235872.7e-5449.59Show/hide
Query:  TRNCIQLRDEIESLIKEGYLKEFVKGEERKGSRPTREGRDGGGRIESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQPHNDALVVSLM
        T++C  L++E+E LI+ GYLKE                     RI  K+KA  REAR   E+  VY   +++   ++EF+E EA+ +   HNDALV++L 
Subjt:  TRNCIQLRDEIESLIKEGYLKEFVKGEERKGSRPTREGRDGGGRIESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQPHNDALVVSLM

Query:  IANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVA
        IAN +VHR+LVDGGSSADI+S   ++AM L +  L++S APLVGFGGERV+ EG IELPVT+G G   VTKMV+FLVV+  S+YN ILGR  +H LK + 
Subjt:  IANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVA

Query:  STYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERTGQHG
        STYH+ MKF    GV  + GEQ+ SRECY+T++RG ++RT   G
Subjt:  STYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERTGQHG

A0A6J1DV51 uncharacterized protein LOC1110246623.9e-5346.61Show/hide
Query:  TRNCIQLRDEIESLIKEGYLKEFVK------GEERKGSRPTRE-----GRDGGGRIESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQ
        T++C  L++E+E LI+ GYLKE+V+        E     P +E     G      +  KRKA+ +EAR      +VY    +     ++F+E E + +  
Subjt:  TRNCIQLRDEIESLIKEGYLKEFVK------GEERKGSRPTRE-----GRDGGGRIESKRKAAAREARIELEEQRVYSVQVSNGLPSVEFTELEASGIHQ

Query:  PHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILG
        PHNDALV++L I NT+VHR+LVDGGSS  I+S   ++AM LG+  L++++APLVGFGGERV+ +  I+LPVT+G G   +TK+V FLVVD  S+YNAILG
Subjt:  PHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILG

Query:  RPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERT
        RP +H LKA+ STYHK +KF  + G+  V GEQ+ S ECY+T+LRG    T
Subjt:  RPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERT

A0A7J0EJC6 Retrotrans_gag domain-containing protein4.2e-5533.61Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDRRNDPVQHLDTYRSWMGFHGASEATKC------------------------------------LF
        +++L+ Q DPPF + +++  +  KFKLPT    Y+ + DP+ HLD+Y+S M   G S+   C                                      
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDRRNDPVQHLDTYRSWMGFHGASEATKC------------------------------------LF

Query:  GCKG-PKKAAVQFVDYQAEVEGEPEWVYHMLQQRSCAGEDQPRTYAEFVSRAQKYINTEELMKSKRAEREAQRVTTIDKSRRKEERGKRPREEDGDWSHL
         C+   K A+  F  +Q E E   ++V    Q      +   +T +   S+A KYI  EEL ++KR  R  +     +   R+ +  +  R +  D    
Subjt:  GCKG-PKKAAVQFVDYQAEVEGEPEWVYHMLQQRSCAGEDQPRTYAEFVSRAQKYINTEELMKSKRAEREAQRVTTIDKSRRKEERGKRPREEDGDWSHL

Query:  RYSSGRSHLDQREGRGRPEFREGIRTSIVTSIGIMDTTRNCIQLRDEIESLIKEGYLKEFVKGE------ERK--GSRPT-------REGRDGGGRIESK
        R ++ R     R    RPEF      S+ T +          QLR++I  LIK GYL++++         ERK   +RPT         G   GG   S 
Subjt:  RYSSGRSHLDQREGRGRPEFREGIRTSIVTSIGIMDTTRNCIQLRDEIESLIKEGYLKEFVKGE------ERK--GSRPT-------REGRDGGGRIESK

Query:  RKAAAREARIELEEQRVYSVQ--VSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFG
        RK  AR A   +EE+ VY++    ++  P + F+  +  G+H PH+DALVVS +IAN  V R+L+D GSSADIL    FE MK+G  +L     PL+GFG
Subjt:  RKAAAREARIELEEQRVYSVQ--VSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFG

Query:  GERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALR
        G    P G I LP+T G      T   +F+VVDC S YN+ILGRP L  +KA+ STYH  MKF    G+G V G+QK +R+C+ +A++
Subjt:  GERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALR

A0A7J0GKI2 Retrotrans_gag domain-containing protein3.0e-5332.49Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDRRNDPVQHLDTYRSWMGFHGASEATKC-LFGC--KGP----------------------------
        +++L+ Q DPPF + +++  +  KFKLPT    Y+ + DP+ HLD+Y+S M   G S+   C  F    KGP                            
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPT-FPQYDRRNDPVQHLDTYRSWMGFHGASEATKC-LFGC--KGP----------------------------

Query:  ------KKAAVQFVDYQAEVEGEPEWVYHMLQQ-------------------------RSCAGEDQPRTYAEFVSRAQKYINTEELMKSKRAEREAQRVT
              K A+  F  +Q E E   ++V    Q                               ++   T +   S+A KYI  EEL ++KR  R  +   
Subjt:  ------KKAAVQFVDYQAEVEGEPEWVYHMLQQ-------------------------RSCAGEDQPRTYAEFVSRAQKYINTEELMKSKRAEREAQRVT

Query:  TIDKSRRKEERGKRPREEDGDWSHLRYSSGRSHLDQREGRGRPEFR-EGIRTSIVTSIGIMDTTRNCIQLRDEIESLIKEGYLKEFV------KGEERK-
          +   R+ +  +  R +  D    R ++ R     R    RPEF    + T +    G    T +C QLR++I   IK GYL++++         ERK 
Subjt:  TIDKSRRKEERGKRPREEDGDWSHLRYSSGRSHLDQREGRGRPEFR-EGIRTSIVTSIGIMDTTRNCIQLRDEIESLIKEGYLKEFV------KGEERK-

Query:  -GSRPT-------REGRDGGGRIESKRKAAAREARIELEEQRVYSVQ--VSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADIL
          +RPT         G   GG   S RK  AR A   +EE+ VY++    ++  P + F+  +  G+H PH+DALVVS +IAN  V R+L+D GSSADIL
Subjt:  -GSRPT-------REGRDGGGRIESKRKAAAREARIELEEQRVYSVQ--VSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADIL

Query:  STKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWG
            FE MK+G  +L     PL+GFGG    P G I LP+T G      T   +F+VVDC S YN+ILGRP L   K + STYH  MKF    G+G V G
Subjt:  STKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVVTKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWG

Query:  EQKASRECYFTALR
        +QK +R+C+ +A++
Subjt:  EQKASRECYFTALR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCAGGAAAACCCAGTGGGCGAAGGAGGTCGACAACCCAAGGCAAACACCCCAGACATAGAGATGGACGTCCTTAGAGGAAGGATGAACGAGATGGGGCAGAGTTT
GGCTGAGATTTTTGGTATCCTGAAACAACCGAATCCCAGCACGAAGCACCAGAAGAGTTTTGTGCGAGATCGTGAGAAGGGAAAATGGGTTTTCGACGAGGAAGAAGGGG
AAACAGATAGTGCAACCAGCAAGCTGCGCAAGCTGAGTGATGGCAAGGAACTTGCCTTAAAGGAGCCAGGGCCAAGCAGAAGGGGAGAGCGCAAAGGCGCACTTGATGTC
CCAGATGAGGTCAGTACTGTGGGTTCGCACAGGAGGATCGAAGTCGAGGCCGAGGCCAAGATGAGGGAGAAGGTTGAGCTTGAAGCAAAGATCCGGGCCGAGCTTGAAGG
TAAGTTCAGAGCCGAAGCCGAAGCTACGGTCAAAGCAAAGGCCGAGGCTAAGGCCAAGGTTGGTACCCAGGGAAACACACAGTCTAGGGATGTGGATAAAGAAAACTTAG
AAAGCTTAATAGGACAGGCCGACCCACCCTTTGTCGATGAGATTATGCAAGCGGAAGTTCCGCATAAGTTTAAATTACCGACTTTTCCACAGTATGACAGGAGGAACGAC
CCGGTTCAACATCTAGATACCTACCGATCCTGGATGGGGTTTCATGGGGCTTCTGAGGCAACAAAGTGTCTTTTTGGGTGCAAGGGACCGAAGAAAGCCGCAGTTCAATT
TGTTGACTATCAAGCAGAAGTCGAGGGAGAGCCTGAATGGGTATATCACATGCTTCAGCAACGAAGTTGTGCAGGGGAGGATCAACCAAGAACATACGCTGAATTCGTTT
CCAGGGCGCAAAAGTATATAAACACGGAGGAGCTGATGAAATCCAAGCGGGCGGAAAGAGAAGCGCAGAGGGTAACCACTATTGACAAGAGCAGGAGAAAAGAGGAAAGG
GGTAAGAGGCCGCGGGAAGAGGACGGAGACTGGAGCCACCTTAGATACTCCTCTGGTCGGAGTCATCTAGACCAGAGGGAGGGCCGAGGCCGACCAGAATTCAGAGAAGG
GATAAGAACAAGTATTGTCACTTCCATCGGGATCATGGACACAACGCGCAATTGCATACAGCTTCGGGACGAGATAGAGAGTCTGATCAAGGAAGGGTATTTGAAAGAGT
TTGTCAAAGGCGAGGAGAGAAAAGGGTCGAGGCCGACCAGGGAGGGTCGCGATGGGGGAGGGAGAATCGAATCGAAAAGGAAAGCAGCGGCCCGAGAAGCGAGGATTGAG
CTTGAAGAGCAGAGAGTGTACTCAGTACAGGTTTCAAATGGGTTGCCGTCGGTAGAGTTCACTGAGTTAGAAGCCTCCGGTATACACCAACCTCACAACGATGCACTGGT
GGTGAGCCTGATGATTGCTAATACACGAGTTCATCGGGTTTTGGTTGATGGCGGGAGTTCAGCAGACATCCTATCCACAAAAGTGTTTGAGGCCATGAAACTAGGGAAAG
GACGACTAAGAACAAGTGTAGCACCGCTGGTTGGCTTCGGGGGAGAAAGGGTGATGCCAGAAGGTAGCATAGAGTTGCCTGTGACCTATGGGGAAGGACATAATGTGGTG
ACGAAGATGGTTAATTTTCTGGTAGTAGACTGTGTATCCGCCTACAATGCCATTCTCGGAAGGCCAGCGCTGCATGAATTGAAGGCTGTTGCGTCAACTTATCACAAGAT
GATGAAGTTCCTGGCCAACGATGGGGTAGGGGTCGTGTGGGGAGAACAAAAAGCTTCCCGTGAGTGTTATTTCACTGCACTCAGAGGAACGAGCGAACGAACGGGGCAGC
ATGGCCCGTGGACGGGCGAGGGTCGAGGCCGAGCAGAGGAATGTTCTTCGCCAATGGAACACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGCAGGAAAACCCAGTGGGCGAAGGAGGTCGACAACCCAAGGCAAACACCCCAGACATAGAGATGGACGTCCTTAGAGGAAGGATGAACGAGATGGGGCAGAGTTT
GGCTGAGATTTTTGGTATCCTGAAACAACCGAATCCCAGCACGAAGCACCAGAAGAGTTTTGTGCGAGATCGTGAGAAGGGAAAATGGGTTTTCGACGAGGAAGAAGGGG
AAACAGATAGTGCAACCAGCAAGCTGCGCAAGCTGAGTGATGGCAAGGAACTTGCCTTAAAGGAGCCAGGGCCAAGCAGAAGGGGAGAGCGCAAAGGCGCACTTGATGTC
CCAGATGAGGTCAGTACTGTGGGTTCGCACAGGAGGATCGAAGTCGAGGCCGAGGCCAAGATGAGGGAGAAGGTTGAGCTTGAAGCAAAGATCCGGGCCGAGCTTGAAGG
TAAGTTCAGAGCCGAAGCCGAAGCTACGGTCAAAGCAAAGGCCGAGGCTAAGGCCAAGGTTGGTACCCAGGGAAACACACAGTCTAGGGATGTGGATAAAGAAAACTTAG
AAAGCTTAATAGGACAGGCCGACCCACCCTTTGTCGATGAGATTATGCAAGCGGAAGTTCCGCATAAGTTTAAATTACCGACTTTTCCACAGTATGACAGGAGGAACGAC
CCGGTTCAACATCTAGATACCTACCGATCCTGGATGGGGTTTCATGGGGCTTCTGAGGCAACAAAGTGTCTTTTTGGGTGCAAGGGACCGAAGAAAGCCGCAGTTCAATT
TGTTGACTATCAAGCAGAAGTCGAGGGAGAGCCTGAATGGGTATATCACATGCTTCAGCAACGAAGTTGTGCAGGGGAGGATCAACCAAGAACATACGCTGAATTCGTTT
CCAGGGCGCAAAAGTATATAAACACGGAGGAGCTGATGAAATCCAAGCGGGCGGAAAGAGAAGCGCAGAGGGTAACCACTATTGACAAGAGCAGGAGAAAAGAGGAAAGG
GGTAAGAGGCCGCGGGAAGAGGACGGAGACTGGAGCCACCTTAGATACTCCTCTGGTCGGAGTCATCTAGACCAGAGGGAGGGCCGAGGCCGACCAGAATTCAGAGAAGG
GATAAGAACAAGTATTGTCACTTCCATCGGGATCATGGACACAACGCGCAATTGCATACAGCTTCGGGACGAGATAGAGAGTCTGATCAAGGAAGGGTATTTGAAAGAGT
TTGTCAAAGGCGAGGAGAGAAAAGGGTCGAGGCCGACCAGGGAGGGTCGCGATGGGGGAGGGAGAATCGAATCGAAAAGGAAAGCAGCGGCCCGAGAAGCGAGGATTGAG
CTTGAAGAGCAGAGAGTGTACTCAGTACAGGTTTCAAATGGGTTGCCGTCGGTAGAGTTCACTGAGTTAGAAGCCTCCGGTATACACCAACCTCACAACGATGCACTGGT
GGTGAGCCTGATGATTGCTAATACACGAGTTCATCGGGTTTTGGTTGATGGCGGGAGTTCAGCAGACATCCTATCCACAAAAGTGTTTGAGGCCATGAAACTAGGGAAAG
GACGACTAAGAACAAGTGTAGCACCGCTGGTTGGCTTCGGGGGAGAAAGGGTGATGCCAGAAGGTAGCATAGAGTTGCCTGTGACCTATGGGGAAGGACATAATGTGGTG
ACGAAGATGGTTAATTTTCTGGTAGTAGACTGTGTATCCGCCTACAATGCCATTCTCGGAAGGCCAGCGCTGCATGAATTGAAGGCTGTTGCGTCAACTTATCACAAGAT
GATGAAGTTCCTGGCCAACGATGGGGTAGGGGTCGTGTGGGGAGAACAAAAAGCTTCCCGTGAGTGTTATTTCACTGCACTCAGAGGAACGAGCGAACGAACGGGGCAGC
ATGGCCCGTGGACGGGCGAGGGTCGAGGCCGAGCAGAGGAATGTTCTTCGCCAATGGAACACTGA
Protein sequenceShow/hide protein sequence
MGQENPVGEGGRQPKANTPDIEMDVLRGRMNEMGQSLAEIFGILKQPNPSTKHQKSFVRDREKGKWVFDEEEGETDSATSKLRKLSDGKELALKEPGPSRRGERKGALDV
PDEVSTVGSHRRIEVEAEAKMREKVELEAKIRAELEGKFRAEAEATVKAKAEAKAKVGTQGNTQSRDVDKENLESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDRRND
PVQHLDTYRSWMGFHGASEATKCLFGCKGPKKAAVQFVDYQAEVEGEPEWVYHMLQQRSCAGEDQPRTYAEFVSRAQKYINTEELMKSKRAEREAQRVTTIDKSRRKEER
GKRPREEDGDWSHLRYSSGRSHLDQREGRGRPEFREGIRTSIVTSIGIMDTTRNCIQLRDEIESLIKEGYLKEFVKGEERKGSRPTREGRDGGGRIESKRKAAAREARIE
LEEQRVYSVQVSNGLPSVEFTELEASGIHQPHNDALVVSLMIANTRVHRVLVDGGSSADILSTKVFEAMKLGKGRLRTSVAPLVGFGGERVMPEGSIELPVTYGEGHNVV
TKMVNFLVVDCVSAYNAILGRPALHELKAVASTYHKMMKFLANDGVGVVWGEQKASRECYFTALRGTSERTGQHGPWTGEGRGRAEECSSPMEH