; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022590 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022590
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUlp1-like peptidase
Genome locationchr7:33925842..33928348
RNA-Seq ExpressionLag0022590
SyntenyLag0022590
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK27213.1 Ulp1-like peptidase [Cucumis melo var. makuwa]6.9e-2121.76Show/hide
Query:  NDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK--------------------------ACLVWTYETVSSLT
        +D+D VK+AL YFIE++++G++R+ ++D     I D+W    N DW +I+F  T+ +LK+                          A  VW YE++ ++ 
Subjt:  NDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK--------------------------ACLVWTYETVSSLT

Query:  GRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHIGTCGHRGRGSIYEPCDGAAPCPPPPPPPPAPKLPGMNVDDVDVETHDRTEDVGTSSKAP
        G   + V D+AIPR+LRW C  SP    +S +VF S    I                              P+   + +   ++  + R+ ++  S    
Subjt:  GRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHIGTCGHRGRGSIYEPCDGAAPCPPPPPPPPAPKLPGMNVDDVDVETHDRTEDVGTSSKAP

Query:  DRTC----------KKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTHARSGPGAKGADTPLNVATSSAVQTDTQQ
         +            KK +   S+++ ++ A++ L   +  +E  L +IK  +  L    M     +   L         K ++  ++    S    + + 
Subjt:  DRTC----------KKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTHARSGPGAKGADTPLNVATSSAVQTDTQQ

Query:  KSPVMEGTTGGGGGPVQGWQGTTSHQG-QAALGVQSTSQQNEPIE--RRGTRKRKTAWKLRTPWKDTREDGKRRKVLTYNPIPEIPEDWSTKFKTWLDSE
        +     GT      P +        +G +  L  +      EPI+    G R R       T  +  R   +   ++ Y+P+ +I +    + + W+  +
Subjt:  KSPVMEGTTGGGGGPVQGWQGTTSHQG-QAALGVQSTSQQNEPIE--RRGTRKRKTAWKLRTPWKDTREDGKRRKVLTYNPIPEIPEDWSTKFKTWLDSE

Query:  DPKDRVRRTEYGVTDKSWFVDLLTPSKWMTDEVIDSIFMFVQKKFEQRPQLCRRKFTTAD-LFVSVCISL------------------------------
           D +R T +G   K++F DL    +W+ DE +D++F+F++ K +       + FTTAD +F+ + +S                               
Subjt:  DPKDRVRRTEYGVTDKSWFVDLLTPSKWMTDEVIDSIFMFVQKKFEQRPQLCRRKFTTAD-LFVSVCISL------------------------------

Query:  -----------SLYLPYNLGGLHWVLVCIDLEVGEVVVSNSLRALNKEEVVEEELRSFATSCPMYFGRSGLW-IQGRNS
                    +Y P+N+ G HWVL+C+DL   +V V +SL +L   E +   L       P     +G +  +GR+S
Subjt:  -----------SLYLPYNLGGLHWVLVCIDLEVGEVVVSNSLRALNKEEVVEEELRSFATSCPMYFGRSGLW-IQGRNS

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]6.9e-2132.94Show/hide
Query:  MKGFELDILFPNLRFENDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKKAC----------------------
        +K  EL+ +F    F +DED VK+ + YFIELAMMG+ERKQ +DT+LLG++D WE  CN DWS +IFD TI SLK A                       
Subjt:  MKGFELDILFPNLRFENDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKKAC----------------------

Query:  --------LVWTYETVSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHI--------GTCGHRGRGSIYEPCDGAAPCPPP-------
                 VW YET+S+L+        D+AIPR+LRWSC +S    VL+ EVF +    +            H  R  I  P     P PP        
Subjt:  --------LVWTYETVSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHI--------GTCGHRGRGSIYEPCDGAAPCPPP-------

Query:  PPPPPAPKLPGMNVDDVDVETHDRTEDVGTSSKAPDRTCKKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTHARS
        P PP +P+   +     DVE     ED    + A D    + R   +  EG+E  +K+            K  K+  RRL +++  CV   E  L     
Subjt:  PPPPPAPKLPGMNVDDVDVETHDRTEDVGTSSKAPDRTCKKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTHARS

Query:  GPGAKGADTPLNVATSSAVQTDTQQKSPVMEGTTGGGGGP
        G   KG    L              K P      GGGGGP
Subjt:  GPGAKGADTPLNVATSSAVQTDTQQKSPVMEGTTGGGGGP

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]2.1e-3049.35Show/hide
Query:  MKGFELDILFPNLRFENDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKKAC----------------------
        +K  EL+ +F    FENDEDAVKIA+ YFIELAMMG+ERK +MDTSLLGI+D WE  CN DWS +IF+ T+ SLK A                       
Subjt:  MKGFELDILFPNLRFENDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKKAC----------------------

Query:  --------LVWTYETVSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVF
                 VW YET+S+L+ RVA  + D+AIPR+LRWSC++S    VL +EVF
Subjt:  --------LVWTYETVSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVF

XP_022158673.1 uncharacterized protein LOC111025136 [Momordica charantia]1.6e-2238.73Show/hide
Query:  ELDILFPNLRFENDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKKACL----------------------VWT
        E +  +  ++FE+D DAVKI++  F+EL + GR+R  ++D SLLG++D+ E  CN  W+++ F+ TI+SLK+A                        VW 
Subjt:  ELDILFPNLRFENDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKKACL----------------------VWT

Query:  YETVSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVF
        YET+S LT RVA+ ++ + +PRIL+W C +SP   V+ KE+F
Subjt:  YETVSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVF

XP_022158744.1 uncharacterized protein LOC111025209 [Momordica charantia]5.6e-2340.41Show/hide
Query:  ELDILFPNLRFENDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK----------------------ACLVWT
        EL+ ++ ++RFE+D DAVK+ L YF+EL ++GRER  + D  LLGI+D+WE  CN DW+ + FD TI SL++                      A  VW 
Subjt:  ELDILFPNLRFENDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK----------------------ACLVWT

Query:  YETVSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDT
        YE +SSL+G +   V  + +PRIL+W   HS    +L++E+F S T
Subjt:  YETVSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDT

TrEMBL top hitse value%identityAlignment
A0A5A7TPK2 Ulp1-like peptidase6.3e-2021.22Show/hide
Query:  LRFENDED-AVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK--------------------------ACLVWTYET
        L +E D+D  VK+AL YFIE++++G++R+ ++D     I D+W    N DW +I+F  T+ +L +                          A  VW YE+
Subjt:  LRFENDED-AVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK--------------------------ACLVWTYET

Query:  VSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHIGTCGHRGRGSIYEPCDGAAPCPPPPPPPPAPKLPGMNVDDVDVETHDRTEDVGT
        + ++ G   + V D+AIPR+LRW C  SP    +S +VF S    I         ++ E                 P+   + +   ++  + R+  +  
Subjt:  VSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHIGTCGHRGRGSIYEPCDGAAPCPPPPPPPPAPKLPGMNVDDVDVETHDRTEDVGT

Query:  SSKAPDRTC----------KKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTHARSGPGAKGADTPLNVATSSAVQ
        S     +            KK +   S+++ ++ A++ L   +  +E  L +IK  +  L    M     +   L         K ++  ++    S   
Subjt:  SSKAPDRTC----------KKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTHARSGPGAKGADTPLNVATSSAVQ

Query:  TDTQQKSPVMEGTTGGGGGPVQGWQGTTSHQG-QAALGVQSTSQQNEPIE-------------------RRGTRKRKTAWKLRTPWK-------DTREDG
         + + K     GT      P +        +G +  L  +      EPI+                    R  R+++ +  L TP+         +    
Subjt:  TDTQQKSPVMEGTTGGGGGPVQGWQGTTSHQG-QAALGVQSTSQQNEPIE-------------------RRGTRKRKTAWKLRTPWK-------DTREDG

Query:  KRRKVLTYNPIPEIPEDWSTKFKTWLDSEDPKDRVRRTEYGVTDKSWFVDLLTPSKWMTDEVIDSIFMFVQKKFEQRPQLCRRKFTTAD-LFVSVCISL-
         + + + Y+P+ +I +    + + W+  +   D +R T +G   K++F DL    +W++DE +D++F+F++ K +       + FTTAD +F+ + +S  
Subjt:  KRRKVLTYNPIPEIPEDWSTKFKTWLDSEDPKDRVRRTEYGVTDKSWFVDLLTPSKWMTDEVIDSIFMFVQKKFEQRPQLCRRKFTTAD-LFVSVCISL-

Query:  ----------------------------------------SLYLPYNLGGLHWVLVCIDLEVGEVVVSNSLRALNKEEVVEEELRSFATSCPMYFGRSGL
                                                 +Y P+N+ G HWVL+C+DL   +V V +SL +L   E +   L       P     +G 
Subjt:  ----------------------------------------SLYLPYNLGGLHWVLVCIDLEVGEVVVSNSLRALNKEEVVEEELRSFATSCPMYFGRSGL

Query:  W-IQGRNS
        +  +GR+S
Subjt:  W-IQGRNS

A0A5A7UQ48 Ulp1-like peptidase1.4e-1921.59Show/hide
Query:  NDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK--------------------------ACLVWTYETVSSLT
        +D+D VK+AL YFIE++++G++R+ ++D     I D+W    N DW +I+F  T+ +LK+                          A  VW YE++ ++ 
Subjt:  NDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK--------------------------ACLVWTYETVSSLT

Query:  GRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHIGTCGHRGRGSIYEPCDGAAPCPPPPPPPPAPKLPGMNVDDVDVETHDRTEDVGTSSKA-
        G   + V D+AIPR+LRW C  SP    +S +VF S    I                  A     P         G   ++    T  ++++ G+     
Subjt:  GRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHIGTCGHRGRGSIYEPCDGAAPCPPPPPPPPAPKLPGMNVDDVDVETHDRTEDVGTSSKA-

Query:  ---PDRTCKKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTH-ARSGPGAKGADTPLNVATSSAVQTDTQQKSPVM
            +   KK +   S+++ ++ A++ L   +  +E  L  IK  +  L            + L H      G +G            ++   +    V+
Subjt:  ---PDRTCKKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTH-ARSGPGAKGADTPLNVATSSAVQTDTQQKSPVM

Query:  EGTTG-GGGGPVQGWQGTTSHQGQAALGVQSTSQQNEPIERRGTRKRKTAWKLRTPWK-------DTREDGKRRKVLTYNPIPEIPEDWSTKFKTWLDSE
        E     G   P+           +  + ++    Q   +  R  R+++ +  L TP+         +       + + Y+P+ +I +    + + W+  +
Subjt:  EGTTG-GGGGPVQGWQGTTSHQGQAALGVQSTSQQNEPIERRGTRKRKTAWKLRTPWK-------DTREDGKRRKVLTYNPIPEIPEDWSTKFKTWLDSE

Query:  DPKDRVRRTEYGVTDKSWFVDLLTPSKWMTDEVIDSIFMFVQKKFEQRPQLCRRKFTTAD-LFVSVCISL------------------------------
           D +R T +G   K++F DL    +W++DE +D++F+F++ K +       + FTTAD +F+ + +S                               
Subjt:  DPKDRVRRTEYGVTDKSWFVDLLTPSKWMTDEVIDSIFMFVQKKFEQRPQLCRRKFTTAD-LFVSVCISL------------------------------

Query:  -----------SLYLPYNLGGLHWVLVCIDLEVGEVVVSNSLRALNKEEVVEEELRSFATSCPMYFGRSGLW-IQGRNS
                    +Y P+N+ G HWVL+C+DL   +V V +SL +L   E +   L       P     +G +  +GR+S
Subjt:  -----------SLYLPYNLGGLHWVLVCIDLEVGEVVVSNSLRALNKEEVVEEELRSFATSCPMYFGRSGLW-IQGRNS

A0A5A7VPK4 Ulp1-like peptidase1.3e-2022.11Show/hide
Query:  NDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK--------------------------ACLVWTYETVSSLT
        +D+D VK+AL YFIE++++G++R+ ++D     I D+W    N DW +I+F  T+ +LK+                          A  VW YE++ ++ 
Subjt:  NDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK--------------------------ACLVWTYETVSSLT

Query:  GRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHIGTCGHRGRGSIYEPCDGAAPCPPPPPPPPAPKLPGMNVDDVDVETHDRTEDVGTSSKA-
        G   + V D+AIPR+LRW C  SP    +S +VF S    I                  A     P         G   ++    T  ++++ G+     
Subjt:  GRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHIGTCGHRGRGSIYEPCDGAAPCPPPPPPPPAPKLPGMNVDDVDVETHDRTEDVGTSSKA-

Query:  ---PDRTCKKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTH-ARSGPGAKGADTPLNVATSSAVQTDTQQKSPVM
            +   KK +   S+++ ++ A++ L   +  +E  L  IK  +  L            + L H      G +G            ++   +    V+
Subjt:  ---PDRTCKKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTH-ARSGPGAKGADTPLNVATSSAVQTDTQQKSPVM

Query:  EGTTG-GGGGPVQGWQGTTSHQGQAALGVQSTSQQNEPIERRGTRKRKTAWKLRTPWK-------DTREDGKRRKVLTYNPIPEIPEDWSTKFKTWLDSE
        E     G   P+           +  + ++    Q   +  R  R+++ +  L TP+         +       + + Y+P+ +I +    + + W+  +
Subjt:  EGTTG-GGGGPVQGWQGTTSHQGQAALGVQSTSQQNEPIERRGTRKRKTAWKLRTPWK-------DTREDGKRRKVLTYNPIPEIPEDWSTKFKTWLDSE

Query:  DPKDRVRRTEYGVTDKSWFVDLLTPSKWMTDEVIDSIFMFVQKKFEQRPQLCRRKFTTAD-LFVSVCISL------------------------------
           D +R T +G   K++F DL    +W++DE +D++F+F++ K +       + FTTAD +F+S+ +S                               
Subjt:  DPKDRVRRTEYGVTDKSWFVDLLTPSKWMTDEVIDSIFMFVQKKFEQRPQLCRRKFTTAD-LFVSVCISL------------------------------

Query:  -----------SLYLPYNLGGLHWVLVCIDLEVGEVVVSNSLRALNKEEVVEEELRSFATSCPMYFGRSGLW-IQGRNS-LSEDGLSV
                    +Y P+N+ G HWVL+C+DL   +V V +SL +L   E +   L       P     +G +  +GR+S   E GL V
Subjt:  -----------SLYLPYNLGGLHWVLVCIDLEVGEVVVSNSLRALNKEEVVEEELRSFATSCPMYFGRSGLW-IQGRNS-LSEDGLSV

A0A5D3DYH3 Ulp1-like peptidase1.4e-1921.59Show/hide
Query:  NDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK--------------------------ACLVWTYETVSSLT
        +D+D VK+AL YFIE++++G++R+ ++D     I D+W    N DW +I+F  T+ +LK+                          A  VW YE++ ++ 
Subjt:  NDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKK--------------------------ACLVWTYETVSSLT

Query:  GRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHIGTCGHRGRGSIYEPCDGAAPCPPPPPPPPAPKLPGMNVDDVDVETHDRTEDVGTSSKA-
        G   + V D+AIPR+LRW C  SP    +S +VF S    I                  A     P         G   ++    T  ++++ G+     
Subjt:  GRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVFASDTGHIGTCGHRGRGSIYEPCDGAAPCPPPPPPPPAPKLPGMNVDDVDVETHDRTEDVGTSSKA-

Query:  ---PDRTCKKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTH-ARSGPGAKGADTPLNVATSSAVQTDTQQKSPVM
            +   KK +   S+++ ++ A++ L   +  +E  L  IK  +  L            + L H      G +G            ++   +    V+
Subjt:  ---PDRTCKKCRLLDSRVEGIENAVKELNGNMKGIERDLKAIKKFMRRLSKVNMCCVSYRESSLTH-ARSGPGAKGADTPLNVATSSAVQTDTQQKSPVM

Query:  EGTTG-GGGGPVQGWQGTTSHQGQAALGVQSTSQQNEPIERRGTRKRKTAWKLRTPWK-------DTREDGKRRKVLTYNPIPEIPEDWSTKFKTWLDSE
        E     G   P+           +  + ++    Q   +  R  R+++ +  L TP+         +       + + Y+P+ +I +    + + W+  +
Subjt:  EGTTG-GGGGPVQGWQGTTSHQGQAALGVQSTSQQNEPIERRGTRKRKTAWKLRTPWK-------DTREDGKRRKVLTYNPIPEIPEDWSTKFKTWLDSE

Query:  DPKDRVRRTEYGVTDKSWFVDLLTPSKWMTDEVIDSIFMFVQKKFEQRPQLCRRKFTTAD-LFVSVCISL------------------------------
           D +R T +G   K++F DL    +W++DE +D++F+F++ K +       + FTTAD +F+ + +S                               
Subjt:  DPKDRVRRTEYGVTDKSWFVDLLTPSKWMTDEVIDSIFMFVQKKFEQRPQLCRRKFTTAD-LFVSVCISL------------------------------

Query:  -----------SLYLPYNLGGLHWVLVCIDLEVGEVVVSNSLRALNKEEVVEEELRSFATSCPMYFGRSGLW-IQGRNS
                    +Y P+N+ G HWVL+C+DL   +V V +SL +L   E +   L       P     +G +  +GR+S
Subjt:  -----------SLYLPYNLGGLHWVLVCIDLEVGEVVVSNSLRALNKEEVVEEELRSFATSCPMYFGRSGLW-IQGRNS

A0A6J1DRZ7 uncharacterized protein LOC1110238471.0e-3049.35Show/hide
Query:  MKGFELDILFPNLRFENDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKKAC----------------------
        +K  EL+ +F    FENDEDAVKIA+ YFIELAMMG+ERK +MDTSLLGI+D WE  CN DWS +IF+ T+ SLK A                       
Subjt:  MKGFELDILFPNLRFENDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKKAC----------------------

Query:  --------LVWTYETVSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVF
                 VW YET+S+L+ RVA  + D+AIPR+LRWSC++S    VL +EVF
Subjt:  --------LVWTYETVSSLTGRVANHVRDNAIPRILRWSCSHSPTLAVLSKEVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGATTTGAGTTAGATATATTATTCCCCAACCTTCGATTTGAGAATGATGAGGACGCAGTTAAGATAGCGTTGTTTTATTTCATCGAGCTTGCAATGATGGGGAG
AGAGAGAAAACAACAAATGGATACAAGCCTGCTAGGTATAATTGATGAGTGGGAGAGACTTTGCAATGAAGATTGGAGCAAAATCATATTTGATAATACCATTAAGTCAT
TGAAGAAAGCTTGTCTGGTATGGACGTATGAGACTGTTTCGTCATTAACTGGACGTGTTGCTAACCACGTCAGAGACAATGCCATCCCACGGATTCTTAGATGGTCATGT
TCCCACTCGCCCACTTTGGCAGTGCTTAGTAAAGAAGTTTTTGCATCAGACACGGGTCACATTGGAACTTGTGGCCACAGAGGAAGAGGTTCAATTTATGAACCGTGTGA
TGGAGCCGCCCCATGCCCACCTCCTCCTCCTCCTCCTCCAGCTCCAAAACTTCCTGGTATGAATGTTGACGATGTAGATGTTGAGACTCATGATAGGACGGAGGATGTTG
GGACTAGTTCTAAGGCTCCTGACCGAACTTGCAAGAAGTGTAGACTCCTTGATAGTCGTGTTGAGGGCATTGAAAATGCTGTCAAGGAGTTAAATGGAAATATGAAGGGA
ATTGAAAGAGACCTGAAGGCAATAAAGAAGTTCATGCGTCGATTGTCTAAGGTTAATATGTGTTGTGTTTCATACAGAGAAAGTTCGTTGACGCATGCAAGATCTGGACC
TGGCGCGAAAGGAGCAGACACCCCCTTGAATGTGGCTACTAGTTCAGCTGTCCAGACTGACACACAACAAAAATCTCCAGTAATGGAAGGGACCACGGGTGGTGGTGGTG
GTCCAGTTCAAGGTTGGCAAGGAACTACAAGTCACCAGGGTCAAGCTGCCCTGGGTGTCCAGTCTACTTCACAACAAAACGAGCCCATTGAACGACGGGGGACTCGGAAG
AGGAAGACTGCATGGAAGTTGAGAACTCCATGGAAAGACACAAGGGAAGACGGGAAGAGGCGAAAGGTCCTGACGTACAATCCTATCCCGGAGATCCCTGAAGATTGGTC
TACGAAATTCAAGACATGGTTAGACAGTGAGGACCCAAAAGATCGTGTCCGAAGGACCGAATATGGTGTTACAGACAAGTCGTGGTTCGTAGACCTTCTAACTCCATCTA
AATGGATGACCGATGAGGTTATCGATTCGATCTTCATGTTTGTCCAAAAGAAGTTCGAACAACGACCACAACTATGCCGTAGAAAGTTCACAACTGCGGATCTATTTGTT
TCGGTGTGTATCAGTCTAAGTTTGTACTTGCCTTATAATCTCGGTGGTCTCCATTGGGTTTTGGTGTGCATTGATTTGGAGGTCGGTGAGGTGGTCGTGTCAAATTCGCT
CAGGGCATTGAACAAGGAAGAGGTGGTTGAGGAGGAGTTAAGGTCCTTTGCCACGTCGTGTCCAATGTACTTTGGAAGATCGGGGCTATGGATTCAAGGAAGGAACTCCC
TGTCGGAAGATGGCCTCTCCGTCTGGAAAAATCAAGGCCGCAGCAGCCTAATAGTGGTGACTGTGGAGTGTTTGTATGTAAATTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGGATTTGAGTTAGATATATTATTCCCCAACCTTCGATTTGAGAATGATGAGGACGCAGTTAAGATAGCGTTGTTTTATTTCATCGAGCTTGCAATGATGGGGAG
AGAGAGAAAACAACAAATGGATACAAGCCTGCTAGGTATAATTGATGAGTGGGAGAGACTTTGCAATGAAGATTGGAGCAAAATCATATTTGATAATACCATTAAGTCAT
TGAAGAAAGCTTGTCTGGTATGGACGTATGAGACTGTTTCGTCATTAACTGGACGTGTTGCTAACCACGTCAGAGACAATGCCATCCCACGGATTCTTAGATGGTCATGT
TCCCACTCGCCCACTTTGGCAGTGCTTAGTAAAGAAGTTTTTGCATCAGACACGGGTCACATTGGAACTTGTGGCCACAGAGGAAGAGGTTCAATTTATGAACCGTGTGA
TGGAGCCGCCCCATGCCCACCTCCTCCTCCTCCTCCTCCAGCTCCAAAACTTCCTGGTATGAATGTTGACGATGTAGATGTTGAGACTCATGATAGGACGGAGGATGTTG
GGACTAGTTCTAAGGCTCCTGACCGAACTTGCAAGAAGTGTAGACTCCTTGATAGTCGTGTTGAGGGCATTGAAAATGCTGTCAAGGAGTTAAATGGAAATATGAAGGGA
ATTGAAAGAGACCTGAAGGCAATAAAGAAGTTCATGCGTCGATTGTCTAAGGTTAATATGTGTTGTGTTTCATACAGAGAAAGTTCGTTGACGCATGCAAGATCTGGACC
TGGCGCGAAAGGAGCAGACACCCCCTTGAATGTGGCTACTAGTTCAGCTGTCCAGACTGACACACAACAAAAATCTCCAGTAATGGAAGGGACCACGGGTGGTGGTGGTG
GTCCAGTTCAAGGTTGGCAAGGAACTACAAGTCACCAGGGTCAAGCTGCCCTGGGTGTCCAGTCTACTTCACAACAAAACGAGCCCATTGAACGACGGGGGACTCGGAAG
AGGAAGACTGCATGGAAGTTGAGAACTCCATGGAAAGACACAAGGGAAGACGGGAAGAGGCGAAAGGTCCTGACGTACAATCCTATCCCGGAGATCCCTGAAGATTGGTC
TACGAAATTCAAGACATGGTTAGACAGTGAGGACCCAAAAGATCGTGTCCGAAGGACCGAATATGGTGTTACAGACAAGTCGTGGTTCGTAGACCTTCTAACTCCATCTA
AATGGATGACCGATGAGGTTATCGATTCGATCTTCATGTTTGTCCAAAAGAAGTTCGAACAACGACCACAACTATGCCGTAGAAAGTTCACAACTGCGGATCTATTTGTT
TCGGTGTGTATCAGTCTAAGTTTGTACTTGCCTTATAATCTCGGTGGTCTCCATTGGGTTTTGGTGTGCATTGATTTGGAGGTCGGTGAGGTGGTCGTGTCAAATTCGCT
CAGGGCATTGAACAAGGAAGAGGTGGTTGAGGAGGAGTTAAGGTCCTTTGCCACGTCGTGTCCAATGTACTTTGGAAGATCGGGGCTATGGATTCAAGGAAGGAACTCCC
TGTCGGAAGATGGCCTCTCCGTCTGGAAAAATCAAGGCCGCAGCAGCCTAATAGTGGTGACTGTGGAGTGTTTGTATGTAAATTTTTAG
Protein sequenceShow/hide protein sequence
MKGFELDILFPNLRFENDEDAVKIALFYFIELAMMGRERKQQMDTSLLGIIDEWERLCNEDWSKIIFDNTIKSLKKACLVWTYETVSSLTGRVANHVRDNAIPRILRWSC
SHSPTLAVLSKEVFASDTGHIGTCGHRGRGSIYEPCDGAAPCPPPPPPPPAPKLPGMNVDDVDVETHDRTEDVGTSSKAPDRTCKKCRLLDSRVEGIENAVKELNGNMKG
IERDLKAIKKFMRRLSKVNMCCVSYRESSLTHARSGPGAKGADTPLNVATSSAVQTDTQQKSPVMEGTTGGGGGPVQGWQGTTSHQGQAALGVQSTSQQNEPIERRGTRK
RKTAWKLRTPWKDTREDGKRRKVLTYNPIPEIPEDWSTKFKTWLDSEDPKDRVRRTEYGVTDKSWFVDLLTPSKWMTDEVIDSIFMFVQKKFEQRPQLCRRKFTTADLFV
SVCISLSLYLPYNLGGLHWVLVCIDLEVGEVVVSNSLRALNKEEVVEEELRSFATSCPMYFGRSGLWIQGRNSLSEDGLSVWKNQGRSSLIVVTVECLYVNF