CuGenDBv2

Lag0010682 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0010682
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: ULP_PROTEASE domain-containing protein
Genome location: chr1:3763433..3772183
RNA-Seq Expression: Lag0010682
Synteny: Lag0010682
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]; e value: 2.8e-40; % identity: 35.14
Query:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS
        ELAA ++G+DILTEALGT EH   VRGVGEFVSP +YFN+    S+ +    N +    S P     +G+ I +    + +    +M     H    SV 
Subjt:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS

Query:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA
        N   V +     ++       +V G  L ++  ++        MV+ V             I IPV GEIE L+Q  G FVAWPR LVIL+ +K +SS  
Subjt:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA

Query:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------
          +     TQ SKHTD+HV+IKLLNRYV+LSM+ +DT+ I LS  IFG++K I+L R+DIM Y  M+E+ +                             
Subjt:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------

Query:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT
                                            HWMLI+IN  EN +YVL+SLR K++E +Q +INT
Subjt:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]; e value: 2.8e-40; % identity: 35.14
Query:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS
        ELAA ++G+DILTEALGT EH   VRGVGEFVSP +YFN+    S+ +    N +    S P     +G+ I +    + +    +M     H    SV 
Subjt:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS

Query:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA
        N   V +     ++       +V G  L ++  ++        MV+ V             I IPV GEIE L+Q  G FVAWPR LVIL+ +K +SS  
Subjt:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA

Query:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------
          +     TQ SKHTD+HV+IKLLNRYV+LSM+ +DT+ I LS  IFG++K I+L R+DIM Y  M+E+ +                             
Subjt:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------

Query:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT
                                            HWMLI+IN  EN +YVL+SLR K++E +Q +INT
Subjt:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]; e value: 2.8e-40; % identity: 35.14
Query:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS
        ELAA ++G+DILTEALGT EH   VRGVGEFVSP +YFN+    S+ +    N +    S P     +G+ I +    + +    +M     H    SV 
Subjt:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS

Query:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA
        N   V +     ++       +V G  L ++  ++        MV+ V             I IPV GEIE L+Q  G FVAWPR LVIL+ +K +SS  
Subjt:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA

Query:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------
          +     TQ SKHTD+HV+IKLLNRYV+LSM+ +DT+ I LS  IFG++K I+L R+DIM Y  M+E+ +                             
Subjt:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------

Query:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT
                                            HWMLI+IN  EN +YVL+SLR K++E +Q +INT
Subjt:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]; e value: 1.2e-38; % identity: 32.75
Query:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRPQEGRLIESACRPLSLDTPARMSPTWTHWINTSVSNTKR
        ELAA  +GQDILTEALGTPEHR  +RGVGEFVSP +++N+     +      N     +S+ ++             +T        T    +SV   K 
Subjt:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRPQEGRLIESACRPLSLDTPARMSPTWTHWINTSVSNTKR

Query:  VVSYSLTRIRRSVRVRRSVRGNQLVI--------------------NV----TQLRQKKKCYGMVEF------VKWGVFVGFRGDVLISIPVVGEIEMLS
               +  R+V+ R+ V   ++V+                    N+    T      +C  + E       V+  V +    DV + IP   +I+ L 
Subjt:  VVSYSLTRIRRSVRVRRSVRGNQLVI--------------------NV----TQLRQKKKCYGMVEF------VKWGVFVGFRGDVLISIPVVGEIEMLS

Query:  QVKGSFVAWPRELVILNNQKKVSSPAKPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF---
        Q  G+FVAWPR+LVI   +KK  SP   K    I QSSK+TD+HVTIKLLNRY + SM+ DD I I LS+ I G++KTI+L RDDI+ Y GM E+ +   
Subjt:  QVKGSFVAWPRELVILNNQKKVSSPAKPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF---

Query:  ---------------------------------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT
                                                                       HW+LI+IN  EN +YV++SLRSK+ E FQG+INT
Subjt:  ---------------------------------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]; e value: 9.1e-39; % identity: 32.83
Query:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRPQEGRLIESACRPLSLDTPARMSPTWTHWINTSVSNTKR
        ELAA  +GQDILTEALGTPEHR  +RGVGEFVSP +++N+     +      N     +S+ ++             +T        T    +SV   K 
Subjt:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRPQEGRLIESACRPLSLDTPARMSPTWTHWINTSVSNTKR

Query:  VVSYSLTRIRRSVRVRRSVRGNQLVI--------------------NV----TQLRQKKKCYGMVEF------VKWGVFVGFRGDVLISIPVVGEIEMLS
               +  R+V+ R+ V   ++V+                    N+    T      +C  + E       V+  V +    DV + IP   +I+ L 
Subjt:  VVSYSLTRIRRSVRVRRSVRGNQLVI--------------------NV----TQLRQKKKCYGMVEF------VKWGVFVGFRGDVLISIPVVGEIEMLS

Query:  QVKGSFVAWPRELVILNNQKKVSSPAKPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF---
        Q  G+FVAWPR+LVI   +KK  SP   K    I QSSK+TD+HVTIKLLNRY + SM+ DD I I LS+ I G++KTI+L RDDI+ Y GM E+ +   
Subjt:  QVKGSFVAWPRELVILNNQKKVSSPAKPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF---

Query:  --------------------------------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT
                                                                      HW+LI+IN  EN +YV++SLRSK+ E FQG+INT
Subjt:  --------------------------------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X1; e value: 1.2e-36; % identity: 30.42
Query:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRPQEGRLIESACRPLSLDTPARMSPTWTHWINTSVSNTKR
        ELAA  +GQDILTEALGTPEHR  +RGVGEFVSP ++ N+   N +      +     +S+ +     E+       +T        T    +SVS  K+
Subjt:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRPQEGRLIESACRPLSLDTPARMSPTWTHWINTSVSNTKR

Query:  VVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKK------CYGMV-----------------------------EFVKWGVFVGFRGDVLISIPVVGE
             + + ++  + +  V+ ++  + V  L++ +       C+  +                             E ++  V +    DV + IP+ G+
Subjt:  VVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKK------CYGMV-----------------------------EFVKWGVFVGFRGDVLISIPVVGE

Query:  IEMLSQVKGSFVAWPRELVILNNQKKVSSPAKPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVEL
        IE L+Q  G+FVAWPR+LVI+  +KK  S    +     TQSSK+TD+HVTIKLLNRY + +M+ +D I I LS+ IFG++KTI+L RDDI+ Y GM E+
Subjt:  IEMLSQVKGSFVAWPRELVILNNQKKVSSPAKPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVEL

Query:  WF------------------------------------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGII
         +                                                                  HW+LI+I+  EN +YV++ LRSK+   FQG+I
Subjt:  WF------------------------------------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGII

Query:  N
        N
Subjt:  N

A0A5D3CYL9 ULP_PROTEASE domain-containing protein; e value: 1.2e-36; % identity: 30.42
Query:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRPQEGRLIESACRPLSLDTPARMSPTWTHWINTSVSNTKR
        ELAA  +GQDILTEALGTPEHR  +RGVGEFVSP ++ N+   N +      +     +S+ +     E+       +T        T    +SVS  K+
Subjt:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRPQEGRLIESACRPLSLDTPARMSPTWTHWINTSVSNTKR

Query:  VVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKK------CYGMV-----------------------------EFVKWGVFVGFRGDVLISIPVVGE
             + + ++  + +  V+ ++  + V  L++ +       C+  +                             E ++  V +    DV + IP+ G+
Subjt:  VVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKK------CYGMV-----------------------------EFVKWGVFVGFRGDVLISIPVVGE

Query:  IEMLSQVKGSFVAWPRELVILNNQKKVSSPAKPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVEL
        IE L+Q  G+FVAWPR+LVI+  +KK  S    +     TQSSK+TD+HVTIKLLNRY + +M+ +D I I LS+ IFG++KTI+L RDDI+ Y GM E+
Subjt:  IEMLSQVKGSFVAWPRELVILNNQKKVSSPAKPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVEL

Query:  WF------------------------------------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGII
         +                                                                  HW+LI+I+  EN +YV++ LRSK+   FQG+I
Subjt:  WF------------------------------------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGII

Query:  N
        N
Subjt:  N

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X1; e value: 1.4e-40; % identity: 35.14
Query:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS
        ELAA ++G+DILTEALGT EH   VRGVGEFVSP +YFN+    S+ +    N +    S P     +G+ I +    + +    +M     H    SV 
Subjt:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS

Query:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA
        N   V +     ++       +V G  L ++  ++        MV+ V             I IPV GEIE L+Q  G FVAWPR LVIL+ +K +SS  
Subjt:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA

Query:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------
          +     TQ SKHTD+HV+IKLLNRYV+LSM+ +DT+ I LS  IFG++K I+L R+DIM Y  M+E+ +                             
Subjt:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------

Query:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT
                                            HWMLI+IN  EN +YVL+SLR K++E +Q +INT
Subjt:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X4; e value: 1.4e-40; % identity: 35.14
Query:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS
        ELAA ++G+DILTEALGT EH   VRGVGEFVSP +YFN+    S+ +    N +    S P     +G+ I +    + +    +M     H    SV 
Subjt:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS

Query:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA
        N   V +     ++       +V G  L ++  ++        MV+ V             I IPV GEIE L+Q  G FVAWPR LVIL+ +K +SS  
Subjt:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA

Query:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------
          +     TQ SKHTD+HV+IKLLNRYV+LSM+ +DT+ I LS  IFG++K I+L R+DIM Y  M+E+ +                             
Subjt:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------

Query:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT
                                            HWMLI+IN  EN +YVL+SLR K++E +Q +INT
Subjt:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X2; e value: 1.4e-40; % identity: 35.14
Query:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS
        ELAA ++G+DILTEALGT EH   VRGVGEFVSP +YFN+    S+ +    N +    S P     +G+ I +    + +    +M     H    SV 
Subjt:  ELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRSRP----QEGRLIESACRPLSLDTPARMSPTWTHWINTSVS

Query:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA
        N   V +     ++       +V G  L ++  ++        MV+ V             I IPV GEIE L+Q  G FVAWPR LVIL+ +K +SS  
Subjt:  NTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLSQVKGSFVAWPRELVILNNQKKVSSPA

Query:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------
          +     TQ SKHTD+HV+IKLLNRYV+LSM+ +DT+ I LS  IFG++K I+L R+DIM Y  M+E+ +                             
Subjt:  KPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWF-----------------------------

Query:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT
                                            HWMLI+IN  EN +YVL+SLR K++E +Q +INT
Subjt:  ------------------------------------HWMLIVINPGENTIYVLNSLRSKLEESFQGIINT

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found
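
The middle line of each alignment block above marks identical residues with a letter, conservative substitutions with `+`, and everything else with a space. As a minimal sketch, assuming that conventional BLAST midline encoding, the counts for one block can be tallied as follows. The function name `midline_stats` is hypothetical, and the result covers only the segment passed in, so it will generally differ from the whole-alignment % identity listed for each hit.

```python
def midline_stats(midline: str) -> dict:
    """Tally identities and positives from one BLAST-style midline segment.

    Assumes the usual convention: a letter marks an identical residue,
    '+' marks a conservative substitution, a space marks a mismatch or gap.
    """
    length = len(midline)
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    pct_identity = 100.0 * identities / length if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": pct_identity,
    }


# Midline of the final block of the XP_022136076.1 alignment, copied without
# the leading indentation so that it lines up with the query residues.
LAST_BLOCK_MIDLINE = "HWMLI+IN  EN +YVL+SLR K++E +Q +INT"

if __name__ == "__main__":
    print(midline_stats(LAST_BLOCK_MIDLINE))
```

Leading indentation has to be stripped before calling the function, and internal spaces must be kept, because each space stands for one aligned column.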

Sequences
CDS sequence
ATGATTGTGAGGGCACAGGGATTCATAACTAATGTATTGAATGCGATTGTTACTATGAGTGGTTCAAGCAGCAACAGTAATGATGAAGTAAACGTTGCTATTCATATGGA
AGTCAGACTGAATGCTGGACGAGGTCTCACTATTATGCTCGAGCTAGCTGCAACAAATCAAGGTCAAGATATATTAACTGAAGCATTAGGCACACCAGAACATAGAAGGC
ATGTTAGAGGAGTGGGTGAGTTTGTTTCACCATTCATCTACTTCAATCTAAAGGATTGGAATTCCCAATTCCGTTACAGCGGAAGCAATTGGACCCCCCCCACTCGGTCT
CGTCCCCAAGAAGGTAGGCTTATTGAGTCGGCGTGCAGGCCACTCTCACTCGATACCCCCGCTCGCATGTCTCCTACATGGACGCATTGGATCAATACGTCTGTATCGAA
TACAAAGCGGGTCGTATCATATAGTCTTACCAGGATAAGAAGAAGTGTTAGAGTCAGAAGAAGTGTTAGAGGTAATCAACTAGTAATCAATGTCACCCAGTTGCGTCAGA
AGAAGAAGTGTTATGGAATGGTAGAATTCGTTAAATGGGGTGTTTTTGTTGGCTTTCGAGGAGATGTTCTCATATCAATTCCTGTGGTTGGAGAAATAGAGATGCTTAGT
CAAGTAAAAGGTAGCTTCGTGGCATGGCCTCGCGAGCTTGTGATTTTGAATAACCAGAAAAAGGTATCTTCTCCCGCAAAACCTAAAATGAATGTGCCCATTACACAATC
TTCCAAACATACAGATATCCACGTTACCATTAAGTTGCTGAATCGATATGTCGTTCTTTCCATGGAAGAGGATGACACAATTCATATCAAGTTGAGTGACACAATTTTTG
GAGAGGATAAAACAATTTTCCTACATCGTGATGACATCATGCATTATTACGGGATGGTTGAGTTATGGTTTCATTGGATGTTGATTGTGATCAATCCTGGAGAGAATACC
ATTTATGTATTGAACTCATTACGTAGTAAGCTTGAAGAAAGTTTTCAAGGAATTATCAATACGATCGCGATGCGCGCCCTAGCAGATTCGGAAAGACTCGCAGCGTCGAG
ACGCTGCCTCAAATTCACACCTTCTAACTTGTGGGGCGGCGTTGAGACGCTAGCTATGGGGCGTCTCAACGCTACCCTTTCTTTGTAA
mRNA sequence
ATGATTGTGAGGGCACAGGGATTCATAACTAATGTATTGAATGCGATTGTTACTATGAGTGGTTCAAGCAGCAACAGTAATGATGAAGTAAACGTTGCTATTCATATGGA
AGTCAGACTGAATGCTGGACGAGGTCTCACTATTATGCTCGAGCTAGCTGCAACAAATCAAGGTCAAGATATATTAACTGAAGCATTAGGCACACCAGAACATAGAAGGC
ATGTTAGAGGAGTGGGTGAGTTTGTTTCACCATTCATCTACTTCAATCTAAAGGATTGGAATTCCCAATTCCGTTACAGCGGAAGCAATTGGACCCCCCCCACTCGGTCT
CGTCCCCAAGAAGGTAGGCTTATTGAGTCGGCGTGCAGGCCACTCTCACTCGATACCCCCGCTCGCATGTCTCCTACATGGACGCATTGGATCAATACGTCTGTATCGAA
TACAAAGCGGGTCGTATCATATAGTCTTACCAGGATAAGAAGAAGTGTTAGAGTCAGAAGAAGTGTTAGAGGTAATCAACTAGTAATCAATGTCACCCAGTTGCGTCAGA
AGAAGAAGTGTTATGGAATGGTAGAATTCGTTAAATGGGGTGTTTTTGTTGGCTTTCGAGGAGATGTTCTCATATCAATTCCTGTGGTTGGAGAAATAGAGATGCTTAGT
CAAGTAAAAGGTAGCTTCGTGGCATGGCCTCGCGAGCTTGTGATTTTGAATAACCAGAAAAAGGTATCTTCTCCCGCAAAACCTAAAATGAATGTGCCCATTACACAATC
TTCCAAACATACAGATATCCACGTTACCATTAAGTTGCTGAATCGATATGTCGTTCTTTCCATGGAAGAGGATGACACAATTCATATCAAGTTGAGTGACACAATTTTTG
GAGAGGATAAAACAATTTTCCTACATCGTGATGACATCATGCATTATTACGGGATGGTTGAGTTATGGTTTCATTGGATGTTGATTGTGATCAATCCTGGAGAGAATACC
ATTTATGTATTGAACTCATTACGTAGTAAGCTTGAAGAAAGTTTTCAAGGAATTATCAATACGATCGCGATGCGCGCCCTAGCAGATTCGGAAAGACTCGCAGCGTCGAG
ACGCTGCCTCAAATTCACACCTTCTAACTTGTGGGGCGGCGTTGAGACGCTAGCTATGGGGCGTCTCAACGCTACCCTTTCTTTGTAA
Protein sequence
MIVRAQGFITNVLNAIVTMSGSSSNSNDEVNVAIHMEVRLNAGRGLTIMLELAATNQGQDILTEALGTPEHRRHVRGVGEFVSPFIYFNLKDWNSQFRYSGSNWTPPTRS
RPQEGRLIESACRPLSLDTPARMSPTWTHWINTSVSNTKRVVSYSLTRIRRSVRVRRSVRGNQLVINVTQLRQKKKCYGMVEFVKWGVFVGFRGDVLISIPVVGEIEMLS
QVKGSFVAWPRELVILNNQKKVSSPAKPKMNVPITQSSKHTDIHVTIKLLNRYVVLSMEEDDTIHIKLSDTIFGEDKTIFLHRDDIMHYYGMVELWFHWMLIVINPGENT
IYVLNSLRSKLEESFQGIINTIAMRALADSERLAASRRCLKFTPSNLWGGVETLAMGRLNATLSL
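
The protein sequence above should correspond to the CDS read codon by codon. A minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; the helper name `translate` is hypothetical and is not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame coding sequence into a protein string.

    Whitespace and line breaks are ignored, so the wrapped sequence can be
    pasted in as-is. Codons containing unknown bases become 'X'. With
    to_stop=True, translation ends at the first stop codon and the stop
    itself is not emitted.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First ten codons of the CDS listed above.
    print(translate("ATGATTGTGAGGGCACAGGGATTCATAACT"))
```

Passing the full CDS and comparing the result with the listed protein sequence is a quick way to confirm that the two records agree.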