; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003222 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003222
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationscaffold4:31196179..31205002
RNA-Seq ExpressionSpg003222
SyntenySpg003222
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]1.9e-2630Show/hide
Query:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA
        W++FV ARLSEE++ LS+     +     +      G+          S PS   +   EA     G  + +F       +   DEL      +DILTEA
Subjt:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA

Query:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-
        LGT EH G VRGVG+FVSP  YFNVV+ KSK      N S+T   +    K + KEI +              GKPC LAV S+ +I A GT+ ++    
Subjt:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-

Query:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM
              PLG   V                                VAWPR LV L ++K                                  +  +D +
Subjt:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM

Query:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM
         + +S  IF   K +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E      F IVD   I+P   ++E R + L      V   Q+
Subjt:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]1.9e-2630Show/hide
Query:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA
        W++FV ARLSEE++ LS+     +     +      G+          S PS   +   EA     G  + +F       +   DEL      +DILTEA
Subjt:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA

Query:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-
        LGT EH G VRGVG+FVSP  YFNVV+ KSK      N S+T   +    K + KEI +              GKPC LAV S+ +I A GT+ ++    
Subjt:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-

Query:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM
              PLG   V                                VAWPR LV L ++K                                  +  +D +
Subjt:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM

Query:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM
         + +S  IF   K +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E      F IVD   I+P   ++E R + L      V   Q+
Subjt:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]1.9e-2630Show/hide
Query:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA
        W++FV ARLSEE++ LS+     +     +      G+          S PS   +   EA     G  + +F       +   DEL      +DILTEA
Subjt:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA

Query:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-
        LGT EH G VRGVG+FVSP  YFNVV+ KSK      N S+T   +    K + KEI +              GKPC LAV S+ +I A GT+ ++    
Subjt:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-

Query:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM
              PLG   V                                VAWPR LV L ++K                                  +  +D +
Subjt:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM

Query:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM
         + +S  IF   K +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E      F IVD   I+P   ++E R + L      V   Q+
Subjt:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]1.9e-2630Show/hide
Query:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA
        W++FV ARLSEE++ LS+     +     +      G+          S PS   +   EA     G  + +F       +   DEL      +DILTEA
Subjt:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA

Query:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-
        LGT EH G VRGVG+FVSP  YFNVV+ KSK      N S+T   +    K + KEI +              GKPC LAV S+ +I A GT+ ++    
Subjt:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-

Query:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM
              PLG   V                                VAWPR LV L ++K                                  +  +D +
Subjt:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM

Query:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM
         + +S  IF   K +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E      F IVD   I+P   ++E R + L      V   Q+
Subjt:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]4.7e-2527.73Show/hide
Query:  WDAFVRARLSEEFQKLS---QRRDATQTASRRYRNSGFF-LADHLASRRSSPSVSTL--GLLEALNTDFGLLSSFFWAFWASTSNDELQQNDPDKDILTE
        W++FV+ARLSEE++  S   + R A    +      G+  LA  L       + +TL     +  N ++  +++   A       DEL      +DILTE
Subjt:  WDAFVRARLSEEFQKLS---QRRDATQTASRRYRNSGFF-LADHLASRRSSPSVSTL--GLLEALNTDFGLLSSFFWAFWASTSNDELQQNDPDKDILTE

Query:  ALGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKLANDSSTPIQSTQLMKIED------------------------------------KEIPHGK------
        ALGTPEH G +RGVG+FVSP  ++NV + K KL  +S    ++ Q    ++                                    K++P GK      
Subjt:  ALGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKLANDSSTPIQSTQLMKIED------------------------------------KEIPHGK------

Query:  -------PCKLAVGSISHIGATGTMLESEAS-------PLGPTGVA-------------------------------VAWPRALVALCKDK---------
               PC LA+GS+ +I A GTM ES+A        PLGP  V                                VAWPR LV   K+K         
Subjt:  -------PCKLAVGSISHIGATGTMLESEAS-------PLGPTGVA-------------------------------VAWPRALVALCKDK---------

Query:  -------------------------LGKDDLMRVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVF
                                 +  DD++++ +S++I    KT+YL  DDI+Q+C M EI  +C+L YIA LW +  ++     F IVD   I+   
Subjt:  -------------------------LGKDDLMRVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVF

Query:  GTKESRAKGLTTVFSSVQPGQM
          +E R+K L      V   Q+
Subjt:  GTKESRAKGLTTVFSSVQPGQM

TrEMBL top hitse value%identityAlignment
A0A5A7UV39 Transposase2.8e-2328.77Show/hide
Query:  DWDAFVRARLSEE---FQKLSQRRDATQTASRRYRNSGFF-LADHLASRRSSPSVSTL--GLLEALNTDFGLLSSFFWAFWASTSNDELQQNDPDKDILT
        DW++FV ARLSEE   + ++ + R      +      G+  LAD L         STL     +  N D+         F  +T +    +N+   DILT
Subjt:  DWDAFVRARLSEE---FQKLSQRRDATQTASRRYRNSGFF-LADHLASRRSSPSVSTL--GLLEALNTDFGLLSSFFWAFWASTSNDELQQNDPDKDILT

Query:  EALGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----------------ANDSSTPIQSTQLMKIEDKEIP-----HGKPCKLAVGSISHIGATGTMLE
        +ALG+ EHGG VRGVG FVS   YFN V+ K K+                +N S + I S  +    D++ P      G PC+L++GSI++I A  T+ +
Subjt:  EALGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----------------ANDSSTPIQSTQLMKIEDKEIP-----HGKPCKLAVGSISHIGATGTMLE

Query:  SEASPLGPT-GVAVAWPRALVALCKDK----LGKD----------------------------DLMRVPISDRIFTANKTLYLMLDDIMQFCSMVEISNT
         +   L    G  + WPR LV+   DK      KD                            D++R+P+++ IF ++K +YL  +D++ +C MVEI   
Subjt:  SEASPLGPT-GVAVAWPRALVALCKDK----LGKD----------------------------DLMRVPISDRIFTANKTLYLMLDDIMQFCSMVEISNT

Query:  CVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSV
        C+L YI  L     +      F ++D + I+     ++ R++ L     +V
Subjt:  CVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X19.3e-2730Show/hide
Query:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA
        W++FV ARLSEE++ LS+     +     +      G+          S PS   +   EA     G  + +F       +   DEL      +DILTEA
Subjt:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA

Query:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-
        LGT EH G VRGVG+FVSP  YFNVV+ KSK      N S+T   +    K + KEI +              GKPC LAV S+ +I A GT+ ++    
Subjt:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-

Query:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM
              PLG   V                                VAWPR LV L ++K                                  +  +D +
Subjt:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM

Query:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM
         + +S  IF   K +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E      F IVD   I+P   ++E R + L      V   Q+
Subjt:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X49.3e-2730Show/hide
Query:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA
        W++FV ARLSEE++ LS+     +     +      G+          S PS   +   EA     G  + +F       +   DEL      +DILTEA
Subjt:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA

Query:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-
        LGT EH G VRGVG+FVSP  YFNVV+ KSK      N S+T   +    K + KEI +              GKPC LAV S+ +I A GT+ ++    
Subjt:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-

Query:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM
              PLG   V                                VAWPR LV L ++K                                  +  +D +
Subjt:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM

Query:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM
         + +S  IF   K +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E      F IVD   I+P   ++E R + L      V   Q+
Subjt:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM

A0A6J1C398 uncharacterized protein LOC111007859 isoform X39.3e-2730Show/hide
Query:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA
        W++FV ARLSEE++ LS+     +     +      G+          S PS   +   EA     G  + +F       +   DEL      +DILTEA
Subjt:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA

Query:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-
        LGT EH G VRGVG+FVSP  YFNVV+ KSK      N S+T   +    K + KEI +              GKPC LAV S+ +I A GT+ ++    
Subjt:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-

Query:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM
              PLG   V                                VAWPR LV L ++K                                  +  +D +
Subjt:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM

Query:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM
         + +S  IF   K +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E      F IVD   I+P   ++E R + L      V   Q+
Subjt:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X29.3e-2730Show/hide
Query:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA
        W++FV ARLSEE++ LS+     +     +      G+          S PS   +   EA     G  + +F       +   DEL      +DILTEA
Subjt:  WDAFVRARLSEEFQKLSQRRDATQTA---SRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFF--WAFWASTSNDELQQNDPDKDILTEA

Query:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-
        LGT EH G VRGVG+FVSP  YFNVV+ KSK      N S+T   +    K + KEI +              GKPC LAV S+ +I A GT+ ++    
Subjt:  LGTPEHGGCVRGVGDFVSPYTYFNVVRSKSKL----ANDSSTPIQSTQLMKIEDKEIPH--------------GKPCKLAVGSISHIGATGTMLESEAS-

Query:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM
              PLG   V                                VAWPR LV L ++K                                  +  +D +
Subjt:  ------PLGPTGVA-------------------------------VAWPRALVALCKDK----------------------------------LGKDDLM

Query:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM
         + +S  IF   K +YL  +DIMQ+C+M+EI  +C+L YIA+LW  + E      F IVD   I+P   ++E R + L      V   Q+
Subjt:  RVPISDRIFTANKTLYLMLDDIMQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATTATATCATGGACTCCTCTGGCAGAAGCAGCAGCGATGAAGGAGATTAGCCTGATAGAAGATTCGTTCCTAGAGGCACCAAGGGGCGCCTTATATAGGCTCGGAAATAG
GACAGCGTCGCGACGCTGTGACGTCACCCAACTTGCTGGCCAAGAAATGGGACAGCGTCTCGACGCTGTGCAGATTCCAGAACAGTCCACGCGGTTTGGGCTTGAAATTT
GGGCCTTTCTTCATTTCTTTTTGGATCTTCTTTGCATGGTTGACTTCTTGGGCTCCCCCTTTGACCCATGGATGGAAATTTGGGCTGATTGGGATGCATTTGTTCGAGCC
AGGTTATCTGAAGAATTTCAGAAGTTGTCGCAGCGTCGCGACGCTACGCAGACAGCGTCGCGACGCTACCGCAATTCTGGATTTTTCCTCGCTGATCACCTAGCGTCCAG
ACGCTCTAGTCCTAGCGTCTCGACGCTAGGCCTTCTAGAAGCCTTAAACACGGATTTTGGCCTCCTTTCTTCTTTCTTTTGGGCTTTTTGGGCCTCTACTTCAAATGATG
AATTACAACAAAACGACCCAGACAAAGATATTCTGACTGAAGCACTCGGGACACCTGAACATGGTGGCTGTGTCAGAGGAGTGGGGGATTTTGTGTCGCCATATACGTAC
TTCAATGTTGTGCGATCTAAATCGAAGTTGGCGAATGATTCATCAACGCCGATTCAAAGTACTCAATTAATGAAGATTGAAGACAAAGAAATCCCACATGGAAAACCATG
TAAATTAGCTGTAGGCTCGATATCTCACATTGGTGCAACGGGCACAATGCTCGAAAGTGAGGCAAGTCCATTAGGTCCCACTGGTGTTGCAGTTGCTTGGCCTCGTGCTT
TGGTTGCTCTATGTAAAGATAAGCTAGGCAAGGATGATCTGATGCGAGTGCCCATTAGCGATAGGATATTTACAGCAAACAAAACATTATATCTCATGCTCGATGATATA
ATGCAATTTTGTAGTATGGTCGAGATATCAAACACTTGTGTATTGGTCTATATTGCGTTCCTTTGGACGCATTTTAAGGAGACTAGTAGACTAGACATGTTTAAGATCGT
GGACTCAAACGACATTGCACCGGTCTTTGGGACCAAGGAAAGCCGTGCAAAAGGTTTAACTACCGTATTTTCTTCAGTACAACCAGGACAAATGAGCATTGAGGGTTTGT
ATGGCACAAAGTACATCGAAGCAACAACGAAAGACGACTTCTTGGATACCTGTAAAGGTAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATTATATCATGGACTCCTCTGGCAGAAGCAGCAGCGATGAAGGAGATTAGCCTGATAGAAGATTCGTTCCTAGAGGCACCAAGGGGCGCCTTATATAGGCTCGGAAATAG
GACAGCGTCGCGACGCTGTGACGTCACCCAACTTGCTGGCCAAGAAATGGGACAGCGTCTCGACGCTGTGCAGATTCCAGAACAGTCCACGCGGTTTGGGCTTGAAATTT
GGGCCTTTCTTCATTTCTTTTTGGATCTTCTTTGCATGGTTGACTTCTTGGGCTCCCCCTTTGACCCATGGATGGAAATTTGGGCTGATTGGGATGCATTTGTTCGAGCC
AGGTTATCTGAAGAATTTCAGAAGTTGTCGCAGCGTCGCGACGCTACGCAGACAGCGTCGCGACGCTACCGCAATTCTGGATTTTTCCTCGCTGATCACCTAGCGTCCAG
ACGCTCTAGTCCTAGCGTCTCGACGCTAGGCCTTCTAGAAGCCTTAAACACGGATTTTGGCCTCCTTTCTTCTTTCTTTTGGGCTTTTTGGGCCTCTACTTCAAATGATG
AATTACAACAAAACGACCCAGACAAAGATATTCTGACTGAAGCACTCGGGACACCTGAACATGGTGGCTGTGTCAGAGGAGTGGGGGATTTTGTGTCGCCATATACGTAC
TTCAATGTTGTGCGATCTAAATCGAAGTTGGCGAATGATTCATCAACGCCGATTCAAAGTACTCAATTAATGAAGATTGAAGACAAAGAAATCCCACATGGAAAACCATG
TAAATTAGCTGTAGGCTCGATATCTCACATTGGTGCAACGGGCACAATGCTCGAAAGTGAGGCAAGTCCATTAGGTCCCACTGGTGTTGCAGTTGCTTGGCCTCGTGCTT
TGGTTGCTCTATGTAAAGATAAGCTAGGCAAGGATGATCTGATGCGAGTGCCCATTAGCGATAGGATATTTACAGCAAACAAAACATTATATCTCATGCTCGATGATATA
ATGCAATTTTGTAGTATGGTCGAGATATCAAACACTTGTGTATTGGTCTATATTGCGTTCCTTTGGACGCATTTTAAGGAGACTAGTAGACTAGACATGTTTAAGATCGT
GGACTCAAACGACATTGCACCGGTCTTTGGGACCAAGGAAAGCCGTGCAAAAGGTTTAACTACCGTATTTTCTTCAGTACAACCAGGACAAATGAGCATTGAGGGTTTGT
ATGGCACAAAGTACATCGAAGCAACAACGAAAGACGACTTCTTGGATACCTGTAAAGGTAAGTAG
Protein sequenceShow/hide protein sequence
IISWTPLAEAAAMKEISLIEDSFLEAPRGALYRLGNRTASRRCDVTQLAGQEMGQRLDAVQIPEQSTRFGLEIWAFLHFFLDLLCMVDFLGSPFDPWMEIWADWDAFVRA
RLSEEFQKLSQRRDATQTASRRYRNSGFFLADHLASRRSSPSVSTLGLLEALNTDFGLLSSFFWAFWASTSNDELQQNDPDKDILTEALGTPEHGGCVRGVGDFVSPYTY
FNVVRSKSKLANDSSTPIQSTQLMKIEDKEIPHGKPCKLAVGSISHIGATGTMLESEASPLGPTGVAVAWPRALVALCKDKLGKDDLMRVPISDRIFTANKTLYLMLDDI
MQFCSMVEISNTCVLVYIAFLWTHFKETSRLDMFKIVDSNDIAPVFGTKESRAKGLTTVFSSVQPGQMSIEGLYGTKYIEATTKDDFLDTCKGK