; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021083 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021083
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposase
Genome locationscaffold9:4175155..4177602
RNA-Seq ExpressionSpg021083
SyntenySpg021083
Gene Ontology termsNA
InterPro domainsIPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0066848.1 uncharacterized protein E6C27_scaffold271G002180 [Cucumis melo var. makuwa]6.8e-3230.73Show/hide
Query:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSKLAHDSSTSVQSTQMKTEDKEIPHVS------------DQNTKDVKEISPVSTQ
        DEL   + ++DILT+ALG+ EH GRVRGVG FVS   YFN V+ K K+ H        ++ K++ K   H              D++T   K +     Q
Subjt:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSKLAHDSSTSVQSTQMKTEDKEIPHVS------------DQNTKDVKEISPVSTQ

Query:  KTEHVKRLHR-EASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVI---------------
         +  +  ++   A  T+    +G  N KV VD  +V  EN  IP PVKG+I+ L+Q++G  + WP  LV+   DK   S RK V+               
Subjt:  KTEHVKRLHR-EASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVI---------------

Query:  -KHSFPNSTTTH--------------------PSDIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVFSSVQ
         +H+  N                           D++ +C MVEI   C+L     LW   ++      F V+D + I+     +   +R L     +V 
Subjt:  -KHSFPNSTTTH--------------------PSDIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVFSSVQ

Query:  PGQMVFLTYNIGALRVCMAESTSKQLQKT-SWIPVKCPRQQGCVECGYYVMKFMREILHNPEKPIIALLAV
          Q V + YN   L+   A+   ++ + T  W PVKCPRQ   V CGYYV K++ EI+HN    I  L+ +
Subjt:  PGQMVFLTYNIGALRVCMAESTSKQLQKT-SWIPVKCPRQQGCVECGYYVMKFMREILHNPEKPIIALLAV

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]9.2e-3733.33Show/hide
Query:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT
        DEL      +DILTEALGT EH+GRVRGVG+FVSP  YFNVV+ KSK   L  + ST+  S  ++ K++ KEI +V ++     +   E  P  ++ +  
Subjt:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT

Query:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS
        +++  +     +     TV+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWP  LV L ++K+  S R +  +      T  H S
Subjt:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS

Query:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF
                                              DIMQ+C+M+EI  +C+L   A LW  +E      +F +VD   I+P   +Q+   R L    
Subjt:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF

Query:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHN
          V   Q+V + Y  G         LR   V + +S  +++Q+                         T W  +KCP Q G VECGYYV K++REI+ N
Subjt:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHN

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]9.2e-3733.33Show/hide
Query:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT
        DEL      +DILTEALGT EH+GRVRGVG+FVSP  YFNVV+ KSK   L  + ST+  S  ++ K++ KEI +V ++     +   E  P  ++ +  
Subjt:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT

Query:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS
        +++  +     +     TV+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWP  LV L ++K+  S R +  +      T  H S
Subjt:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS

Query:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF
                                              DIMQ+C+M+EI  +C+L   A LW  +E      +F +VD   I+P   +Q+   R L    
Subjt:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF

Query:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHN
          V   Q+V + Y  G         LR   V + +S  +++Q+                         T W  +KCP Q G VECGYYV K++REI+ N
Subjt:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHN

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]2.6e-3934.84Show/hide
Query:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT
        DEL      +DILTEALGT EH+GRVRGVG+FVSP  YFNVV+ KSK   L  + ST+  S  ++ K++ KEI +V ++     +   E  P  ++ +  
Subjt:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT

Query:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS
        +++  +     +     TV+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWP  LV L ++K+  S R +  +      T  H S
Subjt:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS

Query:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF
                                              DIMQ+C+M+EI  +C+L   A LW  +E      +F +VD   I+P   +Q+   R L    
Subjt:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF

Query:  SSVQPGQMVFLTYNIG------------ALRVCMAE-STSKQLQKTSWIPVKCPRQQGCVECGYYVMKFMREILHN
          V   Q+V + Y  G            +L++  A+ S  +    T W  +KCP Q G VECGYYV K++REI+ N
Subjt:  SSVQPGQMVFLTYNIG------------ALRVCMAE-STSKQLQKTSWIPVKCPRQQGCVECGYYVMKFMREILHN

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]2.4e-3733.33Show/hide
Query:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT
        DEL      +DILTEALGT EH+GRVRGVG+FVSP  YFNVV+ KSK   L  + ST+  S  ++ K++ KEI +V ++     +   E  P  ++ +  
Subjt:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT

Query:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS
        +++  +     +     TV+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWP  LV L ++K+  S R +  +      T  H S
Subjt:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS

Query:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF
                                              DIMQ+C+M+EI  +C+L   A LW  +E      +F +VD   I+P   +Q+   R L    
Subjt:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF

Query:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHNP
          V   Q+V + Y  G         LR   V + +S  +++Q+                         T W  +KCP Q G VECGYYV K++REI+ N 
Subjt:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHNP

Query:  EKPII
           II
Subjt:  EKPII

TrEMBL top hitse value%identityAlignment
A0A5A7VGF2 DUF4216 domain-containing protein3.3e-3230.73Show/hide
Query:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSKLAHDSSTSVQSTQMKTEDKEIPHVS------------DQNTKDVKEISPVSTQ
        DEL   + ++DILT+ALG+ EH GRVRGVG FVS   YFN V+ K K+ H        ++ K++ K   H              D++T   K +     Q
Subjt:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSKLAHDSSTSVQSTQMKTEDKEIPHVS------------DQNTKDVKEISPVSTQ

Query:  KTEHVKRLHR-EASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVI---------------
         +  +  ++   A  T+    +G  N KV VD  +V  EN  IP PVKG+I+ L+Q++G  + WP  LV+   DK   S RK V+               
Subjt:  KTEHVKRLHR-EASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVI---------------

Query:  -KHSFPNSTTTH--------------------PSDIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVFSSVQ
         +H+  N                           D++ +C MVEI   C+L     LW   ++      F V+D + I+     +   +R L     +V 
Subjt:  -KHSFPNSTTTH--------------------PSDIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVFSSVQ

Query:  PGQMVFLTYNIGALRVCMAESTSKQLQKT-SWIPVKCPRQQGCVECGYYVMKFMREILHNPEKPIIALLAV
          Q V + YN   L+   A+   ++ + T  W PVKCPRQ   V CGYYV K++ EI+HN    I  L+ +
Subjt:  PGQMVFLTYNIGALRVCMAESTSKQLQKT-SWIPVKCPRQQGCVECGYYVMKFMREILHNPEKPIIALLAV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X14.5e-3733.33Show/hide
Query:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT
        DEL      +DILTEALGT EH+GRVRGVG+FVSP  YFNVV+ KSK   L  + ST+  S  ++ K++ KEI +V ++     +   E  P  ++ +  
Subjt:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT

Query:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS
        +++  +     +     TV+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWP  LV L ++K+  S R +  +      T  H S
Subjt:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS

Query:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF
                                              DIMQ+C+M+EI  +C+L   A LW  +E      +F +VD   I+P   +Q+   R L    
Subjt:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF

Query:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHN
          V   Q+V + Y  G         LR   V + +S  +++Q+                         T W  +KCP Q G VECGYYV K++REI+ N
Subjt:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHN

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X41.2e-3733.33Show/hide
Query:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT
        DEL      +DILTEALGT EH+GRVRGVG+FVSP  YFNVV+ KSK   L  + ST+  S  ++ K++ KEI +V ++     +   E  P  ++ +  
Subjt:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT

Query:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS
        +++  +     +     TV+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWP  LV L ++K+  S R +  +      T  H S
Subjt:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS

Query:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF
                                              DIMQ+C+M+EI  +C+L   A LW  +E      +F +VD   I+P   +Q+   R L    
Subjt:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF

Query:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHNP
          V   Q+V + Y  G         LR   V + +S  +++Q+                         T W  +KCP Q G VECGYYV K++REI+ N 
Subjt:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHNP

Query:  EKPII
           II
Subjt:  EKPII

A0A6J1C398 uncharacterized protein LOC111007859 isoform X31.3e-3934.84Show/hide
Query:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT
        DEL      +DILTEALGT EH+GRVRGVG+FVSP  YFNVV+ KSK   L  + ST+  S  ++ K++ KEI +V ++     +   E  P  ++ +  
Subjt:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT

Query:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS
        +++  +     +     TV+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWP  LV L ++K+  S R +  +      T  H S
Subjt:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS

Query:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF
                                              DIMQ+C+M+EI  +C+L   A LW  +E      +F +VD   I+P   +Q+   R L    
Subjt:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF

Query:  SSVQPGQMVFLTYNIG------------ALRVCMAE-STSKQLQKTSWIPVKCPRQQGCVECGYYVMKFMREILHN
          V   Q+V + Y  G            +L++  A+ S  +    T W  +KCP Q G VECGYYV K++REI+ N
Subjt:  SSVQPGQMVFLTYNIG------------ALRVCMAE-STSKQLQKTSWIPVKCPRQQGCVECGYYVMKFMREILHN

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X24.5e-3733.33Show/hide
Query:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT
        DEL      +DILTEALGT EH+GRVRGVG+FVSP  YFNVV+ KSK   L  + ST+  S  ++ K++ KEI +V ++     +   E  P  ++ +  
Subjt:  DELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSK---LAHDSSTSVQS--TQMKTEDKEIPHVSDQ---NTKDVKEISP--VSTQKT

Query:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS
        +++  +     +     TV+GV LG  N +V VD++I   E   IPIPV+GEIE L+Q+IG  VAWP  LV L ++K+  S R +  +      T  H S
Subjt:  EHVKRL-----HREASETVNGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKD--SKRKTVIKHSFPNSTTTHPS

Query:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF
                                              DIMQ+C+M+EI  +C+L   A LW  +E      +F +VD   I+P   +Q+   R L    
Subjt:  --------------------------------------DIMQFCSMVEISNTCVLVCAAILWTHFEETGRLDRFKVVDSNDIAPVFGTQKRCARILTTVF

Query:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHN
          V   Q+V + Y  G         LR   V + +S  +++Q+                         T W  +KCP Q G VECGYYV K++REI+ N
Subjt:  SSVQPGQMVFLTYNIGA--------LR---VCMAESTSKQLQK-------------------------TSWIPVKCPRQQGCVECGYYVMKFMREILHN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAACGATTCCACCCGATGAATTACAACAAAGAGACTCAGACAAAGATATTTTGACTGAAGCATTGGGGACACCTGAACATGCTGGTCGTGTCAGAGGAGTGGGAGA
TTTTGTGTCGCCATATACGTACTTCAATGTTGTGCGATCTAAATCAAAGTTGGCGCACGATTCATCAACGTCGGTTCAAAGTACTCAAATGAAGACTGAAGACAAAGAAA
TCCCACATGTGAGTGATCAAAATACTAAAGACGTCAAAGAGATCTCACCTGTCAGTACTCAAAAGACTGAACATGTTAAAAGACTTCACCGTGAGGCGAGTGAGACGGTC
AATGGAGTTTTCCTAGGAAAACATAATGCCAAAGTGTTTGTTGACATGATCATTGTCGAAAAAGAGAACCCTCGCATTCCAATTCCAGTGAAAGGTGAGATAGAGTTTCT
CTCCCAATCTATAGGTGCTGCAGTTGCTTGGCCTTGTGCTTTGGTTGCTCTATGTAAAGATAAGGACTCGAAACGTAAAACCGTGATAAAACACTCATTTCCTAATTCGA
CAACCACACATCCATCTGATATAATGCAATTTTGTAGTATGGTCGAGATATCAAATACTTGTGTATTGGTCTGTGCTGCGATCCTTTGGACGCATTTTGAGGAGACTGGT
AGACTAGACAGGTTTAAGGTCGTGGACTCAAACGACATTGCACCGGTGTTTGGGACCCAAAAAAGATGTGCAAGAATTTTAACTACCGTCTTTTCTTCAGTACAACCGGG
GCAAATGGTATTCCTTACATATAATATTGGAGCATTGAGGGTTTGTATGGCAGAAAGTACATCGAAGCAACTACAAAAGACTTCTTGGATACCTGTAAAGTGTCCTCGCC
AACAAGGTTGCGTTGAATGTGGGTACTACGTGATGAAGTTTATGAGAGAAATTCTACATAATCCAGAGAAGCCCATCATTGCTCTCCTAGCTGTTTCCTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAACGATTCCACCCGATGAATTACAACAAAGAGACTCAGACAAAGATATTTTGACTGAAGCATTGGGGACACCTGAACATGCTGGTCGTGTCAGAGGAGTGGGAGA
TTTTGTGTCGCCATATACGTACTTCAATGTTGTGCGATCTAAATCAAAGTTGGCGCACGATTCATCAACGTCGGTTCAAAGTACTCAAATGAAGACTGAAGACAAAGAAA
TCCCACATGTGAGTGATCAAAATACTAAAGACGTCAAAGAGATCTCACCTGTCAGTACTCAAAAGACTGAACATGTTAAAAGACTTCACCGTGAGGCGAGTGAGACGGTC
AATGGAGTTTTCCTAGGAAAACATAATGCCAAAGTGTTTGTTGACATGATCATTGTCGAAAAAGAGAACCCTCGCATTCCAATTCCAGTGAAAGGTGAGATAGAGTTTCT
CTCCCAATCTATAGGTGCTGCAGTTGCTTGGCCTTGTGCTTTGGTTGCTCTATGTAAAGATAAGGACTCGAAACGTAAAACCGTGATAAAACACTCATTTCCTAATTCGA
CAACCACACATCCATCTGATATAATGCAATTTTGTAGTATGGTCGAGATATCAAATACTTGTGTATTGGTCTGTGCTGCGATCCTTTGGACGCATTTTGAGGAGACTGGT
AGACTAGACAGGTTTAAGGTCGTGGACTCAAACGACATTGCACCGGTGTTTGGGACCCAAAAAAGATGTGCAAGAATTTTAACTACCGTCTTTTCTTCAGTACAACCGGG
GCAAATGGTATTCCTTACATATAATATTGGAGCATTGAGGGTTTGTATGGCAGAAAGTACATCGAAGCAACTACAAAAGACTTCTTGGATACCTGTAAAGTGTCCTCGCC
AACAAGGTTGCGTTGAATGTGGGTACTACGTGATGAAGTTTATGAGAGAAATTCTACATAATCCAGAGAAGCCCATCATTGCTCTCCTAGCTGTTTCCTTTTAG
Protein sequenceShow/hide protein sequence
MTTIPPDELQQRDSDKDILTEALGTPEHAGRVRGVGDFVSPYTYFNVVRSKSKLAHDSSTSVQSTQMKTEDKEIPHVSDQNTKDVKEISPVSTQKTEHVKRLHREASETV
NGVFLGKHNAKVFVDMIIVEKENPRIPIPVKGEIEFLSQSIGAAVAWPCALVALCKDKDSKRKTVIKHSFPNSTTTHPSDIMQFCSMVEISNTCVLVCAAILWTHFEETG
RLDRFKVVDSNDIAPVFGTQKRCARILTTVFSSVQPGQMVFLTYNIGALRVCMAESTSKQLQKTSWIPVKCPRQQGCVECGYYVMKFMREILHNPEKPIIALLAVSF