; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg023877 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg023877
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUlp1-like peptidase
Genome locationscaffold13:11239536..11243851
RNA-Seq ExpressionSpg023877
SyntenySpg023877
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042614.1 Ulp1-like peptidase [Cucumis melo var. makuwa]2.0e-4134.43Show/hide
Query:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK
        KYFG+    DM+  TFE+ Y +N+ F +D DAVKV+LVYYT LAM+GKD+ K+++++SL  +VEDL ++NS+DWG  +WERT++GL+N LK++VD +KKK
Subjt:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK

Query:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV
             ++ VKYSL  FPHAFQVWA E +S+M  G+   R+  DA+P  LRW C      K+L++ +F   +    +   ++ + E+     Q      +V
Subjt:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV

Query:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNS--------------PRTTQPGTSADGIRIE
         D  E  P          G  D  G  +   L   D+++    HL       E   +  PH +   D NS              P  +      +G   E
Subjt:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNS--------------PRTTQPGTSADGIRIE

Query:  GGDGDMIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP
            ++ + L+ +D     R+  VE  L E++S+++ +TSLLR  C+ +NV +        DSR P
Subjt:  GGDGDMIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP

KGN49944.1 hypothetical protein Csa_000148 [Cucumis sativus]5.3e-4233Show/hide
Query:  PLSPDPQWLLFDPVVAKYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIK
        P  P P+      +  KYFG+    DM+  TFE+ Y +N+ F +D DAVKV+LVYYT LAM+GKD+ K+++++SL  +VEDL ++NS+DWG  +WE+T++
Subjt:  PLSPDPQWLLFDPVVAKYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIK

Query:  GLKNPLKDRVDTFKKKAMEKSDHKVKYSLSDFPHAFQVWANETMSTMEGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPK--------TSYRAK
        GL+N LK++VD +KKK     ++ VKYSL  FPHAFQVWA E +S++ GK   R+  +A+P ILRW C      K+L++ VF   +         S   K
Subjt:  GLKNPLKDRVDTFKKKAMEKSDHKVKYSLSDFPHAFQVWANETMSTMEGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPK--------TSYRAK

Query:  RGIVGEIEE------GGEHNQRHVGDLVEFNPHGSSDVHG-TETGVLDVCDVASARIDHLMESDEGMVDRQPHVDVGNDRN-------------SPRTTQ
        +    E++E              V D+ E +  G  D  G  +   L   D+++   +HL +      +R  + D  ND +               R  +
Subjt:  RGIVGEIEE------GGEHNQRHVGDLVEFNPHGSSDVHG-TETGVLDVCDVASARIDHLMESDEGMVDRQPHVDVGNDRN-------------SPRTTQ

Query:  PGTSADGIRIEG-----GDGDMIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSR----------VPNIQPDADT
        P  S    + EG      + ++ + L+ +D     R+  VE  L E++S+++ +TSLL   C+ +NV +        DSR           P+I+    T
Subjt:  PGTSADGIRIEG-----GDGDMIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSR----------VPNIQPDADT

Query:  GVD
        G+D
Subjt:  GVD

XP_008437500.1 PREDICTED: uncharacterized protein LOC103482899 isoform X1 [Cucumis melo]1.5e-4134.97Show/hide
Query:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK
        KYFG+    DM+  TFE+ Y +N+ F +D DAVKV+LVYYT LAM+GKD+ K+++++SL  +VEDL ++NS+DWG  +WERT++GL+N LK++VD +KKK
Subjt:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK

Query:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV
             ++ VKYSL  FPHAFQVWA E +S+M  G+   R+  DA+P  LRW C      K+L++ +F   +    +   ++ + E+     Q      +V
Subjt:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV

Query:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNSPR---------TTQPGTSADGIRIEGGDGD
         D  E  P          G  D  G  +   L   D+++    HL       E   +  PH +   D NS R           +P  S    + EG   +
Subjt:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNSPR---------TTQPGTSADGIRIEGGDGD

Query:  -----MIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP
             + + L+ +D     R+  VE  L E++S+++ +TSLLR  C+ +NV +        DSR P
Subjt:  -----MIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP

XP_008437501.1 PREDICTED: uncharacterized protein LOC103482899 isoform X2 [Cucumis melo]1.5e-4134.97Show/hide
Query:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK
        KYFG+    DM+  TFE+ Y +N+ F +D DAVKV+LVYYT LAM+GKD+ K+++++SL  +VEDL ++NS+DWG  +WERT++GL+N LK++VD +KKK
Subjt:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK

Query:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV
             ++ VKYSL  FPHAFQVWA E +S+M  G+   R+  DA+P  LRW C      K+L++ +F   +    +   ++ + E+     Q      +V
Subjt:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV

Query:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNSPR---------TTQPGTSADGIRIEGGDGD
         D  E  P          G  D  G  +   L   D+++    HL       E   +  PH +   D NS R           +P  S    + EG   +
Subjt:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNSPR---------TTQPGTSADGIRIEGGDGD

Query:  -----MIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP
             + + L+ +D     R+  VE  L E++S+++ +TSLLR  C+ +NV +        DSR P
Subjt:  -----MIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP

XP_011654656.1 uncharacterized protein LOC105435430 isoform X1 [Cucumis sativus]5.3e-4233Show/hide
Query:  PLSPDPQWLLFDPVVAKYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIK
        P  P P+      +  KYFG+    DM+  TFE+ Y +N+ F +D DAVKV+LVYYT LAM+GKD+ K+++++SL  +VEDL ++NS+DWG  +WE+T++
Subjt:  PLSPDPQWLLFDPVVAKYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIK

Query:  GLKNPLKDRVDTFKKKAMEKSDHKVKYSLSDFPHAFQVWANETMSTMEGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPK--------TSYRAK
        GL+N LK++VD +KKK     ++ VKYSL  FPHAFQVWA E +S++ GK   R+  +A+P ILRW C      K+L++ VF   +         S   K
Subjt:  GLKNPLKDRVDTFKKKAMEKSDHKVKYSLSDFPHAFQVWANETMSTMEGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPK--------TSYRAK

Query:  RGIVGEIEE------GGEHNQRHVGDLVEFNPHGSSDVHG-TETGVLDVCDVASARIDHLMESDEGMVDRQPHVDVGNDRN-------------SPRTTQ
        +    E++E              V D+ E +  G  D  G  +   L   D+++   +HL +      +R  + D  ND +               R  +
Subjt:  RGIVGEIEE------GGEHNQRHVGDLVEFNPHGSSDVHG-TETGVLDVCDVASARIDHLMESDEGMVDRQPHVDVGNDRN-------------SPRTTQ

Query:  PGTSADGIRIEG-----GDGDMIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSR----------VPNIQPDADT
        P  S    + EG      + ++ + L+ +D     R+  VE  L E++S+++ +TSLL   C+ +NV +        DSR           P+I+    T
Subjt:  PGTSADGIRIEG-----GDGDMIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSR----------VPNIQPDADT

Query:  GVD
        G+D
Subjt:  GVD

TrEMBL top hitse value%identityAlignment
A0A0A0KM59 DUF1985 domain-containing protein2.6e-4233Show/hide
Query:  PLSPDPQWLLFDPVVAKYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIK
        P  P P+      +  KYFG+    DM+  TFE+ Y +N+ F +D DAVKV+LVYYT LAM+GKD+ K+++++SL  +VEDL ++NS+DWG  +WE+T++
Subjt:  PLSPDPQWLLFDPVVAKYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIK

Query:  GLKNPLKDRVDTFKKKAMEKSDHKVKYSLSDFPHAFQVWANETMSTMEGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPK--------TSYRAK
        GL+N LK++VD +KKK     ++ VKYSL  FPHAFQVWA E +S++ GK   R+  +A+P ILRW C      K+L++ VF   +         S   K
Subjt:  GLKNPLKDRVDTFKKKAMEKSDHKVKYSLSDFPHAFQVWANETMSTMEGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPK--------TSYRAK

Query:  RGIVGEIEE------GGEHNQRHVGDLVEFNPHGSSDVHG-TETGVLDVCDVASARIDHLMESDEGMVDRQPHVDVGNDRN-------------SPRTTQ
        +    E++E              V D+ E +  G  D  G  +   L   D+++   +HL +      +R  + D  ND +               R  +
Subjt:  RGIVGEIEE------GGEHNQRHVGDLVEFNPHGSSDVHG-TETGVLDVCDVASARIDHLMESDEGMVDRQPHVDVGNDRN-------------SPRTTQ

Query:  PGTSADGIRIEG-----GDGDMIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSR----------VPNIQPDADT
        P  S    + EG      + ++ + L+ +D     R+  VE  L E++S+++ +TSLL   C+ +NV +        DSR           P+I+    T
Subjt:  PGTSADGIRIEG-----GDGDMIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSR----------VPNIQPDADT

Query:  GVD
        G+D
Subjt:  GVD

A0A1S3ATU8 uncharacterized protein LOC103482899 isoform X17.5e-4234.97Show/hide
Query:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK
        KYFG+    DM+  TFE+ Y +N+ F +D DAVKV+LVYYT LAM+GKD+ K+++++SL  +VEDL ++NS+DWG  +WERT++GL+N LK++VD +KKK
Subjt:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK

Query:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV
             ++ VKYSL  FPHAFQVWA E +S+M  G+   R+  DA+P  LRW C      K+L++ +F   +    +   ++ + E+     Q      +V
Subjt:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV

Query:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNSPR---------TTQPGTSADGIRIEGGDGD
         D  E  P          G  D  G  +   L   D+++    HL       E   +  PH +   D NS R           +P  S    + EG   +
Subjt:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNSPR---------TTQPGTSADGIRIEGGDGD

Query:  -----MIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP
             + + L+ +D     R+  VE  L E++S+++ +TSLLR  C+ +NV +        DSR P
Subjt:  -----MIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP

A0A1S3AUB0 uncharacterized protein LOC103482899 isoform X27.5e-4234.97Show/hide
Query:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK
        KYFG+    DM+  TFE+ Y +N+ F +D DAVKV+LVYYT LAM+GKD+ K+++++SL  +VEDL ++NS+DWG  +WERT++GL+N LK++VD +KKK
Subjt:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK

Query:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV
             ++ VKYSL  FPHAFQVWA E +S+M  G+   R+  DA+P  LRW C      K+L++ +F   +    +   ++ + E+     Q      +V
Subjt:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV

Query:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNSPR---------TTQPGTSADGIRIEGGDGD
         D  E  P          G  D  G  +   L   D+++    HL       E   +  PH +   D NS R           +P  S    + EG   +
Subjt:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNSPR---------TTQPGTSADGIRIEGGDGD

Query:  -----MIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP
             + + L+ +D     R+  VE  L E++S+++ +TSLLR  C+ +NV +        DSR P
Subjt:  -----MIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP

A0A5A7TGU0 Ulp1-like peptidase9.8e-4234.43Show/hide
Query:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK
        KYFG+    DM+  TFE+ Y +N+ F +D DAVKV+LVYYT LAM+GKD+ K+++++SL  +VEDL ++NS+DWG  +WERT++GL+N LK++VD +KKK
Subjt:  KYFGS-CPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKK

Query:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV
             ++ VKYSL  FPHAFQVWA E +S+M  G+   R+  DA+P  LRW C      K+L++ +F   +    +   ++ + E+     Q      +V
Subjt:  AMEKSDHKVKYSLSDFPHAFQVWANETMSTM-EGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQ-----RHV

Query:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNS--------------PRTTQPGTSADGIRIE
         D  E  P          G  D  G  +   L   D+++    HL       E   +  PH +   D NS              P  +      +G   E
Subjt:  GDLVEFNP---------HGSSDVHGT-ETGVLDVCDVASARIDHLM---ESDEGMVDRQPHVDVGNDRNS--------------PRTTQPGTSADGIRIE

Query:  GGDGDMIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP
            ++ + L+ +D     R+  VE  L E++S+++ +TSLLR  C+ +NV +        DSR P
Subjt:  GGDGDMIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVP

A0A6J1DSS5 uncharacterized protein LOC1110239696.8e-3547.77Show/hide
Query:  DMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKKAMEKSDHKV
        D+ L  FE+EY + +VF ND+DAVKVSL+YYT + M+GK+K K+ VD+ L+ +VEDL +FN++DWGT IW+RT+KGL++ +KD+V  +K K       +V
Subjt:  DMSLSTFEKEYMENVVFDNDEDAVKVSLVYYTNLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKKAMEKSDHKV

Query:  KYSLSDFPHAFQVWANETMSTMEGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVF
        +YSL+ FP AFQVWA E + ++     +R+   A+P I R+ C +  + KVLE+ VF
Subjt:  KYSLSDFPHAFQVWANETMSTMEGKFADRMRRDAIPHILRWKCRRVASYKVLEKGVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAGGAAGTCAGTTTCCCGGTGAGGTACGGAGTTCAGAAGACGGATGAGCATGGAGACATAGTCGCTCGATACTTAAATCAGTCGTTGGGAACGGCACGCGCGAT
TATTTCGTCTAAAGATTGCGTAGGCCGTAGCACCTGGCAGCCTGATGTGTTCCCCCTGTCTCCCGATCCGCAGTGGTTGCTCTTCGACCCAGTGGTGGCCAAATACTTTG
GATCGTGCCCGACTGACATGAGCCTTTCCACCTTCGAGAAGGAGTATATGGAAAATGTGGTATTCGACAATGACGAGGATGCAGTTAAGGTGTCATTGGTGTACTACACG
AACCTGGCAATGTTGGGAAAAGACAAACAAAAGACATTGGTCGACGAATCGTTGTTTTGGGAAGTGGAGGATTTGGCACACTTCAACAGCATTGACTGGGGTACCAGAAT
ATGGGAGAGGACGATCAAGGGCTTGAAAAATCCCCTGAAGGATAGAGTGGACACTTTTAAGAAGAAGGCGATGGAGAAGAGTGACCACAAAGTGAAGTACAGTCTGAGTG
ACTTTCCCCATGCTTTCCAGGTATGGGCGAACGAGACCATGTCCACTATGGAGGGGAAGTTCGCGGATAGAATGAGGCGCGATGCGATCCCCCACATCCTTAGATGGAAA
TGCAGACGGGTTGCAAGCTACAAAGTGTTGGAGAAAGGTGTATTCGAGTATCCGAAGACATCGTATAGGGCTAAGAGGGGAATTGTGGGAGAGATTGAAGAAGGTGGTGA
GCATAATCAACGCCATGTTGGTGATCTTGTGGAGTTCAACCCCCATGGATCATCTGATGTGCATGGGACCGAAACGGGCGTACTAGACGTGTGTGACGTCGCATCCGCGC
GAATAGACCATTTGATGGAGAGTGATGAGGGGATGGTTGATCGTCAACCGCATGTGGATGTCGGAAACGATCGTAACTCACCACGTACCACTCAACCGGGCACATCTGCT
GATGGGATACGGATCGAAGGTGGGGACGGTGACATGATCGCTACATTGGAACGAATGGACGACCGCCTCAACAATAGGTTAGATGAGGTGGAGAAGCAACTGTTTGAGCT
GCGGTCGGAGATGAAAATCATGACGTCTCTACTACGACAATTATGCCAGGAGAGGAATGTAACAGACATTGGCGTGGTGACCATGCATGCTGACTCGCGCGTACCGAACA
TACAGCCGGACGCAGACACTGGTGTGGATACCGTGGATGTCCACCCACGCGTACAGGACATTCAGCCCAAAATGGAGACTAGCGGGTTGACCATGGATGTCGCCTCGCGC
AGACCGGACATCCAGCTTGAAACGGAGACTGGCGTGGTGACCACATATGTCGACTCGCGCAGACAGGACATCCAGTTCGAATCAATTACTGGCGTGGAGACTAGCGAGGA
CCGGTCGGAAGAGGGTTGGAACGAAGCGTATACTCCCCTAGGACGGGGAAAGAAGAGGAAAAATGAAGACTCCAGCATTCGGTATGATTGGGAACCTATTTTGAAAATCA
AAGACCGCATGTATCCCATGCCAGGTGTGCCCTTTAATGGGGATCCGTCACCAGCTGTCAGATATTCTTCGCTTCGCAGCATCCCGGCTACACTGTTCGAGGAGTTTCAC
CAATGGCTCTCAAATCCAAACAACGAAGATGACACGCGGCCGTCGTTCGTGTCACACGACGTTGCTACTGAGAAGCGAGCCAGGCTCGATAAGAAGTTCTTCTACATTGT
TGCCCTCCCGAGTAAGCCATTGGAAGACACTGAATCAAGTTGGCAATGCGAGCAGACATACTTGGACTACGAGCTAGGGCTCGACAATGATTTTGCTCCTGCATGGGGGG
ACGTCGACTTTTTGTACACGTTGGCGACGATGCGCGAGCATTACCTTCTGTTGGCGATGTACATGAATCGAGGTGCAATTTACGTCTACGATTCCATGTCGGGGTACATC
AAGCGGCCCCAAGTAGACAAGTTCTTGGAGCCACTTTGTCATATGTTGTCGTCTTTACTGCATGCATGCAACATGTATACGATAGTTTTATCATATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACAAGGAAGTCAGTTTCCCGGTGAGGTACGGAGTTCAGAAGACGGATGAGCATGGAGACATAGTCGCTCGATACTTAAATCAGTCGTTGGGAACGGCACGCGCGAT
TATTTCGTCTAAAGATTGCGTAGGCCGTAGCACCTGGCAGCCTGATGTGTTCCCCCTGTCTCCCGATCCGCAGTGGTTGCTCTTCGACCCAGTGGTGGCCAAATACTTTG
GATCGTGCCCGACTGACATGAGCCTTTCCACCTTCGAGAAGGAGTATATGGAAAATGTGGTATTCGACAATGACGAGGATGCAGTTAAGGTGTCATTGGTGTACTACACG
AACCTGGCAATGTTGGGAAAAGACAAACAAAAGACATTGGTCGACGAATCGTTGTTTTGGGAAGTGGAGGATTTGGCACACTTCAACAGCATTGACTGGGGTACCAGAAT
ATGGGAGAGGACGATCAAGGGCTTGAAAAATCCCCTGAAGGATAGAGTGGACACTTTTAAGAAGAAGGCGATGGAGAAGAGTGACCACAAAGTGAAGTACAGTCTGAGTG
ACTTTCCCCATGCTTTCCAGGTATGGGCGAACGAGACCATGTCCACTATGGAGGGGAAGTTCGCGGATAGAATGAGGCGCGATGCGATCCCCCACATCCTTAGATGGAAA
TGCAGACGGGTTGCAAGCTACAAAGTGTTGGAGAAAGGTGTATTCGAGTATCCGAAGACATCGTATAGGGCTAAGAGGGGAATTGTGGGAGAGATTGAAGAAGGTGGTGA
GCATAATCAACGCCATGTTGGTGATCTTGTGGAGTTCAACCCCCATGGATCATCTGATGTGCATGGGACCGAAACGGGCGTACTAGACGTGTGTGACGTCGCATCCGCGC
GAATAGACCATTTGATGGAGAGTGATGAGGGGATGGTTGATCGTCAACCGCATGTGGATGTCGGAAACGATCGTAACTCACCACGTACCACTCAACCGGGCACATCTGCT
GATGGGATACGGATCGAAGGTGGGGACGGTGACATGATCGCTACATTGGAACGAATGGACGACCGCCTCAACAATAGGTTAGATGAGGTGGAGAAGCAACTGTTTGAGCT
GCGGTCGGAGATGAAAATCATGACGTCTCTACTACGACAATTATGCCAGGAGAGGAATGTAACAGACATTGGCGTGGTGACCATGCATGCTGACTCGCGCGTACCGAACA
TACAGCCGGACGCAGACACTGGTGTGGATACCGTGGATGTCCACCCACGCGTACAGGACATTCAGCCCAAAATGGAGACTAGCGGGTTGACCATGGATGTCGCCTCGCGC
AGACCGGACATCCAGCTTGAAACGGAGACTGGCGTGGTGACCACATATGTCGACTCGCGCAGACAGGACATCCAGTTCGAATCAATTACTGGCGTGGAGACTAGCGAGGA
CCGGTCGGAAGAGGGTTGGAACGAAGCGTATACTCCCCTAGGACGGGGAAAGAAGAGGAAAAATGAAGACTCCAGCATTCGGTATGATTGGGAACCTATTTTGAAAATCA
AAGACCGCATGTATCCCATGCCAGGTGTGCCCTTTAATGGGGATCCGTCACCAGCTGTCAGATATTCTTCGCTTCGCAGCATCCCGGCTACACTGTTCGAGGAGTTTCAC
CAATGGCTCTCAAATCCAAACAACGAAGATGACACGCGGCCGTCGTTCGTGTCACACGACGTTGCTACTGAGAAGCGAGCCAGGCTCGATAAGAAGTTCTTCTACATTGT
TGCCCTCCCGAGTAAGCCATTGGAAGACACTGAATCAAGTTGGCAATGCGAGCAGACATACTTGGACTACGAGCTAGGGCTCGACAATGATTTTGCTCCTGCATGGGGGG
ACGTCGACTTTTTGTACACGTTGGCGACGATGCGCGAGCATTACCTTCTGTTGGCGATGTACATGAATCGAGGTGCAATTTACGTCTACGATTCCATGTCGGGGTACATC
AAGCGGCCCCAAGTAGACAAGTTCTTGGAGCCACTTTGTCATATGTTGTCGTCTTTACTGCATGCATGCAACATGTATACGATAGTTTTATCATATTAA
Protein sequenceShow/hide protein sequence
MNKEVSFPVRYGVQKTDEHGDIVARYLNQSLGTARAIISSKDCVGRSTWQPDVFPLSPDPQWLLFDPVVAKYFGSCPTDMSLSTFEKEYMENVVFDNDEDAVKVSLVYYT
NLAMLGKDKQKTLVDESLFWEVEDLAHFNSIDWGTRIWERTIKGLKNPLKDRVDTFKKKAMEKSDHKVKYSLSDFPHAFQVWANETMSTMEGKFADRMRRDAIPHILRWK
CRRVASYKVLEKGVFEYPKTSYRAKRGIVGEIEEGGEHNQRHVGDLVEFNPHGSSDVHGTETGVLDVCDVASARIDHLMESDEGMVDRQPHVDVGNDRNSPRTTQPGTSA
DGIRIEGGDGDMIATLERMDDRLNNRLDEVEKQLFELRSEMKIMTSLLRQLCQERNVTDIGVVTMHADSRVPNIQPDADTGVDTVDVHPRVQDIQPKMETSGLTMDVASR
RPDIQLETETGVVTTYVDSRRQDIQFESITGVETSEDRSEEGWNEAYTPLGRGKKRKNEDSSIRYDWEPILKIKDRMYPMPGVPFNGDPSPAVRYSSLRSIPATLFEEFH
QWLSNPNNEDDTRPSFVSHDVATEKRARLDKKFFYIVALPSKPLEDTESSWQCEQTYLDYELGLDNDFAPAWGDVDFLYTLATMREHYLLLAMYMNRGAIYVYDSMSGYI
KRPQVDKFLEPLCHMLSSLLHACNMYTIVLSY