CuGenDBv2

Spg008526 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg008526
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: RNase H domain-containing protein
Genome location: scaffold10:35589836..35590540
RNA-Seq Expression: Spg008526
Synteny: Spg008526
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits (e value, % identity, alignment)
CAD5320331.1 unnamed protein product [Arabidopsis thaliana] (e value: 5.4e-15; % identity: 28.76)
Query:  ILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSI
        +LW LW  RN+ +   +  D   ++R +    D  E +    L  +  G   E       V+WKPPP    K N+DA+W     R GIGWI R+ SG  +
Subjt:  ILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSI

Query:  CMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLA
         MG + +    +V   E++A+   +  +   ++         I  ESDA  ++ +LN  D+    +    E+I +L + F E+KF++ PR  N   D +A
Subjt:  CMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLA

Query:  RIAISPPPWFP-FWNPLPWWKRVLVV
        R +IS   + P  ++ +P W R  ++
Subjt:  RIAISPPPWFP-FWNPLPWWKRVLVV

XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia] (e value: 3.1e-18; % identity: 37.29)
Query:  MILWNLWNYRNKTIQSSSIPDK----ISMVRMIERSLD--------LREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGG
        +I W +W  RN++I      D+     S+V  I  ++D         R   ND +L    RGR + L  L   VRW  PP + WKLN+DASWSE ++ GG
Subjt:  MILWNLWNYRNKTIQSSSIPDK----ISMVRMIERSLD--------LREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGG

Query:  IGWIFRDSSGSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLS
        IGWI  D  G  +  G  +I+EK  +  LE+  I+ GL+ I       + Q+  PI++ESD+V VI+++  ED DL+
Subjt:  IGWIFRDSSGSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLS

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 2.0e-17; % identity: 31.6)
Query:  MILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGR--NSELKTL-----TSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIF
        +I W +W  RNK+I     P+     R I+ ++D R   N A  N+  +G+  N +L  +      +  +WKPP  ++WKLN++A+W      GGIGWI 
Subjt:  MILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGR--NSELKTL-----TSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIF

Query:  RDSSGSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQ
        RD  G  I    + I+ + ++  LE+ AI EGL+ I         +   PI +ESD++  I +L+ + +D +EI +L EEI ++      +   +  R+ 
Subjt:  RDSSGSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQ

Query:  NLAVDLLARIAI
        N     LAR A+
Subjt:  NLAVDLLARIAI

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e value: 1.7e-21; % identity: 31.84)
Query:  WNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSICM
        W +WN+RN  I         +M++ + + +      ++  L+          KTL ++++W+PPP+  W LN+DASWS+S  RGGIGWI R   G  +  
Subjt:  WNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSICM

Query:  GFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLARI
        G + ++   +VK+LE  AILEGL+N+ +           P+ +E+D+  V  +LN + EDL++  ++ EEI+ L++    + F    R+ N     LA+ 
Subjt:  GFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLARI

Query:  A
        A
Subjt:  A

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e value: 9.0e-18; % identity: 30.62)
Query:  MILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTL---TSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSS
        +I W +W  RNK+I      +   +  +I+R + +     D +L  +   ++  L       +  RWKPP  ++WKLN+DA+W      GGIGWI RD  
Subjt:  MILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTL---TSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSS

Query:  GSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASI-PIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLA
        G  I    + I+ + ++  LE+ AI EGL+ I         Q    PI +ESD++  I +L+ + +D +EI +L EEI ++      +   +  R+ N  
Subjt:  GSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASI-PIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLA

Query:  VDLLARIAI
           LAR A+
Subjt:  VDLLARIAI

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 9.7e-18; % identity: 31.6)
Query:  MILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGR--NSELKTL-----TSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIF
        +I W +W  RNK+I     P+     R I+ ++D R   N A  N+  +G+  N +L  +      +  +WKPP  ++WKLN++A+W      GGIGWI 
Subjt:  MILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGR--NSELKTL-----TSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIF

Query:  RDSSGSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQ
        RD  G  I    + I+ + ++  LE+ AI EGL+ I         +   PI +ESD++  I +L+ + +D +EI +L EEI ++      +   +  R+ 
Subjt:  RDSSGSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQ

Query:  NLAVDLLARIAI
        N     LAR A+
Subjt:  NLAVDLLARIAI

A0A6J1CQG0 uncharacterized protein LOC111013216 (e value: 1.5e-18; % identity: 37.29)
Query:  MILWNLWNYRNKTIQSSSIPDK----ISMVRMIERSLD--------LREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGG
        +I W +W  RN++I      D+     S+V  I  ++D         R   ND +L    RGR + L  L   VRW  PP + WKLN+DASWSE ++ GG
Subjt:  MILWNLWNYRNKTIQSSSIPDK----ISMVRMIERSLD--------LREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGG

Query:  IGWIFRDSSGSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLS
        IGWI  D  G  +  G  +I+EK  +  LE+  I+ GL+ I       + Q+  PI++ESD+V VI+++  ED DL+
Subjt:  IGWIFRDSSGSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLS

A0A6J1DNV9 uncharacterized protein LOC111022403 (e value: 8.5e-22; % identity: 31.84)
Query:  WNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSICM
        W +WN+RN  I         +M++ + + +      ++  L+          KTL ++++W+PPP+  W LN+DASWS+S  RGGIGWI R   G  +  
Subjt:  WNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSICM

Query:  GFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLARI
        G + ++   +VK+LE  AILEGL+N+ +           P+ +E+D+  V  +LN + EDL++  ++ EEI+ L++    + F    R+ N     LA+ 
Subjt:  GFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLARI

Query:  A
        A
Subjt:  A

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 4.3e-18; % identity: 30.62)
Query:  MILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTL---TSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSS
        +I W +W  RNK+I      +   +  +I+R + +     D +L  +   ++  L       +  RWKPP  ++WKLN+DA+W      GGIGWI RD  
Subjt:  MILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTL---TSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSS

Query:  GSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASI-PIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLA
        G  I    + I+ + ++  LE+ AI EGL+ I         Q    PI +ESD++  I +L+ + +D +EI +L EEI ++      +   +  R+ N  
Subjt:  GSSICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASI-PIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLA

Query:  VDLLARIAI
           LAR A+
Subjt:  VDLLARIAI

A0A7G2EDP1 (thale cress) hypothetical protein (e value: 2.6e-15; % identity: 28.76)
Query:  ILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSI
        +LW LW  RN+ +   +  D   ++R +    D  E +    L  +  G   E       V+WKPPP    K N+DA+W     R GIGWI R+ SG  +
Subjt:  ILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSI

Query:  CMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLA
         MG + +    +V   E++A+   +  +   ++         I  ESDA  ++ +LN  D+    +    E+I +L + F E+KF++ PR  N   D +A
Subjt:  CMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLA

Query:  RIAISPPPWFP-FWNPLPWWKRVLVV
        R +IS   + P  ++ +P W R  ++
Subjt:  RIAISPPPWFP-FWNPLPWWKRVLVV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 8.4e-06; % identity: 24.52)
Query:  QSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSICMGFQQIKEKWSV
        Q   IP +I++      + +  E N  A +N++ RG   E     +  RW+ P     K N D S+     +   GW+ RDS+GS +  G    ++  + 
Subjt:  QSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSICMGFQQIKEKWSV

Query:  KILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIR----LKNRFVEIKFDYCPRDQNLAVDLLARIAISPPPW
           E++A++  +++    SHG        +  E D   +  ++NG     S++ F     IR       +F  I F++  R  N   D+LA+  +     
Subjt:  KILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIR----LKNRFVEIKFDYCPRDQNLAVDLLARIAISPPPW

Query:  F--PFWNP
        F   FW P
Subjt:  F--PFWNP

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 3.6e-17; % identity: 28.32)
Query:  ILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSI
        +LW LW  RN+ +      D   ++R      D  E +    L  +  G   E       V+WK PP    K N+DA+W     R GIGWI R+ SG  +
Subjt:  ILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSI

Query:  CMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLA
         MG + +    +V   E++A+   +  +   ++         I  ESDA  ++ +LN  D+    +    E+I +L + F E+KF++ PR  N   D +A
Subjt:  CMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLA

Query:  RIAISPPPWFP-FWNPLPWWKRVLVV
        R +IS   + P  ++ +P W R  ++
Subjt:  RIAISPPPWFP-FWNPLPWWKRVLVV

AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.0e-07; % identity: 21.26)
Query:  LDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSICMGFQQIKEKWSVKILEMKAILEGLKNIPST
        LD++  N+D     Q +   ++        +W+ P  +  K N D S    ++  G+ WI R+S G+ +  G  + + + ++K  E  A++  ++     
Subjt:  LDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSICMGFQQIKEKWSVKILEMKAILEGLKNIPST

Query:  SHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLARIAIS
         +         +  E D + V +++  ++ +   + +  E I +    F  +KF +  R+QN+ VD+LA+ A++
Subjt:  SHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLARIAIS

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 6.8e-16; % identity: 27.03)
Query:  ILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSI
        +LW LW  RN+ +      +   ++R  E  L+      +A    +  G   ++   +S  RW+PPP    K N+DA+W+   +R GIGW+ R+  G   
Subjt:  ILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSI

Query:  CMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLA
         MG + + +  SV   E++A+   + ++    +         +  ESD+  +I+ILN  DE    +    +++ RL ++F E+KF + PR+ N   + +A
Subjt:  CMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLA

Query:  RIAISPPPWFP-FWNPLPWWKR
        R ++S   + P  ++ +P W R
Subjt:  RIAISPPPWFP-FWNPLPWWKR

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 9.6e-10; % identity: 23.53)
Query:  ILWNLWNYRNKTIQSSSIPDKISMVRMIERSL-DLREDNNDAHLNSQQRG-RNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGS
        ++W +W   N  + + +   +      +E +L D +E  ++   N QQ G RN++    T   +W PP  D  K N DAS  E     G+GWI R+S G+
Subjt:  ILWNLWNYRNKTIQSSSIPDKISMVRMIERSL-DLREDNNDAHLNSQQRG-RNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGS

Query:  SICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDL
         I  G  + + + + +  E   ++  ++     S+G   +    +  E D   + +++N +  +   +    + I      F  I+F +  R+QN   D 
Subjt:  SICMGFQQIKEKWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDL

Query:  LARIAISPPPWFPFWNPLPWW
        LA+ AI     +  ++  P++
Subjt:  LARIAISPPPWFPFWNPLPWW


Sequences
CDS sequence
ATGATTTTGTGGAATCTATGGAATTACAGAAACAAAACAATACAATCATCCAGCATCCCAGACAAGATTTCAATGGTCAGAATGATTGAAAGAAGCCTCGATCTCAGAGA
GGATAACAATGACGCGCACCTGAATTCTCAGCAGCGAGGAAGGAATTCCGAATTGAAGACCCTCACGAGTCAAGTGAGATGGAAACCGCCGCCGGTAGACGCATGGAAGC
TAAATTCAGATGCTTCCTGGAGTGAATCACAGAAAAGAGGAGGAATTGGTTGGATTTTTCGTGACTCTTCAGGGTCTTCAATCTGTATGGGATTTCAGCAAATTAAGGAA
AAATGGTCGGTCAAAATTCTAGAAATGAAGGCGATTTTAGAAGGTCTGAAGAACATACCTTCAACCAGCCATGGTGACAGCTCCCAAGCCTCGATCCCCATCTTCGTCGA
GTCTGATGCGGTTAATGTTATCAAAATTCTAAACGGGGAAGATGAAGATTTGTCGGAGATTTCTTTCCTTACCGAAGAGATCATCCGCCTGAAGAATCGTTTCGTAGAAA
TCAAATTCGATTACTGCCCGAGAGATCAAAACTTAGCAGTGGATCTTCTGGCTCGCATTGCTATTTCTCCCCCTCCCTGGTTCCCGTTTTGGAATCCTCTCCCATGGTGG
AAGAGGGTGTTGGTTGTGTGGTTTGGGCCCCCCTTTTGTGATTAA
mRNA sequence
ATGATTTTGTGGAATCTATGGAATTACAGAAACAAAACAATACAATCATCCAGCATCCCAGACAAGATTTCAATGGTCAGAATGATTGAAAGAAGCCTCGATCTCAGAGA
GGATAACAATGACGCGCACCTGAATTCTCAGCAGCGAGGAAGGAATTCCGAATTGAAGACCCTCACGAGTCAAGTGAGATGGAAACCGCCGCCGGTAGACGCATGGAAGC
TAAATTCAGATGCTTCCTGGAGTGAATCACAGAAAAGAGGAGGAATTGGTTGGATTTTTCGTGACTCTTCAGGGTCTTCAATCTGTATGGGATTTCAGCAAATTAAGGAA
AAATGGTCGGTCAAAATTCTAGAAATGAAGGCGATTTTAGAAGGTCTGAAGAACATACCTTCAACCAGCCATGGTGACAGCTCCCAAGCCTCGATCCCCATCTTCGTCGA
GTCTGATGCGGTTAATGTTATCAAAATTCTAAACGGGGAAGATGAAGATTTGTCGGAGATTTCTTTCCTTACCGAAGAGATCATCCGCCTGAAGAATCGTTTCGTAGAAA
TCAAATTCGATTACTGCCCGAGAGATCAAAACTTAGCAGTGGATCTTCTGGCTCGCATTGCTATTTCTCCCCCTCCCTGGTTCCCGTTTTGGAATCCTCTCCCATGGTGG
AAGAGGGTGTTGGTTGTGTGGTTTGGGCCCCCCTTTTGTGATTAA
Protein sequence
MILWNLWNYRNKTIQSSSIPDKISMVRMIERSLDLREDNNDAHLNSQQRGRNSELKTLTSQVRWKPPPVDAWKLNSDASWSESQKRGGIGWIFRDSSGSSICMGFQQIKE
KWSVKILEMKAILEGLKNIPSTSHGDSSQASIPIFVESDAVNVIKILNGEDEDLSEISFLTEEIIRLKNRFVEIKFDYCPRDQNLAVDLLARIAISPPPWFPFWNPLPWW
KRVLVVWFGPPFCD
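
Consistency check (illustrative sketch, not part of the CuGenDB record): the Python snippet below is one way to confirm that the CDS, protein sequence and genome location listed above agree with each other. It assumes the standard genetic code (NCBI translation table 1) and a 1-based, inclusive coordinate convention for the location string. The helper names `translate` and `span_length` are hypothetical, and only the first and last stretches of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def span_length(location: str) -> int:
    """Length in bp of a 'scaffold:start..end' location (assumed 1-based, inclusive)."""
    start, end = location.split(":")[1].split("..")
    return int(end) - int(start) + 1


# First 39 nt and last 27 nt of the CDS listed above (both in frame 0).
cds_start = "ATGATTTTGTGGAATCTATGGAATTACAGAAACAAAACA"
cds_end = "TGGTTTGGGCCCCCCTTTTGTGATTAA"

print(translate(cds_start))  # MILWNLWNYRNKT
print(translate(cds_end))    # WFGPPFCD
print(span_length("scaffold10:35589836..35590540"))  # 705
```

Under these assumptions the 705 bp span corresponds to 235 codons, i.e. 234 residues plus a stop codon, which fits a single-exon gene whose CDS and mRNA sequences are identical.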