CuGenDBv2

Spg008237 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg008237
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Ribonuclease H-like superfamily protein
Genome location: scaffold2:18479847..18481414
RNA-Seq Expression: Spg008237
Synteny: Spg008237
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
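
The genome location in this record is written as scaffold:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the span of the gene can be computed as below. `Location` and `parse_location` are hypothetical names used for illustration only; they are not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written on the gene page."""
    scaffold: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'scaffold:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    scaffold, start, end = match.groups()
    return Location(scaffold, int(start), int(end))


loc = parse_location("scaffold2:18479847..18481414")
print(loc.scaffold, loc.length)  # scaffold2 1568
```

Under that assumption the gene spans 1,568 bp on scaffold2.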


Homology
GenBank top hits (e value, % identity, alignment)
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] (e value: 2.3e-21; % identity: 29.44)
Query:  HILLQCQRAKELW-NITFNRVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNS
        H +  C+ AKE+W N  +  V ++   N +F + W  +  S S EE G  A  CW +W  RN  +        +  +  + + + +L++ F   +   ++
Subjt:  HILLQCQRAKELW-NITFNRVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNS

Query:  AVGHSSFPLD--NAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINF
          G  S P    + W PPP   +K+NVD A  S     G+G+V R  +G  + A  R I  S+   + EL A +EG+R A+D+     ++E D+Q  IN 
Subjt:  AVGHSSFPLD--NAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINF

Query:  LNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKFA
        +  T +       L E +  L  NF  +   ++PR  N+ A  LA+FA
Subjt:  LNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKFA

XP_022131661.1 uncharacterized protein LOC111004786 [Momordica charantia] (e value: 8.6e-24; % identity: 34.16)
Query:  SDHILLQCQRAKELWNITFNRVFQDVEFNGN--FVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQS
        + H +  C+RAK++WN+ F  +F     NGN  F+D W+L+    + ++    A T W IW DRN   HG  +    +R  WI++Y +  S+A   +   
Subjt:  SDHILLQCQRAKELWNITFNRVFQDVEFNGN--FVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQS

Query:  RNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAIN
        R S    S  P    W PP   + KVN DAA  S+S   GLG++ R   G ++ A + F+    +P  AE+R ILE ++LA      ++++ESD Q AI 
Subjt:  RNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAIN

Query:  FL
         +
Subjt:  FL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 2.1e-22; % identity: 30.92)
Query:  HILLQCQRAKELWNITFN-RVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEA----FPKRSQ
        H    C+RA+++W   F        E N +F++ W  +      ++L   A+T W IW DRN  +HG  + P+  +  W+T +L   S+A    +  R+Q
Subjt:  HILLQCQRAKELWNITFN-RVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEA----FPKRSQ

Query:  SRNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAI
        S +        P+   W P    S K+N DAA   +S     G + R    S+V A +  +     P  AE+R ILEG++ A       + +ESDS LAI
Subjt:  SRNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAI

Query:  NFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKF
          +          ++    I  L C F  ISF  S R  NRAA  LAK+
Subjt:  NFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKF

XP_023874626.1 uncharacterized protein LOC111987155 [Quercus suber] (e value: 4.4e-20; % identity: 31.33)
Query:  SDHILLQCQRAKELWNITFNRVFQDVEFNGNFVD-RWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSR
        S HIL  C  A+ +W  T  ++      +  FVD  W ++ +  S++ +   AVT W++W +RN    G               Y++E+    P R    
Subjt:  SDHILLQCQRAKELWNITFNRVFQDVEFNGNFVD-RWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSR

Query:  NSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINF
             H S P +  W PP    +K+N D A  S     G+G+V R   G ++GA ++ ID      E E RA+ EGVRLA DL L K+++ESDS   +N 
Subjt:  NSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINF

Query:  LNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKFAK
        +   S   S +  + E   +  C F E     + R  N AA ++AK+AK
Subjt:  LNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKFAK

XP_030505068.1 uncharacterized protein LOC115720043 [Cannabis sativa] (e value: 2.3e-21; % identity: 30)
Query:  HILLQCQRAKELWNITFNRVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNSA
        H L  C+RA+++W ++   + + +  + +  +  + + ++ S  +L + A   W+IW +RN+++HG    P ++   +  +YL E   A  ++  + NS 
Subjt:  HILLQCQRAKELWNITFNRVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNSA

Query:  V---GHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINF
             +SS   D+ W  PP    K+N DAA        G G + ++ +G IV   A      F P   E+ A++  ++   +L LP   IE+DSQL +N 
Subjt:  V---GHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINF

Query:  LNRTSDPWSCLESLTESIWVLGCNF--CEISFVFSPRVRNRAADILAKFA
        L  +  P S   SL ++I +L  NF   +IS V+  R  N AA +LAKFA
Subjt:  LNRTSDPWSCLESLTESIWVLGCNF--CEISFVFSPRVRNRAADILAKFA

TrEMBL top hits (e value, % identity, alignment)
A0A5E4FZN9 PREDICTED: retrotransposon (e value: 1.1e-21; % identity: 29.44)
Query:  HILLQCQRAKELW-NITFNRVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNS
        H +  C+ AKE+W N  +  V ++   N +F + W  +  S S EE G  A  CW +W  RN  +        +  +  + + + +L++ F   +   ++
Subjt:  HILLQCQRAKELW-NITFNRVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNS

Query:  AVGHSSFPLD--NAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINF
          G  S P    + W PPP   +K+NVD A  S     G+G+V R  +G  + A  R I  S+   + EL A +EG+R A+D+     ++E D+Q  IN 
Subjt:  AVGHSSFPLD--NAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINF

Query:  LNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKFA
        +  T +       L E +  L  NF  +   ++PR  N+ A  LA+FA
Subjt:  LNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKFA

A0A6J1BQ49 uncharacterized protein LOC111004786 (e value: 4.1e-24; % identity: 34.16)
Query:  SDHILLQCQRAKELWNITFNRVFQDVEFNGN--FVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQS
        + H +  C+RAK++WN+ F  +F     NGN  F+D W+L+    + ++    A T W IW DRN   HG  +    +R  WI++Y +  S+A   +   
Subjt:  SDHILLQCQRAKELWNITFNRVFQDVEFNGN--FVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQS

Query:  RNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAIN
        R S    S  P    W PP   + KVN DAA  S+S   GLG++ R   G ++ A + F+    +P  AE+R ILE ++LA      ++++ESD Q AI 
Subjt:  RNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAIN

Query:  FL
         +
Subjt:  FL

A0A803NGI9 Uncharacterized protein (e value: 3.3e-21; % identity: 28.96)
Query:  LAESLSAAVVSSSSEEESWMEVKTGWSKRRR-RECRLLS------DHILLQCQRAKELWNITFNRVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCW
        LA  L  + +S+SS  + ++ V+  W+     + C L S      +H L  C+ AK++W +    +  D+    +F D  ML+++  S  E+ ++  T W
Subjt:  LAESLSAAVVSSSSEEESWMEVKTGWSKRRR-RECRLLS------DHILLQCQRAKELWNITFNRVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCW

Query:  TIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGAR
         +W DRN  +HG   P ++   LW  +     +  F ++S S  S    +       W  PP D  K+NVDAA   S  +IG+GI+ R   G +V A ++
Subjt:  TIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGAR

Query:  FIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKFAKILKQD
         +     P E E +A+L G+  A    L   + ESDS + +N +N  S+  S    L   I         +      R  N+AA  LAK A  L  D
Subjt:  FIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKFAKILKQD

A0A803P5M6 Uncharacterized protein (e value: 1.1e-21; % identity: 30.94)
Query:  HILLQCQRAKELWNITFNRV-FQDVE--FNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVH-GDPLPPINMRSLWITNYLKELSEAFPK-RSQ
        H L  C+ AK++W ++  R+ F      FNG+++     I      E L       W IW DRNK VH G P  P       I  Y  +  E F K + Q
Subjt:  HILLQCQRAKELWNITFNRV-FQDVE--FNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVH-GDPLPPINMRSLWITNYLKELSEAFPK-RSQ

Query:  SRNSAV--GHSSFPLDNA-------WAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVI
        +R  A    H S P  +A       W PP  + +K+NVDAA      ++G+G + R   G+++ A ++ +  SF   E E +A+   V      + P   
Subjt:  SRNSAV--GHSSFPLDNA-------WAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVI

Query:  IESDSQLAINFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKFAKILKQD
        IE+D+    N LNR +   SC   L   I  L  +F ++      R  N+AA  LAK+A  L +D
Subjt:  IESDSQLAINFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKFAKILKQD

A0A803Q027 Uncharacterized protein (e value: 1.1e-21; % identity: 30)
Query:  HILLQCQRAKELWNITFNRVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNSA
        H L  C+RA+++W ++   + + +  + +  +  + + ++ S  +L + A   W+IW +RN+++HG    P ++   +  +YL E   A  ++  + NS 
Subjt:  HILLQCQRAKELWNITFNRVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNSA

Query:  V---GHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINF
             +SS   D+ W  PP    K+N DAA        G G + ++ +G IV   A      F P   E+ A++  ++   +L LP   IE+DSQL +N 
Subjt:  V---GHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINF

Query:  LNRTSDPWSCLESLTESIWVLGCNF--CEISFVFSPRVRNRAADILAKFA
        L  +  P S   SL ++I +L  NF   +IS V+  R  N AA +LAKFA
Subjt:  LNRTSDPWSCLESLTESIWVLGCNF--CEISFVFSPRVRNRAADILAKFA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 2.5e-21; % identity: 27.56)
Query:  DHILLQCQRAKELWNITFNRVFQDVEFNGN-FVDRWMLINSSCSLEELGR----VAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRS
        +H+L +C  A+ +W I+    + + E+  + + + + ++N    + +LG+    V    W +W  RN+ +          +       L+   E F + S
Subjt:  DHILLQCQRAKELWNITFNRVFQDVEFNGN-FVDRWMLINSSCSLEELGR----VAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKELSEAFPKRS

Query:  QSRNSAVGHSSFP-----LDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIES
         +R    G +S P     L   W  PP    K N DA W   +P  G+G + R   G ++  GAR + ++ +  EAEL A+   V         ++I ES
Subjt:  QSRNSAVGHSSFP-----LDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIES

Query:  DSQLAINFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAK
        D+Q  +N LN + D W  L+   E I  L  +F E+ F F+PR  N+ AD +A+
Subjt:  DSQLAINFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAK

AT3G09510.1 Ribonuclease H-like superfamily protein (e value: 9.2e-08; % identity: 21.28)
Query:  LERTLAESLSAAVVSSSSEEESWMEVKTGWSKRRRRECRLLSDHILLQCQRAKELWNITFNRVFQDVEFNGNF---VDRWMLINSSCSLEELGRVAVT--
        L+  L  +LS A+ ++       M +     +  R    +  +H L  C  A   W ++ + + ++   + +F   +   +      ++ +  ++     
Subjt:  LERTLAESLSAAVVSSSSEEESWMEVKTGWSKRRRRECRLLSDHILLQCQRAKELWNITFNRVFQDVEFNGNF---VDRWMLINSSCSLEELGRVAVT--

Query:  CWTIWMDRN----KKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSI
         W IW  RN     K    P   +        ++L        K++ S    +  +       W  PP    K N DA +     E   G + R   G+ 
Subjt:  CWTIWMDRN----KKVHGDPLPPINMRSLWITNYLKELSEAFPKRSQSRNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSI

Query:  VGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKF
        +  G+  +  + +P EAE +A+L  ++        +V +E D Q  IN +N  S   S L +  E I      F  I F F  R  N+ A +LAK+
Subjt:  VGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKF

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.1e-05; % identity: 30.15)
Query:  PPPDSWKVNVDAAWTSSSPEIGLGIV---CRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINFLNRTSDPWSCLESL
        P  D   +  DAAW   + ++G G V   C         + AR +     P  AE  A+   ++ A  + + K+ + SDSQ  I  +   S P +    +
Subjt:  PPPDSWKVNVDAAWTSSSPEIGLGIV---CRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKVIIESDSQLAINFLNRTSDPWSCLESL

Query:  TESIWVLGCNFCEISFVFSPRVRNRAADILAKFAKI
           I  L   F ++SF F PR  NR AD LAK + I
Subjt:  TESIWVLGCNFCEISFVFSPRVRNRAADILAKFAKI

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 5.4e-16; % identity: 27.91)
Query:  CRLLSDHILLQCQRAKELWNITFNRVFQDVEFNGNFVDR------WM--LINSSCSLEELGR-VAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKE
        C+   +H+L +C  A+  W I+       +   G + D       W+  L N +   E+  + V    W +W +RN+ V                + L+E
Subjt:  CRLLSDHILLQCQRAKELWNITFNRVFQDVEFNGNFVDR------WM--LINSSCSLEELGR-VAVTCWTIWMDRNKKVHGDPLPPINMRSLWITNYLKE

Query:  LSEAFPKRSQSRNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKV
                S      V  SS      W PPP    K N DA W   +   G+G V R   G +   GAR + K     EAEL A+   V      +   V
Subjt:  LSEAFPKRSQSRNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLRLPKV

Query:  IIESDSQLAINFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAK
        I ESDSQ+ I  LN   + W  L+   + +  L   F E+ FVF PR  N  A+ +A+
Subjt:  IIESDSQLAINFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAK


Sequences
CDS sequence
ATGCTGCTGTTCGTGCTTGAAACTTGGTCGGAAAAACACCCACTGCAGAAACGTTGCCGCGCGACGCTGGAACTAAACAAAAACCACCGCTACACGTCGAAACCCACGCG
CGAATCACTGAAACGACGCCAAAAACTCGCTGGCCCGAGGACAAACGCACGAAGCCGTGCCGAACAGCTGCTGCAACCCAACCGCCGTCGACCTGCCGTGGAAATCGTCG
GAAAGCACGCGCCCGCCGGAGTTGCTGAAGAACGATGCCGGAGAACGCCGCTATCCGACCACCACCACCCTTCAAACCGCGCGAACGACCGCAACTCACTGGAACGCACG
CTGGCCGAGTCGCTGTCCGCCGCCGTGGTCTCGAGCTCGTCGGAGGAGGAGTCATGGATGGAGGTGAAGACAGGGTGGTCGAAAAGAAGAAGAAGGGAATGTAGGCTGCT
GTCGGATCATATATTGCTTCAATGTCAAAGAGCTAAGGAATTATGGAATATTACCTTTAATCGTGTGTTTCAGGATGTTGAATTTAATGGCAACTTTGTGGACAGATGGA
TGCTCATCAATTCAAGCTGTAGTCTGGAGGAGCTAGGTCGAGTTGCGGTTACATGCTGGACGATTTGGATGGATAGAAATAAAAAAGTTCATGGTGATCCTCTTCCCCCA
ATAAATATGAGAAGTCTTTGGATTACAAATTACCTAAAAGAGCTTTCGGAAGCTTTCCCCAAAAGATCGCAAAGCAGAAATTCAGCTGTCGGTCATTCTTCGTTTCCCTT
GGATAATGCCTGGGCTCCGCCTCCTCCTGATTCATGGAAAGTGAATGTTGATGCTGCATGGACTTCTTCATCGCCGGAGATTGGTCTTGGAATTGTTTGTAGGACATTTG
ATGGGAGCATTGTTGGTGCTGGAGCTCGATTTATTGATAAATCTTTTGATCCCCCTGAAGCGGAGCTGAGAGCAATTTTGGAAGGAGTTCGTCTGGCGTTGGATTTAAGG
CTCCCTAAGGTGATTATTGAATCCGATTCCCAATTGGCCATAAATTTTCTCAATAGAACTTCAGATCCGTGGAGTTGTTTGGAGAGTCTGACTGAGAGTATATGGGTCTT
GGGCTGTAATTTCTGTGAGATTAGTTTTGTTTTTAGCCCTAGAGTGAGAAATAGGGCTGCTGATATTCTGGCCAAATTTGCAAAAATTTTGAAACAGGATATTAGAGAAG
TTGTATTTTATAGTTGA
mRNA sequence
ATGCTGCTGTTCGTGCTTGAAACTTGGTCGGAAAAACACCCACTGCAGAAACGTTGCCGCGCGACGCTGGAACTAAACAAAAACCACCGCTACACGTCGAAACCCACGCG
CGAATCACTGAAACGACGCCAAAAACTCGCTGGCCCGAGGACAAACGCACGAAGCCGTGCCGAACAGCTGCTGCAACCCAACCGCCGTCGACCTGCCGTGGAAATCGTCG
GAAAGCACGCGCCCGCCGGAGTTGCTGAAGAACGATGCCGGAGAACGCCGCTATCCGACCACCACCACCCTTCAAACCGCGCGAACGACCGCAACTCACTGGAACGCACG
CTGGCCGAGTCGCTGTCCGCCGCCGTGGTCTCGAGCTCGTCGGAGGAGGAGTCATGGATGGAGGTGAAGACAGGGTGGTCGAAAAGAAGAAGAAGGGAATGTAGGCTGCT
GTCGGATCATATATTGCTTCAATGTCAAAGAGCTAAGGAATTATGGAATATTACCTTTAATCGTGTGTTTCAGGATGTTGAATTTAATGGCAACTTTGTGGACAGATGGA
TGCTCATCAATTCAAGCTGTAGTCTGGAGGAGCTAGGTCGAGTTGCGGTTACATGCTGGACGATTTGGATGGATAGAAATAAAAAAGTTCATGGTGATCCTCTTCCCCCA
ATAAATATGAGAAGTCTTTGGATTACAAATTACCTAAAAGAGCTTTCGGAAGCTTTCCCCAAAAGATCGCAAAGCAGAAATTCAGCTGTCGGTCATTCTTCGTTTCCCTT
GGATAATGCCTGGGCTCCGCCTCCTCCTGATTCATGGAAAGTGAATGTTGATGCTGCATGGACTTCTTCATCGCCGGAGATTGGTCTTGGAATTGTTTGTAGGACATTTG
ATGGGAGCATTGTTGGTGCTGGAGCTCGATTTATTGATAAATCTTTTGATCCCCCTGAAGCGGAGCTGAGAGCAATTTTGGAAGGAGTTCGTCTGGCGTTGGATTTAAGG
CTCCCTAAGGTGATTATTGAATCCGATTCCCAATTGGCCATAAATTTTCTCAATAGAACTTCAGATCCGTGGAGTTGTTTGGAGAGTCTGACTGAGAGTATATGGGTCTT
GGGCTGTAATTTCTGTGAGATTAGTTTTGTTTTTAGCCCTAGAGTGAGAAATAGGGCTGCTGATATTCTGGCCAAATTTGCAAAAATTTTGAAACAGGATATTAGAGAAG
TTGTATTTTATAGTTGA
Protein sequence
MLLFVLETWSEKHPLQKRCRATLELNKNHRYTSKPTRESLKRRQKLAGPRTNARSRAEQLLQPNRRRPAVEIVGKHAPAGVAEERCRRTPLSDHHHPSNRANDRNSLERT
LAESLSAAVVSSSSEEESWMEVKTGWSKRRRRECRLLSDHILLQCQRAKELWNITFNRVFQDVEFNGNFVDRWMLINSSCSLEELGRVAVTCWTIWMDRNKKVHGDPLPP
INMRSLWITNYLKELSEAFPKRSQSRNSAVGHSSFPLDNAWAPPPPDSWKVNVDAAWTSSSPEIGLGIVCRTFDGSIVGAGARFIDKSFDPPEAELRAILEGVRLALDLR
LPKVIIESDSQLAINFLNRTSDPWSCLESLTESIWVLGCNFCEISFVFSPRVRNRAADILAKFAKILKQDIREVVFYS
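
The CDS and protein sequences listed in this record can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. `CODON_TABLE` and `translate` are illustrative names, not part of any CuGenDB tool, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the CDS in this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First four codons of the CDS listed above.
print(translate("ATGCTGCTGTTC"))  # MLLF
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check; the first codons (ATG CTG CTG TTC) give MLLF, matching the start of the listed protein.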