; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037453 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037453
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPolynucleotidyl transferase, ribonuclease H-like superfamily protein
Genome locationscaffold11:32725210..32726399
RNA-Seq ExpressionSpg037453
SyntenySpg037453
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]8.7e-2331.93Show/hide
Query:  WVEISSRCTKEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLS
        W  +    + EE GL A  CW +W  RN  + E       +    + +  + +  AN+ + +   + ++ +  +P   W PPP G +KINVD A     S
Subjt:  WVEISSRCTKEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLS

Query:  VMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCN-VEG-LVEEIWALENGLRYIRYQYVS
        V G+G+V R+++G  + A          A   EL A + G+R AID+G  + +LE D Q  IN +    E  CN ++G L+EE+  L +  R +  Q+  
Subjt:  VMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCN-VEG-LVEEIWALENGLRYIRYQYVS

Query:  RKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLVEND
        R GNKVA +LA+ A   N   +W++  P WL  ++E D
Subjt:  RKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLVEND

XP_024041100.1 uncharacterized protein LOC112098855 isoform X1 [Citrus clementina]1.9e-2231.91Show/hide
Query:  EISSRCTKEELGLVAVTCWSIWEDRNKVMH----EDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDH
        +++ +  K++LGL+  TCW+IW  RN+V++    EDP     + +  +  YLR              Q    + N+  +VW+PPP G +K+NVDAA    
Subjt:  EISSRCTKEELGLVAVTCWSIWEDRNKVMH----EDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDH

Query:  LSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVS
         S  G+G+V RDS G I+ A+  S+    +    E +A+L G++ A    C  II+ESD    +    +       +   VEEI A          Q+V 
Subjt:  LSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVS

Query:  RKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLV
        R  N VA SLAK A        W++ FP  + LL+
Subjt:  RKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLV

XP_024041103.1 uncharacterized protein LOC112098855 isoform X2 [Citrus clementina]1.9e-2231.91Show/hide
Query:  EISSRCTKEELGLVAVTCWSIWEDRNKVMH----EDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDH
        +++ +  K++LGL+  TCW+IW  RN+V++    EDP     + +  +  YLR              Q    + N+  +VW+PPP G +K+NVDAA    
Subjt:  EISSRCTKEELGLVAVTCWSIWEDRNKVMH----EDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDH

Query:  LSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVS
         S  G+G+V RDS G I+ A+  S+    +    E +A+L G++ A    C  II+ESD    +    +       +   VEEI A          Q+V 
Subjt:  LSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVS

Query:  RKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLV
        R  N VA SLAK A        W++ FP  + LL+
Subjt:  RKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLV

XP_024041104.1 uncharacterized protein LOC112098855 isoform X3 [Citrus clementina]1.9e-2231.91Show/hide
Query:  EISSRCTKEELGLVAVTCWSIWEDRNKVMH----EDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDH
        +++ +  K++LGL+  TCW+IW  RN+V++    EDP     + +  +  YLR              Q    + N+  +VW+PPP G +K+NVDAA    
Subjt:  EISSRCTKEELGLVAVTCWSIWEDRNKVMH----EDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDH

Query:  LSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVS
         S  G+G+V RDS G I+ A+  S+    +    E +A+L G++ A    C  II+ESD    +    +       +   VEEI A          Q+V 
Subjt:  LSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVS

Query:  RKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLV
        R  N VA SLAK A        W++ FP  + LL+
Subjt:  RKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLV

XP_038719958.1 uncharacterized protein LOC120012588 [Tripterygium wilfordii]3.3e-2229.55Show/hide
Query:  WVEISSRCTKEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPN----EVWSPPPRGCWKINVDAAWS
        W    ++ T  +L ++A TC  IW+ RN  ++E+             E++ + + A     S+Q    +   + P     EVW PP  G  K+N DA  +
Subjt:  WVEISSRCTKEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPN----EVWSPPPRGCWKINVDAAWS

Query:  DHLSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASE---VWCNVEGLVEEIWALENGLRYIR
        +  +  G+G+V RDS G ++ + + S  +      AE +A L  V  A+D+G   I+LE D Q+ I  L         WC   G+++EI  +  G    +
Subjt:  DHLSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASE---VWCNVEGLVEEIWALENGLRYIR

Query:  YQYVSRKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLVENDRSSF
        + +V R GN  A  +AK A    C + WV+  PD L L+V+ D+ +F
Subjt:  YQYVSRKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLVENDRSSF

TrEMBL top hitse value%identityAlignment
A0A0J8BAU9 Uncharacterized protein4.8e-1926.16Show/hide
Query:  LEQAVLSC--DVEEIRKIPINKNLEDRMIWHYDKLGKYTVKSGWVEISSRCTKEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEAN
        LE+  L C  D E +R+       +D  +W    +  +     W  I     KE+  +  +  W +W+ RN+++ ++ +       +   + LR++ + N
Subjt:  LEQAVLSC--DVEEIRKIPINKNLEDRMIWHYDKLGKYTVKSGWVEISSRCTKEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEAN

Query:  SKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESD
               K+  +R        W  P  G  KINVDAA +     +G+G+V RD +G ++ A S +      A  AE  A+L     AI  G  +++LESD
Subjt:  SKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESD

Query:  CQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVSRKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLVEND
         QV IN +    +   +++ ++E++  L      I++ +  R+ N++A  LAK A  + C E W+   P W+  L+ +D
Subjt:  CQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVSRKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLVEND

A0A1U8J822 uncharacterized protein LOC1079027188.2e-1928.7Show/hide
Query:  KEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCR
        K   G++ VT W IW  RNK +HE  +    +   +IR +   Y        S+ K   + ++ S  + WSPPP+G  KINVD   S        G + R
Subjt:  KEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCR

Query:  DSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVSRKGNKVADSLA
        + +G I+ +     +L+     AE  A+L G++ A+DLG  ++ILESD ++ +N + K+SE +        +   L    +  R+Q+++R+GN+    +A
Subjt:  DSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVSRKGNKVADSLA

Query:  KRAKLSNCSESWVDYFPDWLCLLVENDRSS
             +     WV+  P     + ++DR S
Subjt:  KRAKLSNCSESWVDYFPDWLCLLVENDRSS

A0A5E4FZN9 PREDICTED: retrotransposon4.2e-2331.93Show/hide
Query:  WVEISSRCTKEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLS
        W  +    + EE GL A  CW +W  RN  + E       +    + +  + +  AN+ + +   + ++ +  +P   W PPP G +KINVD A     S
Subjt:  WVEISSRCTKEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLS

Query:  VMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCN-VEG-LVEEIWALENGLRYIRYQYVS
        V G+G+V R+++G  + A          A   EL A + G+R AID+G  + +LE D Q  IN +    E  CN ++G L+EE+  L +  R +  Q+  
Subjt:  VMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCN-VEG-LVEEIWALENGLRYIRYQYVS

Query:  RKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLVEND
        R GNKVA +LA+ A   N   +W++  P WL  ++E D
Subjt:  RKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLVEND

A0A803PWX1 Uncharacterized protein2.3e-2129.08Show/hide
Query:  DRMIWH----YDKL---GKYTVKSGWVEISSRCTKEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSP
        +R +W+    Y +L   G+  V +  + +SS  TK+E     +  W++W  RN V H    P       W  ++L  + E N      Q++   +     
Subjt:  DRMIWH----YDKL---GKYTVKSGWVEISSRCTKEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSP

Query:  NEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNV
           W+PP RG +K+NVDA      ++ G+  V RD +G ++ A++   +        EL+AIL+G++  I     S  +ESDC  A+N + K  E   +V
Subjt:  NEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNV

Query:  EGLVEEIWALENGLRYIRYQYVSRKGNKVADSLAKRAKLSNCSESWVDYFP
        +GL+ +I  L    R     +V R+ N+VA  LA  A ++  S  WV   P
Subjt:  EGLVEEIWALENGLRYIRYQYVSRKGNKVADSLAKRAKLSNCSESWVDYFP

A0A803Q8J4 Uncharacterized protein3.7e-1929.17Show/hide
Query:  EISSRCTKEELGLVAVTCWSIWEDRNKVMH-EDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSP-----NEVWSPPPRGCWKINVDAAWS
        +IS   TK EL  +  T WSIW DRN V+H + P  PT+  ++  + +L NY+       S+Q+ +    ++ P     ++ WSPPP  C K+NVDAA+ 
Subjt:  EISSRCTKEELGLVAVTCWSIWEDRNKVMH-EDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSP-----NEVWSPPPRGCWKINVDAAWS

Query:  DHLSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQY
        +  + +G G + RDS+G +  A S   +        E K +   ++ A  L     ++E+D  +  N L K +    + + L+ ++    + L  +   +
Subjt:  DHLSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQY

Query:  VSRKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLVEND
        V R GN+ A  LAK+A + +   +W++ FP  +  +V  D
Subjt:  VSRKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLVEND

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein8.4e-0826.29Show/hide
Query:  CWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDSDGAIIEAS
        CW IW+ RN+++ ++           I    +  ++A +   S+Q      R+ +P    +      +   VDAAW    S+ G G V + +  +  E +
Subjt:  CWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDSDGAIIEAS

Query:  SFSSDL--IPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCN-VEGLVEEIWALENGLRYIRYQYVSRKGNKVADSLAK
        +FS+     P    AE  AI   +  A+ L    +++ SD +  ++ L   S V  N + GL+ EI ++ N  R I +Q++ R  N +AD+ AK
Subjt:  SFSSDL--IPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCN-VEGLVEEIWALENGLRYIRYQYVSRKGNKVADSLAK

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein8.4e-1625.57Show/hide
Query:  LVAVTCWSIWEDRNKVM---HEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDS
        LV    W +W+ RN++M    E   P  ++R+      + ++EE +++ +   K +  +   + +  W  PP    K N DA W       GIG + R+ 
Subjt:  LVAVTCWSIWEDRNKVM---HEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRINSPNEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDS

Query:  DGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVSRKGNKVADSLAKR
         G ++   + +         AEL+A+   V          II ESD Q  +N L  + + W  ++  +E+I  L +    +++++  R GNKVAD +A+ 
Subjt:  DGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALENGLRYIRYQYVSRKGNKVADSLAKR

Query:  A-KLSNCSESWVDYFPDWL
        +   SN         P WL
Subjt:  A-KLSNCSESWVDYFPDWL

AT2G46460.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein8.4e-0824.32Show/hide
Query:  WSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGL
        W+ PP G  K N D +++  +     G + RD  G  + A   + +    A  +E +A+L+ ++     G   I  E D +  +  L +    + ++   
Subjt:  WSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGL

Query:  VEEIWALENGLRYIRYQYVSRKGNKVADSLAKRAKLSNCSESWVDYFP
        + ++ A +   +  R+ +++RK NK AD LAK     N    + DY P
Subjt:  VEEIWALENGLRYIRYQYVSRKGNKVADSLAKRAKLSNCSESWVDYFP

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.7e-0528.68Show/hide
Query:  INVDAAWSDHLSVMGIGIVCRDSD--GAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALE
        I  DAAW      +G G V R+     A+   S+  +  +P    AE  A+ + ++ A  +G   + + SD Q  I  +T  S       G++ +I  L 
Subjt:  INVDAAWSDHLSVMGIGIVCRDSD--GAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEGLVEEIWALE

Query:  NGLRYIRYQYVSRKGNKVADSLAKRAKLS
         G   + + +V R  N+VAD LAK + +S
Subjt:  NGLRYIRYQYVSRKGNKVADSLAKRAKLS

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-1227.03Show/hide
Query:  WSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRIN-SPNEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDSDGAIIEA-
        W IW+  N ++         K    +   L + +E    T ++++Q  NR  + S N  WSPP R   K N DA+  +  +V G+G + R+S G +IE  
Subjt:  WSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQTTNRRIN-SPNEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDSDGAIIEA-

Query:  -SSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFL-TKASEVWCNVEGLVEEIWALENGLRYIRYQYVSRKGNKVADSLAKRAKLSN
           F   +  E   AE   ++  ++ +   G   +I E D Q     + TK+S     ++  ++ I +       I + +  R+ N  AD LAK+A   N
Subjt:  -SSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFL-TKASEVWCNVEGLVEEIWALENGLRYIRYQYVSRKGNKVADSLAKRAKLSN

Query:  CSESWVDYFPDWLCLLVENDRS
           S     P +L   V ND S
Subjt:  CSESWVDYFPDWLCLLVENDRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTAGAACAGGCTGTGCTCAGTTGTGATGTGGAGGAAATAAGGAAAATTCCTATCAATAAAAATCTGGAAGATAGAATGATATGGCATTATGATAAATTGGGGAA
ATATACGGTCAAGAGTGGATGGGTAGAGATCAGCTCGAGATGCACGAAGGAGGAGCTAGGCCTTGTAGCAGTTACATGCTGGTCTATCTGGGAGGACAGAAATAAAGTCA
TGCATGAAGATCCTATCCCTCCAACAATCAAAAGAAGCCAGTGGATTAGAGAATACCTGAGAAATTACGAGGAGGCGAACTCGAAGACTGATAGCTCGCAAAAGCAGACC
ACAAATCGAAGAATCAATTCTCCAAATGAAGTTTGGTCTCCTCCTCCGAGAGGCTGTTGGAAGATCAATGTTGATGCGGCATGGTCGGATCATCTTTCGGTGATGGGAAT
CGGCATTGTGTGTAGGGATTCTGATGGAGCCATCATCGAAGCTTCAAGTTTCTCTTCTGACTTGATTCCTGAGGCCCCTGGCGCTGAATTGAAGGCAATCTTGATGGGAG
TTAGAAGGGCTATCGATTTGGGTTGTGGGAGTATTATTTTGGAATCAGATTGTCAAGTGGCTATTAACTTCCTAACAAAGGCTTCTGAAGTTTGGTGTAATGTAGAGGGC
CTAGTTGAAGAGATTTGGGCTTTGGAAAATGGGCTCAGATATATTAGATATCAATATGTGTCCAGAAAGGGGAATAAAGTGGCTGATAGTTTAGCCAAGAGAGCAAAATT
GTCAAATTGTAGCGAGTCTTGGGTTGATTATTTCCCAGATTGGTTGTGTTTGTTGGTCGAGAATGACCGTTCTTCATTTGCCCAAGTGGCAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTTAGAACAGGCTGTGCTCAGTTGTGATGTGGAGGAAATAAGGAAAATTCCTATCAATAAAAATCTGGAAGATAGAATGATATGGCATTATGATAAATTGGGGAA
ATATACGGTCAAGAGTGGATGGGTAGAGATCAGCTCGAGATGCACGAAGGAGGAGCTAGGCCTTGTAGCAGTTACATGCTGGTCTATCTGGGAGGACAGAAATAAAGTCA
TGCATGAAGATCCTATCCCTCCAACAATCAAAAGAAGCCAGTGGATTAGAGAATACCTGAGAAATTACGAGGAGGCGAACTCGAAGACTGATAGCTCGCAAAAGCAGACC
ACAAATCGAAGAATCAATTCTCCAAATGAAGTTTGGTCTCCTCCTCCGAGAGGCTGTTGGAAGATCAATGTTGATGCGGCATGGTCGGATCATCTTTCGGTGATGGGAAT
CGGCATTGTGTGTAGGGATTCTGATGGAGCCATCATCGAAGCTTCAAGTTTCTCTTCTGACTTGATTCCTGAGGCCCCTGGCGCTGAATTGAAGGCAATCTTGATGGGAG
TTAGAAGGGCTATCGATTTGGGTTGTGGGAGTATTATTTTGGAATCAGATTGTCAAGTGGCTATTAACTTCCTAACAAAGGCTTCTGAAGTTTGGTGTAATGTAGAGGGC
CTAGTTGAAGAGATTTGGGCTTTGGAAAATGGGCTCAGATATATTAGATATCAATATGTGTCCAGAAAGGGGAATAAAGTGGCTGATAGTTTAGCCAAGAGAGCAAAATT
GTCAAATTGTAGCGAGTCTTGGGTTGATTATTTCCCAGATTGGTTGTGTTTGTTGGTCGAGAATGACCGTTCTTCATTTGCCCAAGTGGCAGTTTAA
Protein sequenceShow/hide protein sequence
MKLEQAVLSCDVEEIRKIPINKNLEDRMIWHYDKLGKYTVKSGWVEISSRCTKEELGLVAVTCWSIWEDRNKVMHEDPIPPTIKRSQWIREYLRNYEEANSKTDSSQKQT
TNRRINSPNEVWSPPPRGCWKINVDAAWSDHLSVMGIGIVCRDSDGAIIEASSFSSDLIPEAPGAELKAILMGVRRAIDLGCGSIILESDCQVAINFLTKASEVWCNVEG
LVEEIWALENGLRYIRYQYVSRKGNKVADSLAKRAKLSNCSESWVDYFPDWLCLLVENDRSSFAQVAV