; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg010557 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg010557
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold5:10167391..10174709
RNA-Seq ExpressionSpg010557
SyntenySpg010557
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ABA91072.1 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]9.9e-2426.42Show/hide
Query:  VGNRGFLRFWLSSLRGVEDEEHGRGEEMHEAVLVEGGCKVK---SLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAY
        VGN   +R W          +H R       V + G C++K    LI +   W  A++R  F  +D   IL+I L     +D + W  D+ G+FSV+SAY
Subjt:  VGNRGFLRFWLSSLRGVEDEEHGRGEEMHEAVLVEGGCKVK---SLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAY

Query:  HLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPVVYTPRATRIMRN
        +LA  ++++  +S+S   HS ++W  +WK  +  + KI AWK  ++ L    N   + +  +  C +  ++ E S+HA+++C         PRA+ ++ +
Subjt:  HLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPVVYTPRATRIMRN

Query:  GPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEI
                    SD S++A++  GG+  ++RD+ GS + A  + +         E +A  EGL +  +       + +E+D + L++ L +   +L+E+
Subjt:  GPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEI

XP_011470502.1 PREDICTED: uncharacterized protein LOC105353223 [Fragaria vesca subsp. vesca]1.6e-2630.85Show/hide
Query:  KVKSLID-ERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSA--EAWRKLWKLNIIPRAKI
        KV+ LID E   W +  ++E F  V+A  I  IPL     +D  +W  DQKG +SV+S YH+A  +      ++S  SH      W+K+WK+N+ P+ ++
Subjt:  KVKSLID-ERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSA--EAWRKLWKLNIIPRAKI

Query:  CAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIH---------AIWECKMAKPVVYTPRATRI-----------MRNGPP----------PPQGR
         AW+ + ++LP    L  KG+D++  C    +  E  +H          +WEC        T  A  I            R G            PP GR
Subjt:  CAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIH---------AIWECKMAKPVVYTPRATRI-----------MRNGPP----------PPQGR

Query:  WKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEIKTLM
         K N D S+   +E GG+G VVRD+ G    A  + I  +      E +A+  GL I  +   +   VE ESD   L+     D+E+L+EI  +M
Subjt:  WKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEIKTLM

XP_022149515.1 uncharacterized protein LOC111017927 [Momordica charantia]5.2e-2541.43Show/hide
Query:  SLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFI
        +L+ +R RW E+ +R +F + +A  ILNIPL   N  DE+IW  D+K KFSVKS Y L   ++S++E  TS+    A+ W+KLW+  +  + KIC W+  
Subjt:  SLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFI

Query:  NDILPNYVNLHSKGIDINQAC-FLYRKKDETSIHAIWECK
        NDI+     L+ KGI I Q C F  ++++E+S H  W C+
Subjt:  NDILPNYVNLHSKGIDINQAC-FLYRKKDETSIHAIWECK

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]3.4e-2427.87Show/hide
Query:  VKSLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWK
        V  LIDE+ +W E  + ++F   DA  I+ IPL +   +D++IW +D+KG +SVKS Y +A  I    + S S+H  +   WR +WKL I  + KI  W+
Subjt:  VKSLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWK

Query:  FINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPV----------------------VYTPR---------------------------
          +D+LP   NL  K +     C       ET  HA+ EC  A+ +                       + PR                           
Subjt:  FINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPV----------------------VYTPR---------------------------

Query:  -----ATRIMRNG-----------------------------PPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAM
               R++ N                               PPP G  K N DA+ + E ++ GLG VVRDS G+   A ++ ++    +++ EA AM
Subjt:  -----ATRIMRNG-----------------------------PPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAM

Query:  WEGLKI---LHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEIKTLMS
          GLK+    H  FG       ESD++++I  ++K   +LTEI  L+S
Subjt:  WEGLKI---LHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEIKTLMS

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]9.9e-2426.32Show/hide
Query:  VKSLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWK
        V  LI   N+W E K+R++F  VD  +IL IPL    ++DE++W +D++G +SVKS Y LA      +  S ++ SH  + W  LW L +  + KI  W+
Subjt:  VKSLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWK

Query:  FINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAK--------------------------------------------------------
          N++LP+  NL  + +     C   +   ET  HA+ ECK A+                                                        
Subjt:  FINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAK--------------------------------------------------------

Query:  -----PVVYTPRATRIM----------------------RNGPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGL--QLIKGSWPISLLEAK
             P++   +A  ++                      +   PPPQ  +K N DA++N++    G+G V+RDS G ++ AG+   L+KGS   SL EA+
Subjt:  -----PVVYTPRATRIM----------------------RNGPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGL--QLIKGSWPISLLEAK

Query:  AMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEI
        A+  GL++      +SL   +ESD ++++  ++  + + +EI
Subjt:  AMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEI

TrEMBL top hitse value%identityAlignment
A0A6J1D5Y4 uncharacterized protein LOC1110179272.5e-2541.43Show/hide
Query:  SLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFI
        +L+ +R RW E+ +R +F + +A  ILNIPL   N  DE+IW  D+K KFSVKS Y L   ++S++E  TS+    A+ W+KLW+  +  + KIC W+  
Subjt:  SLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFI

Query:  NDILPNYVNLHSKGIDINQAC-FLYRKKDETSIHAIWECK
        NDI+     L+ KGI I Q C F  ++++E+S H  W C+
Subjt:  NDILPNYVNLHSKGIDINQAC-FLYRKKDETSIHAIWECK

A0A803PFL5 Uncharacterized protein9.0e-2329.13Show/hide
Query:  VKSLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWK
        V + I +   W   ++ ++F Q+D   IL IPL    S D +IW H+  G +SVKS++HLATSIS  ++ S+SD   +   W+  WKL + P+ KI AWK
Subjt:  VKSLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWK

Query:  FINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPV-------VYTPRA-------TRI-----------MRN-----------------
         I + LP    LH + +  +  C   +   E+  HA++ CK AK V       + T  A       TR+           MR+                 
Subjt:  FINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPV-------VYTPRA-------TRI-----------MRN-----------------

Query:  ----GPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVE-VESDAIDLIHCLHKDEEN
              PPP    K N DA+ N++    G+G VVR+  G ++ A  + ++G +    +EAK ++    IL+    N +++  VE+DA+ +   L+ +  +
Subjt:  ----GPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVE-VESDAIDLIHCLHKDEEN

Query:  LTEIKTLMS
        L+    L++
Subjt:  LTEIKTLMS

A0A803QGC3 Uncharacterized protein1.5e-2229.06Show/hide
Query:  VLVEGGCKVKSLIDERNRWIEAKVRENFNQVDAMDILNI-PLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNII
        VL+     + +L+     W E  +R+ F+  D   +L I PL  IN+ D I WS    G +SV S Y +     + + A  S+ S     W+ +W  ++ 
Subjt:  VLVEGGCKVKSLIDERNRWIEAKVRENFNQVDAMDILNI-PLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNII

Query:  PRAKICAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPVVYTPRATRIMRNGPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDS
        P+ K   W+  N  +P  V LH +G+ ++  C L + +DE   HA+W+C   K ++  P AT+ +    PPP G +  N+DAS        G+  V+RDS
Subjt:  PRAKICAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPVVYTPRATRIMRNGPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDS

Query:  GGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTE
         G+L+ A    + G   + L +A  +  G+ +  +   +  NV+V SD+  +I  +  +  N T+
Subjt:  GGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTE

C7J8D0 Os11g0106066 protein4.8e-2426.42Show/hide
Query:  VGNRGFLRFWLSSLRGVEDEEHGRGEEMHEAVLVEGGCKVK---SLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAY
        VGN   +R W          +H R       V + G C++K    LI +   W  A++R  F  +D   IL+I L     +D + W  D+ G+FSV+SAY
Subjt:  VGNRGFLRFWLSSLRGVEDEEHGRGEEMHEAVLVEGGCKVK---SLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAY

Query:  HLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPVVYTPRATRIMRN
        +LA  ++++  +S+S   HS ++W  +WK  +  + KI AWK  ++ L    N   + +  +  C +  ++ E S+HA+++C         PRA+ ++ +
Subjt:  HLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPVVYTPRATRIMRN

Query:  GPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEI
                    SD S++A++  GG+  ++RD+ GS + A  + +         E +A  EGL +  +       + +E+D + L++ L +   +L+E+
Subjt:  GPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEI

Q2RBM8 Retrotransposon protein, putative, unclassified4.8e-2426.42Show/hide
Query:  VGNRGFLRFWLSSLRGVEDEEHGRGEEMHEAVLVEGGCKVK---SLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAY
        VGN   +R W          +H R       V + G C++K    LI +   W  A++R  F  +D   IL+I L     +D + W  D+ G+FSV+SAY
Subjt:  VGNRGFLRFWLSSLRGVEDEEHGRGEEMHEAVLVEGGCKVK---SLIDERNRWIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAY

Query:  HLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPVVYTPRATRIMRN
        +LA  ++++  +S+S   HS ++W  +WK  +  + KI AWK  ++ L    N   + +  +  C +  ++ E S+HA+++C         PRA+ ++ +
Subjt:  HLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAKPVVYTPRATRIMRN

Query:  GPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEI
                    SD S++A++  GG+  ++RD+ GS + A  + +         E +A  EGL +  +       + +E+D + L++ L +   +L+E+
Subjt:  GPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDEENLTEI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.9e-0428.57Show/hide
Query:  PPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAM-WEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDE
        PP    K N+DA+W  E    G+GW++R+  G ++  G + +  +  +   E +A+ W  L +   N+     +  ESDA  L++ L+ D+
Subjt:  PPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAM-WEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDE

AT3G09510.1 Ribonuclease H-like superfamily protein5.4e-1227.07Show/hide
Query:  WIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFINDILPNYV
        W ++K+ +  +Q D   I  I L +    D+IIW+++  G+++V+S Y L T   S+N  + +    S +   ++W L I+P+ K   W+ ++  L    
Subjt:  WIEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFINDILPNYV

Query:  NLHSKGIDINQACFLYRKKDETSIHAIWECKMA
         L ++G+ I+ +C    +++E+  HA++ C  A
Subjt:  NLHSKGIDINQACFLYRKKDETSIHAIWECKMA

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.4e-0435Show/hide
Query:  LWKLNIIPRAKICAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAK
        +W L I P+ K+  WK +N+ LP    L S+ I I   C   R   ET  H ++ C  A+
Subjt:  LWKLNIIPRAKICAWKFINDILPNYVNLHSKGIDINQACFLYRKKDETSIHAIWECKMAK

AT4G29090.1 Ribonuclease H-like superfamily protein2.2e-0533.7Show/hide
Query:  PPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAM-WEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDE
        PPP    K N+DA+WN + E  G+GWV+R+  G +   G + +     +   E +AM W  L +    +     V  ESD+  LI  L+ DE
Subjt:  PPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAM-WEGLKILHKNFGNSLNVEVESDAIDLIHCLHKDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCTGCGTACGGTCGTCGAAGAGGGGGTTGCGAACGGTCGTCTGCACGAAGGATGGAGGGAGGCTGTACGAAGAGTCTCATTGTGGAGAAGAAGACTCAATGGAGC
TTGCGGGTTGCGTAGAAGGCTCGTCGGTGCGGCGGGAGAAGCTCGTCGGTATGGCGGAAGTGCAGATGAGGTGGGAAATCGAGGTTTCCTTAGGTTTTGGCTCTCAAGTT
TGCGAGGTGTGGAAGACGAAGAGCACGGGCGTGGGGAGGAAATGCATGAGGCAGTTTTGGTGGAGGGGGGTTGCAAAGTGAAGAGTCTCATTGACGAACGCAACAGGTGG
ATTGAAGCTAAGGTTAGAGAAAATTTTAACCAAGTCGACGCAATGGACATTCTCAATATTCCTCTCGGTGAGATCAATTCGAAGGACGAGATCATTTGGAGCCACGACCA
GAAAGGGAAGTTTTCTGTGAAGAGTGCTTATCACTTGGCAACTTCAATTTCCTCGTCAAATGAGGCATCAACGTCGGATCATAGTCATTCGGCTGAGGCTTGGAGGAAAT
TGTGGAAGCTCAACATAATTCCTAGGGCCAAGATCTGTGCTTGGAAATTCATCAATGATATTCTTCCTAATTATGTTAATCTCCATTCTAAAGGGATTGATATTAACCAA
GCTTGCTTTCTGTACAGGAAGAAGGATGAGACCTCCATTCATGCTATATGGGAATGCAAAATGGCTAAGCCAGTGGTTTACACTCCAAGAGCCACCAGAATCATGCGAAA
TGGACCCCCCCCTCCTCAAGGTCGTTGGAAATTTAATTCTGATGCTTCGTGGAATGCAGAGAAGGAGATCGGTGGCTTGGGGTGGGTTGTTCGTGATTCTGGCGGATCTC
TGATCTGTGCGGGCTTGCAGTTAATCAAAGGTTCTTGGCCGATTAGCCTTTTAGAAGCCAAAGCGATGTGGGAAGGGCTTAAAATCTTACACAAAAATTTTGGCAACTCA
TTGAATGTGGAGGTCGAATCGGATGCCATCGATCTGATCCATTGCCTGCACAAAGATGAGGAGAATTTGACAGAGATCAAGACATTGATGAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGCTGCGTACGGTCGTCGAAGAGGGGGTTGCGAACGGTCGTCTGCACGAAGGATGGAGGGAGGCTGTACGAAGAGTCTCATTGTGGAGAAGAAGACTCAATGGAGC
TTGCGGGTTGCGTAGAAGGCTCGTCGGTGCGGCGGGAGAAGCTCGTCGGTATGGCGGAAGTGCAGATGAGGTGGGAAATCGAGGTTTCCTTAGGTTTTGGCTCTCAAGTT
TGCGAGGTGTGGAAGACGAAGAGCACGGGCGTGGGGAGGAAATGCATGAGGCAGTTTTGGTGGAGGGGGGTTGCAAAGTGAAGAGTCTCATTGACGAACGCAACAGGTGG
ATTGAAGCTAAGGTTAGAGAAAATTTTAACCAAGTCGACGCAATGGACATTCTCAATATTCCTCTCGGTGAGATCAATTCGAAGGACGAGATCATTTGGAGCCACGACCA
GAAAGGGAAGTTTTCTGTGAAGAGTGCTTATCACTTGGCAACTTCAATTTCCTCGTCAAATGAGGCATCAACGTCGGATCATAGTCATTCGGCTGAGGCTTGGAGGAAAT
TGTGGAAGCTCAACATAATTCCTAGGGCCAAGATCTGTGCTTGGAAATTCATCAATGATATTCTTCCTAATTATGTTAATCTCCATTCTAAAGGGATTGATATTAACCAA
GCTTGCTTTCTGTACAGGAAGAAGGATGAGACCTCCATTCATGCTATATGGGAATGCAAAATGGCTAAGCCAGTGGTTTACACTCCAAGAGCCACCAGAATCATGCGAAA
TGGACCCCCCCCTCCTCAAGGTCGTTGGAAATTTAATTCTGATGCTTCGTGGAATGCAGAGAAGGAGATCGGTGGCTTGGGGTGGGTTGTTCGTGATTCTGGCGGATCTC
TGATCTGTGCGGGCTTGCAGTTAATCAAAGGTTCTTGGCCGATTAGCCTTTTAGAAGCCAAAGCGATGTGGGAAGGGCTTAAAATCTTACACAAAAATTTTGGCAACTCA
TTGAATGTGGAGGTCGAATCGGATGCCATCGATCTGATCCATTGCCTGCACAAAGATGAGGAGAATTTGACAGAGATCAAGACATTGATGAGCTAG
Protein sequenceShow/hide protein sequence
MGLRTVVEEGVANGRLHEGWREAVRRVSLWRRRLNGACGLRRRLVGAAGEARRYGGSADEVGNRGFLRFWLSSLRGVEDEEHGRGEEMHEAVLVEGGCKVKSLIDERNRW
IEAKVRENFNQVDAMDILNIPLGEINSKDEIIWSHDQKGKFSVKSAYHLATSISSSNEASTSDHSHSAEAWRKLWKLNIIPRAKICAWKFINDILPNYVNLHSKGIDINQ
ACFLYRKKDETSIHAIWECKMAKPVVYTPRATRIMRNGPPPPQGRWKFNSDASWNAEKEIGGLGWVVRDSGGSLICAGLQLIKGSWPISLLEAKAMWEGLKILHKNFGNS
LNVEVESDAIDLIHCLHKDEENLTEIKTLMS