CuGenDBv2

Spg015336 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg015336
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold3:43560353..43566052
RNA-Seq Expression: Spg015336
Synteny: Spg015336
Gene Ontology terms: NA
InterPro domains: IPR025558 - Domain of unknown function DUF4283
                  IPR026960 - Reverse transcriptase zinc-binding domain
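As a small illustration of how the genome location string in this record can be read, the Python sketch below splits it into scaffold, start, and end, then computes the length of the span. It assumes the coordinates are 1-based and inclusive, which the page does not state. The function names `parse_location` and `span_length` are hypothetical helpers for this example and are not part of CuGenDB.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold3:43560353..43566052")
print(seqid, span_length(start, end))  # scaffold3 5700
```

Under that assumption the gene spans 5700 bp on scaffold3.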


Homology
GenBank top hits (columns: e value, % identity, alignment)
RVW44814.1 putative ribonuclease H protein [Vitis vinifera] (e value: 1.3e-35; % identity: 31.44)
Query:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL
        Q F +F++F V  G++IRFW+D+W   +PL   +P L  V   KNA I       R  +WN   RR L D E+ +  +L   L+ + +  +  DK  W +
Subjt:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL

Query:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG
         PSG  + KS        S +        +W    P KVK F+W VA++ LNT+D +Q +  +  LSP+ C+LC+   E++D+LFLHC   +  W+ +  
Subjt:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG

Query:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR
        L  + +   R   D        +    +  V+   A    MW++W+ RNAR F+DK+ +     +S++ +AS W +   K+F    L M+  DW A+ +
Subjt:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR

RVW52490.1 putative ribonuclease H protein [Vitis vinifera] (e value: 7.7e-36; % identity: 31.77)
Query:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL
        Q F +F++F V  G++IRFW+D+W   +PL   +P L  V   KNA I       R  +WN   RR L D E+ +  +L   L+ + +  +  DK  W +
Subjt:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL

Query:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG
         PSG  + KS        S +        +W    P KVK F+W VA++ LNT+D +Q +  +  LSP+ C+LC+   E++D+LFLHC   +  W+ +  
Subjt:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG

Query:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR
        L  + +   R   D        +    K  V+   A    MW++W+ RNAR F+DK+ +     +S+  +AS W +   K+F    L M+  DW A+ +
Subjt:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR

RVW53010.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 1.3e-35; % identity: 31.44)
Query:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL
        Q F +F++F V  G++IRFW+D+W   +PL   +P L  V   KNA I       R  +WN   RR L D E+ +  +L   L+ + +  +  DK  W +
Subjt:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL

Query:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG
         PSG  + KS        S +        +W    P KVK F+W VA++ LNT+D +Q +  +  LSP+ C+LC+   E++D+LFLHC   +  W+ +  
Subjt:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG

Query:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR
        L  + +   R   D        +    +  V+   A    MW++W+ RNAR F+DK+ +     +S++ +AS W +   K+F    L M+  DW A+ +
Subjt:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR

RVX23716.1 putative ribonuclease H protein [Vitis vinifera] (e value: 1.3e-35; % identity: 31.44)
Query:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL
        Q F +F++F V  G++IRFW+D+W   +PL   +P L  V   KNA I       R  +WN   RR L D E+ +  +L   L+ + +  +  DK  W +
Subjt:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL

Query:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG
         PSG  + KS        S +        +W    P KVK F+W VA++ LNT+D +Q +  +  LSP+ C+LC+   E++D+LFLHC   +  W+ +  
Subjt:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG

Query:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR
        L  + +   R   D        +    +  V+   A    MW++W+ RNAR F+DK+ +     +S++ +AS W +   K+F    L M+  DW A+ +
Subjt:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia] (e value: 1.0e-51; % identity: 42.25)
Query:  KGWKVFANLLKD-----FVDGKVFDEKKEREKQPNSQTFRKKSYAEAIKYTHNNDSGFPPAFNMSGHKNAEPQTTAGDYPYINPFQPDKALLKCPSKEMA
        + WK  A ++K        + ++ D++    K   ++  R+ ++ E I  T  +        + S   +   + T   Y  INPFQ DKAL+KCPSK++A
Subjt:  KGWKVFANLLKD-----FVDGKVFDEKKEREKQPNSQTFRKKSYAEAIKYTHNNDSGFPPAFNMSGHKNAEPQTTAGDYPYINPFQPDKALLKCPSKEMA

Query:  NLLVTNKGWVSFGPIILKVERWNKNLHGRINVVPSYGGWVRIRNLPPHLWHLQTFKALGDCLGGFIEYAEPNSLLIECVEVGIRVRGNYCGFIPGEIEVV
         LL+TNKGWV+FGP+ +K+E WN  LHGR  + PSYG WV+IRN+P HLW L TFKA+G+ LGGFI+Y + NS  IEC +V I+V+ NYCGFIP EI  +
Subjt:  NLLVTNKGWVSFGPIILKVERWNKNLHGRINVVPSYGGWVRIRNLPPHLWHLQTFKALGDCLGGFIEYAEPNSLLIECVEVGIRVRGNYCGFIPGEIEVV

Query:  EDDQIFKTQVV---------------HGSFSSEAAHVFFRGPMDTDFNPVDKWRIENG
        +    F+ +VV               HG FSSEAA  F +G  +   N +D+WR+ENG
Subjt:  EDDQIFKTQVV---------------HGSFSSEAAHVFFRGPMDTDFNPVDKWRIENG

TrEMBL top hits (columns: e value, % identity, alignment)
A0A438EAW3 Putative ribonuclease H protein (e value: 6.3e-36; % identity: 31.44)
Query:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL
        Q F +F++F V  G++IRFW+D+W   +PL   +P L  V   KNA I       R  +WN   RR L D E+ +  +L   L+ + +  +  DK  W +
Subjt:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL

Query:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG
         PSG  + KS        S +        +W    P KVK F+W VA++ LNT+D +Q +  +  LSP+ C+LC+   E++D+LFLHC   +  W+ +  
Subjt:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG

Query:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR
        L  + +   R   D        +    +  V+   A    MW++W+ RNAR F+DK+ +     +S++ +AS W +   K+F    L M+  DW A+ +
Subjt:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR

A0A438EXL3 Putative ribonuclease H protein (e value: 3.7e-36; % identity: 31.77)
Query:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL
        Q F +F++F V  G++IRFW+D+W   +PL   +P L  V   KNA I       R  +WN   RR L D E+ +  +L   L+ + +  +  DK  W +
Subjt:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL

Query:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG
         PSG  + KS        S +        +W    P KVK F+W VA++ LNT+D +Q +  +  LSP+ C+LC+   E++D+LFLHC   +  W+ +  
Subjt:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG

Query:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR
        L  + +   R   D        +    K  V+   A    MW++W+ RNAR F+DK+ +     +S+  +AS W +   K+F    L M+  DW A+ +
Subjt:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR

A0A438EZ36 LINE-1 retrotransposable element ORF2 protein (e value: 6.3e-36; % identity: 31.44)
Query:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL
        Q F +F++F V  G++IRFW+D+W   +PL   +P L  V   KNA I       R  +WN   RR L D E+ +  +L   L+ + +  +  DK  W +
Subjt:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL

Query:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG
         PSG  + KS        S +        +W    P KVK F+W VA++ LNT+D +Q +  +  LSP+ C+LC+   E++D+LFLHC   +  W+ +  
Subjt:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG

Query:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR
        L  + +   R   D        +    +  V+   A    MW++W+ RNAR F+DK+ +     +S++ +AS W +   K+F    L M+  DW A+ +
Subjt:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR

A0A438KR90 Putative ribonuclease H protein (e value: 6.3e-36; % identity: 31.44)
Query:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL
        Q F +F++F V  G++IRFW+D+W   +PL   +P L  V   KNA I       R  +WN   RR L D E+ +  +L   L+ + +  +  DK  W +
Subjt:  QSFEEFSKFQVSCGNKIRFWEDVWCDTRPLKVVFPYLFDVSYKKNASIKDCWD-ERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLG-NQEDKILWKL

Query:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG
         PSG  + KS        S +        +W    P KVK F+W VA++ LNT+D +Q +  +  LSP+ C+LC+   E++D+LFLHC   +  W+ +  
Subjt:  EPSGASSCKSMLRKSINISPNICKSLVGQIWKHNSPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGG

Query:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR
        L  + +   R   D        +    +  V+   A    MW++W+ RNAR F+DK+ +     +S++ +AS W +   K+F    L M+  DW A+ +
Subjt:  LMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLLWK-RNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR

A0A6J1D6X4 uncharacterized protein LOC111018186 (e value: 4.8e-52; % identity: 42.25)
Query:  KGWKVFANLLKD-----FVDGKVFDEKKEREKQPNSQTFRKKSYAEAIKYTHNNDSGFPPAFNMSGHKNAEPQTTAGDYPYINPFQPDKALLKCPSKEMA
        + WK  A ++K        + ++ D++    K   ++  R+ ++ E I  T  +        + S   +   + T   Y  INPFQ DKAL+KCPSK++A
Subjt:  KGWKVFANLLKD-----FVDGKVFDEKKEREKQPNSQTFRKKSYAEAIKYTHNNDSGFPPAFNMSGHKNAEPQTTAGDYPYINPFQPDKALLKCPSKEMA

Query:  NLLVTNKGWVSFGPIILKVERWNKNLHGRINVVPSYGGWVRIRNLPPHLWHLQTFKALGDCLGGFIEYAEPNSLLIECVEVGIRVRGNYCGFIPGEIEVV
         LL+TNKGWV+FGP+ +K+E WN  LHGR  + PSYG WV+IRN+P HLW L TFKA+G+ LGGFI+Y + NS  IEC +V I+V+ NYCGFIP EI  +
Subjt:  NLLVTNKGWVSFGPIILKVERWNKNLHGRINVVPSYGGWVRIRNLPPHLWHLQTFKALGDCLGGFIEYAEPNSLLIECVEVGIRVRGNYCGFIPGEIEVV

Query:  EDDQIFKTQVV---------------HGSFSSEAAHVFFRGPMDTDFNPVDKWRIENG
        +    F+ +VV               HG FSSEAA  F +G  +   N +D+WR+ENG
Subjt:  EDDQIFKTQVV---------------HGSFSSEAAHVFFRGPMDTDFNPVDKWRIENG

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCCTCTGTTCGACAGGGGTCAGTGAAGATAGAAAGCAAAGTCTTCTACTGTGGCTTTGACCGACACTTTAAAGCTGGAATTGATTTCAAAGGATGGAAAGTTTTTGC
GAACCTGCTGAAAGACTTCGTTGATGGAAAGGTCTTCGATGAAAAAAAGGAAAGAGAGAAGCAACCAAACTCTCAGACTTTCCGGAAGAAATCCTATGCGGAGGCTATTA
AGTACACGCACAATAACGATTCTGGTTTCCCCCCTGCTTTTAACATGAGTGGCCACAAGAATGCTGAACCCCAAACAACAGCGGGAGACTACCCTTATATAAATCCTTTC
CAGCCCGACAAAGCTCTGTTAAAATGTCCATCAAAGGAGATGGCAAATTTATTGGTCACAAACAAGGGGTGGGTTAGTTTTGGGCCCATTATCTTAAAAGTTGAGAGATG
GAACAAAAATCTTCATGGTAGAATAAATGTGGTGCCCAGTTATGGAGGGTGGGTGAGAATTAGAAATCTCCCGCCGCATTTATGGCATTTGCAAACATTTAAGGCTTTGG
GAGATTGTCTGGGAGGCTTCATTGAATATGCGGAACCTAATTCCTTACTCATCGAATGCGTGGAAGTTGGGATCAGAGTGAGAGGAAATTACTGTGGCTTCATCCCTGGA
GAAATTGAGGTTGTTGAGGATGACCAGATCTTCAAAACTCAGGTTGTCCATGGCAGCTTCTCGTCGGAAGCGGCGCATGTCTTTTTCAGAGGCCCCATGGATACTGACTT
TAACCCAGTGGATAAATGGAGGATTGAGAATGGAAAGAAGAAACTGTACGAAGAACATGAGCCCAAGACTGGTTTAGAGATTAAGACGGGCTGTCAGAATGCGAAAACGG
ACCAGGCCCAGAAGGACCGCCCCAAAAGAAAAAGCCCAGAGACTAGCCCGAGAATTCTGGGCAAGAACAGAAAAGGAGTCTCCTTCGCTAAAGAGGCCCAAGTTACCCTG
TTTAAAAAAGGAAAGACCAAATATACGGGGAAGGCAAAAGACCCACGGGAACACCATGACTATATGGATAACGAAGAGGAAGACGACGAATTAGAACCTTCAATCTCGAG
CCCTGGCAGTAAAACAGAGGACGATTACTCTGTTGACAAAGACCGAACAAAGACCCACGGGGACATCCCTGGGGAATATTTCAATTGCTTTATGAATGATGATGAGTATT
CATCTCTGGACAACATAGTTTGCGGGGACGTTGAAGATGAAGGCCAGTGCTCGAGTATGTCTGAAGACCAAAGGGGTCATGAGATAATCCCTCTTTCCTTAGTGGATATG
AAGGCTCAGTGCTATGAGAATGAGGAGCAAGAACTACCTCAACCACGGGCTTTAGAAGTAATATCGTCTTCAGAAAAGGAAAACAGAGCTTCCCAAGACGAGGAAGGATT
TGTTATTAGTAGAGAGTTGATTCTAACCTTGAAGAGGAATAACTTATGCATCCGGCCAATTCTTGGAGCTGCGGCAAAGAAAGGGAGCACGGCCAAAAAGCGTAGAATTA
GAGAAGTTACCAACCTCTTGAGAAGTCTAGAAAAGGAGGATGAATTAGAGTCGGTTAATGTTAGGGAAGGCCAGAGCCAGCAAACAGGCGACGCCATGGATAGAGACGTT
GAAGAATCGAAGCTTCATCAGATTGATCGCAAGATTGTGAAATCGGTTTGGAGCTCTAGACATGTCGGTTGGGTGAGCTTAGATGCTTGGGGCTCGGCAGGGGGAATTCT
AGTTATGTGGAAGGAGAACTGTGTTTCTCAAGTGGACAATAGATCAATCCTCAAGTCTTCCCTCCTTGAAGTTGCATTGACTGATCAAAGGAGGATGTACCAGAAATGCA
AGATAAAATGGATGACCGAAGGGGATGAAAACACGGCGAATCAGCAAAGCTTTGAGGAGTTCTCCAAGTTCCAAGTTAGTTGTGGCAACAAGATTAGATTCTGGGAAGAT
GTGTGGTGTGACACGAGACCCTTAAAAGTGGTTTTCCCATACCTTTTTGATGTTTCATATAAGAAGAATGCCTCTATTAAAGATTGTTGGGACGAAAGAAACCAAACTTG
GAATCTTGGCTTACGCAGGGGTCTGTTTGATCGGGAGTTGAGTAATTGGGTCGCCTTAACCGAGATGCTGGAAAATATACAGTTGGGTAATCAGGAAGACAAGATTTTGT
GGAAGCTGGAGCCTTCGGGTGCTTCTTCTTGTAAATCCATGTTGCGGAAGTCGATTAATATTTCCCCTAATATTTGTAAGTCTCTTGTCGGGCAAATATGGAAGCACAAT
TCCCCTAAAAAGGTGAAAATCTTCCTTTGGTCTGTGGCCTATAGGAGTTTGAACACGGATGACAAAGTGCAAAGAAAGCTGAGGAATTGGTGCCTTTCTCCTTCAAGTTG
CAGGCTTTGTCTGATGGATTGCGAGAGTTTAGACTATTTATTCCTCCACTGTATGTTTGCTCAGAAGGCTTGGAACTTCATAGGTGGTTTAATGGGGCTCTCTTTTTGTG
TGTCGAGAAGGGCTGAGGATTGGTTGGCTGAAGGTTTGATCGCTTGGAACTTAAAGAATAAGGCTAAGGTGATTGGCAGTTGCGCGTTTAGAGTGACTATGTGGCTTTTG
TGGAAGAGGAACGCTAGAACCTTCGACGATAAATCGTCTTCTTTTGAGCTCTTTGCTAACTCTGTAAAGAATATTGCTTCTTGGTGGATCTCTTCGCATAAGAAGATCTT
TTGTAATTATAGCTTGCTAATGATTATTAGTGATTGGCAAGCTCTTTTGAGATAG
mRNA sequence
ATGGCCTCTGTTCGACAGGGGTCAGTGAAGATAGAAAGCAAAGTCTTCTACTGTGGCTTTGACCGACACTTTAAAGCTGGAATTGATTTCAAAGGATGGAAAGTTTTTGC
GAACCTGCTGAAAGACTTCGTTGATGGAAAGGTCTTCGATGAAAAAAAGGAAAGAGAGAAGCAACCAAACTCTCAGACTTTCCGGAAGAAATCCTATGCGGAGGCTATTA
AGTACACGCACAATAACGATTCTGGTTTCCCCCCTGCTTTTAACATGAGTGGCCACAAGAATGCTGAACCCCAAACAACAGCGGGAGACTACCCTTATATAAATCCTTTC
CAGCCCGACAAAGCTCTGTTAAAATGTCCATCAAAGGAGATGGCAAATTTATTGGTCACAAACAAGGGGTGGGTTAGTTTTGGGCCCATTATCTTAAAAGTTGAGAGATG
GAACAAAAATCTTCATGGTAGAATAAATGTGGTGCCCAGTTATGGAGGGTGGGTGAGAATTAGAAATCTCCCGCCGCATTTATGGCATTTGCAAACATTTAAGGCTTTGG
GAGATTGTCTGGGAGGCTTCATTGAATATGCGGAACCTAATTCCTTACTCATCGAATGCGTGGAAGTTGGGATCAGAGTGAGAGGAAATTACTGTGGCTTCATCCCTGGA
GAAATTGAGGTTGTTGAGGATGACCAGATCTTCAAAACTCAGGTTGTCCATGGCAGCTTCTCGTCGGAAGCGGCGCATGTCTTTTTCAGAGGCCCCATGGATACTGACTT
TAACCCAGTGGATAAATGGAGGATTGAGAATGGAAAGAAGAAACTGTACGAAGAACATGAGCCCAAGACTGGTTTAGAGATTAAGACGGGCTGTCAGAATGCGAAAACGG
ACCAGGCCCAGAAGGACCGCCCCAAAAGAAAAAGCCCAGAGACTAGCCCGAGAATTCTGGGCAAGAACAGAAAAGGAGTCTCCTTCGCTAAAGAGGCCCAAGTTACCCTG
TTTAAAAAAGGAAAGACCAAATATACGGGGAAGGCAAAAGACCCACGGGAACACCATGACTATATGGATAACGAAGAGGAAGACGACGAATTAGAACCTTCAATCTCGAG
CCCTGGCAGTAAAACAGAGGACGATTACTCTGTTGACAAAGACCGAACAAAGACCCACGGGGACATCCCTGGGGAATATTTCAATTGCTTTATGAATGATGATGAGTATT
CATCTCTGGACAACATAGTTTGCGGGGACGTTGAAGATGAAGGCCAGTGCTCGAGTATGTCTGAAGACCAAAGGGGTCATGAGATAATCCCTCTTTCCTTAGTGGATATG
AAGGCTCAGTGCTATGAGAATGAGGAGCAAGAACTACCTCAACCACGGGCTTTAGAAGTAATATCGTCTTCAGAAAAGGAAAACAGAGCTTCCCAAGACGAGGAAGGATT
TGTTATTAGTAGAGAGTTGATTCTAACCTTGAAGAGGAATAACTTATGCATCCGGCCAATTCTTGGAGCTGCGGCAAAGAAAGGGAGCACGGCCAAAAAGCGTAGAATTA
GAGAAGTTACCAACCTCTTGAGAAGTCTAGAAAAGGAGGATGAATTAGAGTCGGTTAATGTTAGGGAAGGCCAGAGCCAGCAAACAGGCGACGCCATGGATAGAGACGTT
GAAGAATCGAAGCTTCATCAGATTGATCGCAAGATTGTGAAATCGGTTTGGAGCTCTAGACATGTCGGTTGGGTGAGCTTAGATGCTTGGGGCTCGGCAGGGGGAATTCT
AGTTATGTGGAAGGAGAACTGTGTTTCTCAAGTGGACAATAGATCAATCCTCAAGTCTTCCCTCCTTGAAGTTGCATTGACTGATCAAAGGAGGATGTACCAGAAATGCA
AGATAAAATGGATGACCGAAGGGGATGAAAACACGGCGAATCAGCAAAGCTTTGAGGAGTTCTCCAAGTTCCAAGTTAGTTGTGGCAACAAGATTAGATTCTGGGAAGAT
GTGTGGTGTGACACGAGACCCTTAAAAGTGGTTTTCCCATACCTTTTTGATGTTTCATATAAGAAGAATGCCTCTATTAAAGATTGTTGGGACGAAAGAAACCAAACTTG
GAATCTTGGCTTACGCAGGGGTCTGTTTGATCGGGAGTTGAGTAATTGGGTCGCCTTAACCGAGATGCTGGAAAATATACAGTTGGGTAATCAGGAAGACAAGATTTTGT
GGAAGCTGGAGCCTTCGGGTGCTTCTTCTTGTAAATCCATGTTGCGGAAGTCGATTAATATTTCCCCTAATATTTGTAAGTCTCTTGTCGGGCAAATATGGAAGCACAAT
TCCCCTAAAAAGGTGAAAATCTTCCTTTGGTCTGTGGCCTATAGGAGTTTGAACACGGATGACAAAGTGCAAAGAAAGCTGAGGAATTGGTGCCTTTCTCCTTCAAGTTG
CAGGCTTTGTCTGATGGATTGCGAGAGTTTAGACTATTTATTCCTCCACTGTATGTTTGCTCAGAAGGCTTGGAACTTCATAGGTGGTTTAATGGGGCTCTCTTTTTGTG
TGTCGAGAAGGGCTGAGGATTGGTTGGCTGAAGGTTTGATCGCTTGGAACTTAAAGAATAAGGCTAAGGTGATTGGCAGTTGCGCGTTTAGAGTGACTATGTGGCTTTTG
TGGAAGAGGAACGCTAGAACCTTCGACGATAAATCGTCTTCTTTTGAGCTCTTTGCTAACTCTGTAAAGAATATTGCTTCTTGGTGGATCTCTTCGCATAAGAAGATCTT
TTGTAATTATAGCTTGCTAATGATTATTAGTGATTGGCAAGCTCTTTTGAGATAG
Protein sequence
MASVRQGSVKIESKVFYCGFDRHFKAGIDFKGWKVFANLLKDFVDGKVFDEKKEREKQPNSQTFRKKSYAEAIKYTHNNDSGFPPAFNMSGHKNAEPQTTAGDYPYINPF
QPDKALLKCPSKEMANLLVTNKGWVSFGPIILKVERWNKNLHGRINVVPSYGGWVRIRNLPPHLWHLQTFKALGDCLGGFIEYAEPNSLLIECVEVGIRVRGNYCGFIPG
EIEVVEDDQIFKTQVVHGSFSSEAAHVFFRGPMDTDFNPVDKWRIENGKKKLYEEHEPKTGLEIKTGCQNAKTDQAQKDRPKRKSPETSPRILGKNRKGVSFAKEAQVTL
FKKGKTKYTGKAKDPREHHDYMDNEEEDDELEPSISSPGSKTEDDYSVDKDRTKTHGDIPGEYFNCFMNDDEYSSLDNIVCGDVEDEGQCSSMSEDQRGHEIIPLSLVDM
KAQCYENEEQELPQPRALEVISSSEKENRASQDEEGFVISRELILTLKRNNLCIRPILGAAAKKGSTAKKRRIREVTNLLRSLEKEDELESVNVREGQSQQTGDAMDRDV
EESKLHQIDRKIVKSVWSSRHVGWVSLDAWGSAGGILVMWKENCVSQVDNRSILKSSLLEVALTDQRRMYQKCKIKWMTEGDENTANQQSFEEFSKFQVSCGNKIRFWED
VWCDTRPLKVVFPYLFDVSYKKNASIKDCWDERNQTWNLGLRRGLFDRELSNWVALTEMLENIQLGNQEDKILWKLEPSGASSCKSMLRKSINISPNICKSLVGQIWKHN
SPKKVKIFLWSVAYRSLNTDDKVQRKLRNWCLSPSSCRLCLMDCESLDYLFLHCMFAQKAWNFIGGLMGLSFCVSRRAEDWLAEGLIAWNLKNKAKVIGSCAFRVTMWLL
WKRNARTFDDKSSSFELFANSVKNIASWWISSHKKIFCNYSLLMIISDWQALLR