CuGenDBv2

Spg015077 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg015077
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold3:40464079..40471257
RNA-Seq Expression: Spg015077
Synteny: Spg015077
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
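
The genome location above uses a `scaffold:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention), the string can be parsed and the genomic span computed as follows. The name `parse_location` is illustrative and is not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

scaffold, start, end = parse_location("scaffold3:40464079..40471257")
# Assumption: 1-based inclusive coordinates, so the span covers end - start + 1 bases.
span_length = end - start + 1
print(scaffold, start, end, span_length)
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the coordinate convention should be confirmed before relying on the number.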


Homology
GenBank top hits
CAN68165.1 hypothetical protein VITISV_008538 [Vitis vinifera] (e value: 3.3e-57; % identity: 35.29)
Query:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-
        E +++W  +I SIYG H  GW + +         W  IA  +  F +F +F   +G  IRFWED+W   Q L  RFP L  +   K+++I+    S+   
Subjt:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-

Query:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ
        +W+ +FRR L D E+    +L++ L ++H+  + PD  SWSL  SG ++ KS F  L+  S   S   +KL+W    P K+K F+W +A++ +NT++ LQ
Subjt:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ

Query:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY
         +     LSP  C LCM+ GE +DHLF+HC+     W+ + +L  I    P+ + D +S   + +   ++  V+   A  A LW +W ERNAR FEDKS 
Subjt:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY

Query:  NFDSFCDCVQNTALWWISLHKNF
        N ++  D +   A  W+S  K F
Subjt:  NFDSFCDCVQNTALWWISLHKNF

CAN68165.1 hypothetical protein VITISV_008538 [Vitis vinifera] (e value: 1.1e-07; % identity: 34.09)
Query:  GILFMWKEHVVDGQLREWNK----DIRLREECNKREILNQIDHIDRLEELRSIQ-NAEVERKRLKVELMQMTVNEQRCFNQKSKIKWLKEGDENTNFFHK
        G  FM K   +  +L+EWNK    D+  R++C    IL  I + D +E+   +     ++R   K EL ++ + E+  + QK+++KW+KEGD N+  FHK
Subjt:  GILFMWKEHVVDGQLREWNK----DIRLREECNKREILNQIDHIDRLEELRSIQ-NAEVERKRLKVELMQMTVNEQRCFNQKSKIKWLKEGDENTNFFHK

Query:  WTMARKNRAYISVLDDDTGQILSSEAEIENEI
            R+NR +I VL+++ G +L +   I+ EI
Subjt:  WTMARKNRAYISVLDDDTGQILSSEAEIENEI

CAN68165.1 hypothetical protein VITISV_008538 [Vitis vinifera] (e value: 1.3e-56; % identity: 35.29)
Query:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-
        E +++W  +I SIYG H  GW + ++        W  IA  +  F +F +F   +G  IRFWED+W   Q L  RFP L      K++ I+    S+   
Subjt:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-

Query:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ
        +W+ +FRR L D E+    +L++ L ++H+  + PD  SWSL  SG ++ KS F  L+  S   S   +KL+W    P K+K F+W +A++ +NT++ LQ
Subjt:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ

Query:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY
         +     LSP  C LCM+ GE +DHLF+HC+     W+ + +L  I    P+ + D +S   + +   ++  V+   A  A LW +W ERNAR FEDKS 
Subjt:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY

Query:  NFDSFCDCVQNTALWWISLHKNF
        N ++  D +   A  W+S  K F
Subjt:  NFDSFCDCVQNTALWWISLHKNF

RVW15141.1 putative ribonuclease H protein [Vitis vinifera] (e value: 2.8e-56; % identity: 35.29)
Query:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-
        E +++W  +I SIY  H  GW + ++        W  IA  +  F +F +F   +G  IRFWED+W   Q L  RFP L  +   K++ I+    S+   
Subjt:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-

Query:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ
        +W+ +FRR L D E+    +L++ L ++H+  + PD  SWSL  SG ++ KS F  L+  S   S   +KL+W    P K+K F+W +A++ +NT++ LQ
Subjt:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ

Query:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY
         +     LSP  C LCM+ GE  DHLF+HC+     W+ + +L  I    P+ I D LS   + +   ++  ++   A  A LW +W ERNAR FEDKS 
Subjt:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY

Query:  NFDSFCDCVQNTALWWISLHKNF
        N ++  D +   A  W+S  K F
Subjt:  NFDSFCDCVQNTALWWISLHKNF

RVW27524.1 putative ribonuclease H protein [Vitis vinifera] (e value: 2.8e-56; % identity: 34.98)
Query:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-
        E +++W  +I SIYG H  GW + ++        W  IA  +  F +F +F   +G  IRFWED+W   Q L  RFP L  +   K++ I+    S+   
Subjt:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-

Query:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ
        +W+ +FRR L D E+    +L++ L ++H+  + PD  SWSL  SG ++ KS F  L+  S       +KL+W    P K+K F+W +A++ +NT + LQ
Subjt:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ

Query:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY
         +     LSP  C LCM+ GE +DHLF+HC+     W+ + +L  I    P+ + D +S   + +   ++  V+   A  A LW +W ERNAR FEDKS 
Subjt:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY

Query:  NFDSFCDCVQNTALWWISLHKNF
        N ++  D +   A  W+S  K F
Subjt:  NFDSFCDCVQNTALWWISLHKNF

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia] (e value: 1.6e-67; % identity: 47.19)
Query:  KSFANVVRSNSQNRAPKSIQRSKKLEVNGSPELTKDAGPIGCVEEVRMIDWNNVIVITKRDFHDEWGRILEVIQMQMREAFVINPFQPDKELLKCPSNEM
        KS A +V++N ++   +  +R    E  G     K AG     EEVR ++W   IVIT+RDFHD+W RIL  ++ Q   +++INPFQ DK L+KCPS ++
Subjt:  KSFANVVRSNSQNRAPKSIQRSKKLEVNGSPELTKDAGPIGCVEEVRMIDWNNVIVITKRDFHDEWGRILEVIQMQMREAFVINPFQPDKELLKCPSNEM

Query:  AELLTSNKGWVSFGPIILKVEKWNIRKHNKFSCVPSYGGWVRLRNLPLHLWQLRIFKAIGVQLGGFIEYIVPNSLLIDCMEVNLKVKENYCGFIPTEVKV
        A LL +NKGWV+FGP+ +K+E WN   H +    PSYG WV++RN+PLHLW L  FKAIG  LGGFI+Y   NS  I+C +V +KVK NYCGFIP E+  
Subjt:  AELLTSNKGWVSFGPIILKVEKWNIRKHNKFSCVPSYGGWVRLRNLPLHLWQLRIFKAIGVQLGGFIEYIVPNSLLIDCMEVNLKVKENYCGFIPTEVKV

Query:  IDGEDVFNVQIVTFQDGNLLINRDAGIHGSFLPTAAHAFHRGPSDPSFSPMDIWRIENGSEYPTIEL
        +DG   F  ++V+F+D   L  +D GIHG F   AA +FH+G ++ S + +D WR+ENG  YP + +
Subjt:  IDGEDVFNVQIVTFQDGNLLINRDAGIHGSFLPTAAHAFHRGPSDPSFSPMDIWRIENGSEYPTIEL

TrEMBL top hits
A0A438BVX7 Putative ribonuclease H protein (e value: 1.4e-56; % identity: 35.29)
Query:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-
        E +++W  +I SIY  H  GW + ++        W  IA  +  F +F +F   +G  IRFWED+W   Q L  RFP L  +   K++ I+    S+   
Subjt:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-

Query:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ
        +W+ +FRR L D E+    +L++ L ++H+  + PD  SWSL  SG ++ KS F  L+  S   S   +KL+W    P K+K F+W +A++ +NT++ LQ
Subjt:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ

Query:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY
         +     LSP  C LCM+ GE  DHLF+HC+     W+ + +L  I    P+ I D LS   + +   ++  ++   A  A LW +W ERNAR FEDKS 
Subjt:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY

Query:  NFDSFCDCVQNTALWWISLHKNF
        N ++  D +   A  W+S  K F
Subjt:  NFDSFCDCVQNTALWWISLHKNF

A0A438CWE0 Putative ribonuclease H protein (e value: 1.4e-56; % identity: 34.98)
Query:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-
        E +++W  +I SIYG H  GW + ++        W  IA  +  F +F +F   +G  IRFWED+W   Q L  RFP L  +   K++ I+    S+   
Subjt:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-

Query:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ
        +W+ +FRR L D E+    +L++ L ++H+  + PD  SWSL  SG ++ KS F  L+  S       +KL+W    P K+K F+W +A++ +NT + LQ
Subjt:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ

Query:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY
         +     LSP  C LCM+ GE +DHLF+HC+     W+ + +L  I    P+ + D +S   + +   ++  V+   A  A LW +W ERNAR FEDKS 
Subjt:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY

Query:  NFDSFCDCVQNTALWWISLHKNF
        N ++  D +   A  W+S  K F
Subjt:  NFDSFCDCVQNTALWWISLHKNF

A0A6J1D6X4 uncharacterized protein LOC111018186 (e value: 7.6e-68; % identity: 47.19)
Query:  KSFANVVRSNSQNRAPKSIQRSKKLEVNGSPELTKDAGPIGCVEEVRMIDWNNVIVITKRDFHDEWGRILEVIQMQMREAFVINPFQPDKELLKCPSNEM
        KS A +V++N ++   +  +R    E  G     K AG     EEVR ++W   IVIT+RDFHD+W RIL  ++ Q   +++INPFQ DK L+KCPS ++
Subjt:  KSFANVVRSNSQNRAPKSIQRSKKLEVNGSPELTKDAGPIGCVEEVRMIDWNNVIVITKRDFHDEWGRILEVIQMQMREAFVINPFQPDKELLKCPSNEM

Query:  AELLTSNKGWVSFGPIILKVEKWNIRKHNKFSCVPSYGGWVRLRNLPLHLWQLRIFKAIGVQLGGFIEYIVPNSLLIDCMEVNLKVKENYCGFIPTEVKV
        A LL +NKGWV+FGP+ +K+E WN   H +    PSYG WV++RN+PLHLW L  FKAIG  LGGFI+Y   NS  I+C +V +KVK NYCGFIP E+  
Subjt:  AELLTSNKGWVSFGPIILKVEKWNIRKHNKFSCVPSYGGWVRLRNLPLHLWQLRIFKAIGVQLGGFIEYIVPNSLLIDCMEVNLKVKENYCGFIPTEVKV

Query:  IDGEDVFNVQIVTFQDGNLLINRDAGIHGSFLPTAAHAFHRGPSDPSFSPMDIWRIENGSEYPTIEL
        +DG   F  ++V+F+D   L  +D GIHG F   AA +FH+G ++ S + +D WR+ENG  YP + +
Subjt:  IDGEDVFNVQIVTFQDGNLLINRDAGIHGSFLPTAAHAFHRGPSDPSFSPMDIWRIENGSEYPTIEL

A5BPI6 Uncharacterized protein (e value: 1.6e-57; % identity: 35.29)
Query:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-
        E +++W  +I SIYG H  GW + +         W  IA  +  F +F +F   +G  IRFWED+W   Q L  RFP L  +   K+++I+    S+   
Subjt:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-

Query:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ
        +W+ +FRR L D E+    +L++ L ++H+  + PD  SWSL  SG ++ KS F  L+  S   S   +KL+W    P K+K F+W +A++ +NT++ LQ
Subjt:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ

Query:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY
         +     LSP  C LCM+ GE +DHLF+HC+     W+ + +L  I    P+ + D +S   + +   ++  V+   A  A LW +W ERNAR FEDKS 
Subjt:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY

Query:  NFDSFCDCVQNTALWWISLHKNF
        N ++  D +   A  W+S  K F
Subjt:  NFDSFCDCVQNTALWWISLHKNF

A5BPI6 Uncharacterized protein (e value: 5.2e-08; % identity: 34.09)
Query:  GILFMWKEHVVDGQLREWNK----DIRLREECNKREILNQIDHIDRLEELRSIQ-NAEVERKRLKVELMQMTVNEQRCFNQKSKIKWLKEGDENTNFFHK
        G  FM K   +  +L+EWNK    D+  R++C    IL  I + D +E+   +     ++R   K EL ++ + E+  + QK+++KW+KEGD N+  FHK
Subjt:  GILFMWKEHVVDGQLREWNK----DIRLREECNKREILNQIDHIDRLEELRSIQ-NAEVERKRLKVELMQMTVNEQRCFNQKSKIKWLKEGDENTNFFHK

Query:  WTMARKNRAYISVLDDDTGQILSSEAEIENEI
            R+NR +I VL+++ G +L +   I+ EI
Subjt:  WTMARKNRAYISVLDDDTGQILSSEAEIENEI

A5BPI6 Uncharacterized protein (e value: 6.1e-57; % identity: 35.29)
Query:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-
        E +++W  +I SIYG H  GW + ++        W  IA  +  F +F +F   +G  IRFWED+W   Q L  RFP L      K++ I+    S+   
Subjt:  ENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVWCDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQ-

Query:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ
        +W+ +FRR L D E+    +L++ L ++H+  + PD  SWSL  SG ++ KS F  L+  S   S   +KL+W    P K+K F+W +A++ +NT++ LQ
Subjt:  AWDLDFRRGLFDRELTSWVALVEKLHNVHIGQN-PDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSPKKVKVFLWSIAYRSLNTDERLQ

Query:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY
         +     LSP  C LCM+ GE +DHLF+HC+     W+ + +L  I    P+ + D +S   + +   ++  V+   A  A LW +W ERNAR FEDKS 
Subjt:  SKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWIERNARTFEDKSY

Query:  NFDSFCDCVQNTALWWISLHKNF
        N ++  D +   A  W+S  K F
Subjt:  NFDSFCDCVQNTALWWISLHKNF

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCTTCCCTTTTCCCCACTCCGACTACCACCCATCACCATTGCCGTCGTCATCTGAAGAAAGAGACGTCTGATCGCCGACGTTTGTTCGAGGAAGGAGCAGATCTGGA
GTTTTTCACGTTGCTGCACCAAAAGCTCGAGAACACGGGTCACCTCACGATCTCCAGCGTTTTTTCGAGGAAGGAGACGTCGGTCGGACGTCAAGCTCAGTTGCTGCAGT
CACACGAAGTTCATGGGGTCGTTAGAATCCAAGAAGAACATTTGAGAAGAAGATTCTCTTTATCAGCTGAAGAAATTGTCTTGGTTTGGATCTCCGACTCGATTGACGAT
CTCCTCCTTGGCCCAGCCACCCATAAATTCTTCCGGAAAACTGACTGCAAGAATGGCTTCATTTGGATCCAAAAAACTACGAACAAACGTGGAAGCGTCCTTGAGATCAC
GAAGAATAGAGAAAAGGACAAGGGGAAGTCATTTGCTAATGTGGTCAGAAGTAACAGCCAAAACAGAGCCCCAAAGTCGATTCAAAGATCTAAAAAACTAGAGGTAAATG
GTAGTCCAGAGCTGACGAAGGATGCTGGACCCATTGGATGCGTGGAAGAGGTGAGGATGATTGACTGGAACAACGTCATAGTGATCACCAAGAGAGACTTCCACGATGAG
TGGGGTCGTATTCTTGAAGTGATCCAAATGCAAATGAGGGAAGCCTTTGTTATAAACCCCTTCCAGCCAGATAAAGAGCTCCTTAAATGCCCCTCGAACGAAATGGCTGA
GTTATTAACTAGTAACAAAGGGTGGGTGAGCTTTGGTCCAATCATCTTAAAAGTGGAAAAATGGAATATAAGAAAGCATAATAAATTCTCCTGTGTTCCAAGCTATGGAG
GATGGGTTAGGTTGCGAAATCTCCCATTGCATTTATGGCAATTAAGAATATTTAAGGCGATTGGTGTGCAGCTGGGAGGATTTATTGAATATATTGTGCCGAATTCTCTG
CTCATTGATTGTATGGAAGTGAATTTGAAAGTAAAGGAAAACTATTGTGGGTTTATTCCGACAGAAGTCAAAGTTATTGATGGAGAAGATGTGTTTAACGTCCAGATAGT
CACATTTCAAGATGGGAACCTTCTGATCAATAGAGATGCCGGAATCCATGGAAGTTTCCTTCCGACAGCCGCGCACGCCTTCCACAGAGGGCCCTCAGATCCGTCTTTCA
GTCCAATGGACATTTGGAGGATAGAGAATGGGTCGGAATACCCAACGATAGAGCTCAGTGAAAGCCTAATCCGAGGTCAGTCAAATACTGGACCCAACGAGAGGAAAAGC
CTAGCTATCAAAGGAAAAAATAAAAAGGCCGTCTCCTTTGCAAAATCAGCCCAAACGACCACTTTTAAAAAGGGATCTATTCAACTGACTGATAAAACCAAAAGCCCACA
TATGGACAGAGACACGTGCAAAGATTGGATGGGGGATTATGGCTTAGAATCTGATTTATCTCTTTCAAGCCCAGCTAGCAAAGATTACGGCGATCTGATGGACAGAAACG
AATATATTGCTAAGTCGTCTGATGAAGAAATACCAGAAGAATATTATCGTTGCTTTGCTAATGATAAAGAGATGGACGAAGATAGGCAAGAAGGCAAAGAAGGGAGCAAG
GAAAAATCTTCAGAGGGTGTCGATCAAGGGAAAAAGGCCACTGATTCCCCTTTTATAGGGACATCGCACCAAGATAATCATATGGCCAGTATTATTAGCACCAGTGAGCA
AATGGCTGAGCACCCACAGGCCTTAGCCGTTGTCCCAAGTAGTAATTCAATAGATAGAGATATTGGCTCGGGTTCCCTAGACGGATTTACGATTAGTAAAGAGATTGTTC
AAACCCTCAGGAAAAGTCATCTTTGCATTAGACCCATATCTGGAGCCTATTCCAAAAAAGGCACCTCTACACAGAAAAGAAGAAACAGAGAGGAGACCAAGCTTAGTCAG
ATAGATAGGAAGTTGGTGAAATCCATTTGGAGCTCTCGCCACATTGTTTGGCAGTTTTTAGATGCGAGTAACTTTGCAGGGGGTATTTTGTTTATGTGGAAAGAGCATGT
TGTGGATGGACAGCTGAGAGAATGGAACAAAGATATCCGCTTAAGAGAAGAGTGCAACAAAAGGGAGATTCTTAACCAGATTGACCACATTGATCGCCTAGAAGAGCTGA
GAAGTATTCAAAACGCTGAGGTAGAGAGAAAGCGCCTCAAGGTGGAGCTGATGCAAATGACCGTGAATGAGCAGAGATGTTTTAATCAGAAAAGTAAGATTAAATGGCTT
AAGGAAGGGGATGAGAACACCAATTTCTTCCATAAATGGACCATGGCAAGGAAAAATAGAGCCTACATTTCGGTCCTTGACGATGATACAGGGCAGATCCTTTCTTCTGA
AGCAGAGATTGAAAATGAAATTGAAAACAACTCTATGTGGAGAAACATTATTGCTAGTATCTATGGAGTTCACCCTCGTGGCTGGAGTTCAAAGTCCTTAACTGAGAAGA
AAGGCAACAAGATTTGGGTGGATATTGCGGCCAATTATCCCACGTTCCAGCGCTTCATCAAGTTTAATGCCTCCAACGGTAAAAATATCAGATTTTGGGAGGATGTTTGG
TGTGATTCTCAGCCCCTTAACCATCGTTTCCCTGACTTATACCTTCTGTCCAAGAAGAAGGATGTCGTCATTGCAGATTGTTGGAGCAGTAGCGTGCAGGCTTGGGACTT
GGATTTTAGGAGAGGTCTCTTTGACAGAGAGCTTACCAGTTGGGTGGCTTTGGTGGAAAAGCTTCACAATGTCCACATTGGGCAGAATCCAGACTCTGTTAGTTGGTCTT
TGGAGGGGTCGGGGAAGTATTCGACCAAGTCCCTCTTCTACAAGCTGACAAATGCTTCCCCTAAAATCAGTTCCACCACTAGTAAGTTAATCTGGAAGCACAACAGCCCG
AAGAAGGTCAAAGTTTTTTTGTGGTCTATAGCCTATAGAAGTCTAAATACGGATGAGAGGTTGCAATCCAAGTTTAAACAATGGACTCTTTCCCCCTCGGCCTGTAGATT
GTGTATGAAAGATGGGGAAAACATCGACCACTTATTCATCCACTGTAATTTTGTGTGGAGGGCGTGGAACTTCATTGCTAGATTATTGGGTATTTCCTCCTGCCTCCCTA
AGAAGATTGATGATTGGCTTAGTGAAGGCCTTTCGACTTGGAACCTAAAAAGGAAAGCCAAGGTTATTGCTACTTGTGCCTTTAGGGCCACTCTTTGGAGCCTTTGGATA
GAAAGGAACGCTAGAACGTTTGAGGATAAGTCTTACAACTTCGATTCTTTTTGTGATTGTGTACAAAATACGGCGTTGTGGTGGATTTCTTTACACAAGAATTTTTTTTG
TAATTAG
mRNA sequence
Identical to the CDS sequence above.
Protein sequence
MSSLFPTPTTTHHHCRRHLKKETSDRRRLFEEGADLEFFTLLHQKLENTGHLTISSVFSRKETSVGRQAQLLQSHEVHGVVRIQEEHLRRRFSLSAEEIVLVWISDSIDD
LLLGPATHKFFRKTDCKNGFIWIQKTTNKRGSVLEITKNREKDKGKSFANVVRSNSQNRAPKSIQRSKKLEVNGSPELTKDAGPIGCVEEVRMIDWNNVIVITKRDFHDE
WGRILEVIQMQMREAFVINPFQPDKELLKCPSNEMAELLTSNKGWVSFGPIILKVEKWNIRKHNKFSCVPSYGGWVRLRNLPLHLWQLRIFKAIGVQLGGFIEYIVPNSL
LIDCMEVNLKVKENYCGFIPTEVKVIDGEDVFNVQIVTFQDGNLLINRDAGIHGSFLPTAAHAFHRGPSDPSFSPMDIWRIENGSEYPTIELSESLIRGQSNTGPNERKS
LAIKGKNKKAVSFAKSAQTTTFKKGSIQLTDKTKSPHMDRDTCKDWMGDYGLESDLSLSSPASKDYGDLMDRNEYIAKSSDEEIPEEYYRCFANDKEMDEDRQEGKEGSK
EKSSEGVDQGKKATDSPFIGTSHQDNHMASIISTSEQMAEHPQALAVVPSSNSIDRDIGSGSLDGFTISKEIVQTLRKSHLCIRPISGAYSKKGTSTQKRRNREETKLSQ
IDRKLVKSIWSSRHIVWQFLDASNFAGGILFMWKEHVVDGQLREWNKDIRLREECNKREILNQIDHIDRLEELRSIQNAEVERKRLKVELMQMTVNEQRCFNQKSKIKWL
KEGDENTNFFHKWTMARKNRAYISVLDDDTGQILSSEAEIENEIENNSMWRNIIASIYGVHPRGWSSKSLTEKKGNKIWVDIAANYPTFQRFIKFNASNGKNIRFWEDVW
CDSQPLNHRFPDLYLLSKKKDVVIADCWSSSVQAWDLDFRRGLFDRELTSWVALVEKLHNVHIGQNPDSVSWSLEGSGKYSTKSLFYKLTNASPKISSTTSKLIWKHNSP
KKVKVFLWSIAYRSLNTDERLQSKFKQWTLSPSACRLCMKDGENIDHLFIHCNFVWRAWNFIARLLGISSCLPKKIDDWLSEGLSTWNLKRKAKVIATCAFRATLWSLWI
ERNARTFEDKSYNFDSFCDCVQNTALWWISLHKNFFCN
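
The protein sequence above corresponds to the conceptual translation of the CDS. As a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1; the page does not state which table was used), the snippet below translates the first 30 bases of the CDS listed on this page. The `translate` helper is illustrative and is not a CuGenDB function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, in codon order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS sequence listed on this page.
cds_start = "ATGTCTTCCCTTTTCCCCACTCCGACTACC"
print(translate(cds_start))
```

The printed residues can be compared against the first ten residues of the protein sequence listed above (`MSSLFPTPTT`). The helper does not handle ambiguous bases such as `N`.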