; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006576 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006576
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold2:16133670..16137335
RNA-Seq ExpressionSpg006576
SyntenySpg006576
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.7e-1925.61Show/hide
Query:  SNRAGPLPEFVSSVISQYKWQEFCPHLQEAVVPLVREFYAGLREESISMEVVRGKMVSYSSVDINRVYKIKAPLHPRGNDVIRNPSTKQMKDALKLVANK
        S   G LP F++ VI+Q+ W++FC H ++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+                          
Subjt:  SNRAGPLPEFVSSVISQYKWQEFCPHLQEAVVPLVREFYAGLREESISMEVVRGKMVSYSSVDINRVYKIKAPLHPRGNDVIRNPSTKQMKDALKLVANK

Query:  GGGDSCLWKEKSRKAFLWITHHPALSEGEDEGLKFRNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDWWESRGSLEANSNFPRAAYNEMAVAPSNEQLS
                                     DE  +F  ++                                                       +   L 
Subjt:  GGGDSCLWKEKSRKAFLWITHHPALSEGEDEGLKFRNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDWWESRGSLEANSNFPRAAYNEMAVAPSNEQLS

Query:  DAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMIC
          +  V + GA W +S     T   + L   A  W  F++  LLPTTH  TVS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT +C
Subjt:  DAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMIC

Query:  SRAGVPTVLEDIILFDKGIIDTPNLARL
          A  P ++ +  L + G ID   +AR+
Subjt:  SRAGVPTVLEDIILFDKGIIDTPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.6e-3026.53Show/hide
Query:  SNRAGPLPEFVSSVISQYKWQEFCPHLQEAVVPLVREFYAGLREESISMEVVRGKMVSYSSVDINRVYKIKAPLHPRGNDVIRNPSTKQMKDALKLVANK
        S   G LP F++ VI+Q+ W++FC H ++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+                          
Subjt:  SNRAGPLPEFVSSVISQYKWQEFCPHLQEAVVPLVREFYAGLREESISMEVVRGKMVSYSSVDINRVYKIKAPLHPRGNDVIRNPSTKQMKDALKLVANK

Query:  GGGDSCLWKEKSRKAFLWITHHPALSEGEDEGLKFRNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDWWESRGSLEANSNFPRAAYNEMAVAPSNEQLS
                                     DE  +F  ++                                                       + + L 
Subjt:  GGGDSCLWKEKSRKAFLWITHHPALSEGEDEGLKFRNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDWWESRGSLEANSNFPRAAYNEMAVAPSNEQLS

Query:  DAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMIC
          +  V   GA W +S     T   + L   A  W  F++ RLLPTTH  TVS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT +C
Subjt:  DAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMIC

Query:  SRAGVPTVLEDIILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYGVNAILEQLALSASRQ-------EFAERQALTFWNYVKSRD
          A  P ++ +  L + G ID   +AR+         Q+    R          G ++  + A+ ++L+    +Q       +   +Q   FW Y K RD
Subjt:  SRAGVPTVLEDIILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYGVNAILEQLALSASRQ-------EFAERQALTFWNYVKSRD

Query:  ANLKKALQENFSKPYPALPAFPEDLL
          LKKALQ NF++P P  PAFP+++L
Subjt:  ANLKKALQENFSKPYPALPAFPEDLL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]5.2e-2334.87Show/hide
Query:  ANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMICSRAGVPTVLEDIILFDKGIIDTPNLARLQRM
        A  W  F++ RLLPTTH  TVS++R+LL +++L   SI+VG++I +EI  C  +K G LFFP+ IT +C  A  P ++ +  L   G ID   +AR+ + 
Subjt:  ANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMICSRAGVPTVLEDIILFDKGIIDTPNLARLQRM

Query:  QEVR------------------QGGLVYGVNAILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPAFPEDLL
         +                     G ++  + A+ ++L+    +Q       +   +Q   FW Y K RD  LKKALQ NF++P P  P FP++LL
Subjt:  QEVR------------------QGGLVYGVNAILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPAFPEDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]4.2e-2534.75Show/hide
Query:  PRAAYNEMAVAPSNEQLSDAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGC
        P   ++E     +  +L   +  V   GA W +S     T   + L   A  W  F++ RLLPTTH   VS++R+LL  ++L   SI+VG++I +EI  C
Subjt:  PRAAYNEMAVAPSNEQLSDAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGC

Query:  WKKKVGKLFFPNTITMICSRAGVPTVLEDIILFDKGIIDT-----------------PNLARLQRMQEVRQGGLVYGVNAILEQLALSASRQEFAERQAL
          +K G LFFP+ IT +C  A  P ++ +  L + G ID                  P+ +R       R  G V      LEQ     S+QE   +Q  
Subjt:  WKKKVGKLFFPNTITMICSRAGVPTVLEDIILFDKGIIDT-----------------PNLARLQRMQEVRQGGLVYGVNAILEQLALSASRQEFAERQAL

Query:  TFWNYVKSRDANLKKALQENFSKPYPALPAFPEDLL
         FW Y K RD  LKKALQ NF++P P  PAFP+++L
Subjt:  TFWNYVKSRDANLKKALQENFSKPYPALPAFPEDLL

XP_024971944.1 uncharacterized protein LOC112510826 [Cynara cardunculus var. scolymus]1.1e-2032.13Show/hide
Query:  YNEMAVAPSNEQLSDAVREVGIEGARWQL-SKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKK
        +  ++ + S  +L +  RE+G  G RW   S    RT++++ LK  AN W+ FIR  L PTTHDS++S E+++L + ++   +I+VGK++   I  C K+
Subjt:  YNEMAVAPSNEQLSDAVREVGIEGARWQL-SKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKK

Query:  KVGKLFFPNTITMICSRAGVPTVLEDIIL---FDKGIIDTPNLARLQRMQEVRQGGLVYGVNAILEQ-LALSASRQEFAERQALTFWNYVKSRDANLKKA
        + GKLFFP+ I  +  +AGVP   +D+++    +K  ID   +++L+   E  +G  + GV   +E+ L  S S  +F   Q       +K+  A+L   
Subjt:  KVGKLFFPNTITMICSRAGVPTVLEDIIL---FDKGIIDTPNLARLQRMQEVRQGGLVYGVNAILEQ-LALSASRQEFAERQALTFWNYVKSRDANLKKA

Query:  LQENFSKPYPALPAFPEDLLN
        L+ +  K         EDL N
Subjt:  LQENFSKPYPALPAFPEDLLN

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.3e-1925.61Show/hide
Query:  SNRAGPLPEFVSSVISQYKWQEFCPHLQEAVVPLVREFYAGLREESISMEVVRGKMVSYSSVDINRVYKIKAPLHPRGNDVIRNPSTKQMKDALKLVANK
        S   G LP F++ VI+Q+ W++FC H ++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+                          
Subjt:  SNRAGPLPEFVSSVISQYKWQEFCPHLQEAVVPLVREFYAGLREESISMEVVRGKMVSYSSVDINRVYKIKAPLHPRGNDVIRNPSTKQMKDALKLVANK

Query:  GGGDSCLWKEKSRKAFLWITHHPALSEGEDEGLKFRNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDWWESRGSLEANSNFPRAAYNEMAVAPSNEQLS
                                     DE  +F  ++                                                       +   L 
Subjt:  GGGDSCLWKEKSRKAFLWITHHPALSEGEDEGLKFRNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDWWESRGSLEANSNFPRAAYNEMAVAPSNEQLS

Query:  DAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMIC
          +  V + GA W +S     T   + L   A  W  F++  LLPTTH  TVS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT +C
Subjt:  DAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMIC

Query:  SRAGVPTVLEDIILFDKGIIDTPNLARL
          A  P ++ +  L + G ID   +AR+
Subjt:  SRAGVPTVLEDIILFDKGIIDTPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)1.2e-3026.53Show/hide
Query:  SNRAGPLPEFVSSVISQYKWQEFCPHLQEAVVPLVREFYAGLREESISMEVVRGKMVSYSSVDINRVYKIKAPLHPRGNDVIRNPSTKQMKDALKLVANK
        S   G LP F++ VI+Q+ W++FC H ++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+                          
Subjt:  SNRAGPLPEFVSSVISQYKWQEFCPHLQEAVVPLVREFYAGLREESISMEVVRGKMVSYSSVDINRVYKIKAPLHPRGNDVIRNPSTKQMKDALKLVANK

Query:  GGGDSCLWKEKSRKAFLWITHHPALSEGEDEGLKFRNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDWWESRGSLEANSNFPRAAYNEMAVAPSNEQLS
                                     DE  +F  ++                                                       + + L 
Subjt:  GGGDSCLWKEKSRKAFLWITHHPALSEGEDEGLKFRNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDWWESRGSLEANSNFPRAAYNEMAVAPSNEQLS

Query:  DAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMIC
          +  V   GA W +S     T   + L   A  W  F++ RLLPTTH  TVS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT +C
Subjt:  DAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMIC

Query:  SRAGVPTVLEDIILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYGVNAILEQLALSASRQ-------EFAERQALTFWNYVKSRD
          A  P ++ +  L + G ID   +AR+         Q+    R          G ++  + A+ ++L+    +Q       +   +Q   FW Y K RD
Subjt:  SRAGVPTVLEDIILFDKGIIDTPNLARL---------QRMQEVR---------QGGLVYGVNAILEQLALSASRQ-------EFAERQALTFWNYVKSRD

Query:  ANLKKALQENFSKPYPALPAFPEDLL
          LKKALQ NF++P P  PAFP+++L
Subjt:  ANLKKALQENFSKPYPALPAFPEDLL

A0A2P5CEY2 Uncharacterized protein2.5e-2334.87Show/hide
Query:  ANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMICSRAGVPTVLEDIILFDKGIIDTPNLARLQRM
        A  W  F++ RLLPTTH  TVS++R+LL +++L   SI+VG++I +EI  C  +K G LFFP+ IT +C  A  P ++ +  L   G ID   +AR+ + 
Subjt:  ANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMICSRAGVPTVLEDIILFDKGIIDTPNLARLQRM

Query:  QEVR------------------QGGLVYGVNAILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPAFPEDLL
         +                     G ++  + A+ ++L+    +Q       +   +Q   FW Y K RD  LKKALQ NF++P P  P FP++LL
Subjt:  QEVR------------------QGGLVYGVNAILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPAFPEDLL

A0A2P5DXM3 Uncharacterized protein2.1e-2534.75Show/hide
Query:  PRAAYNEMAVAPSNEQLSDAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGC
        P   ++E     +  +L   +  V   GA W +S     T   + L   A  W  F++ RLLPTTH   VS++R+LL  ++L   SI+VG++I +EI  C
Subjt:  PRAAYNEMAVAPSNEQLSDAVREVGIEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGC

Query:  WKKKVGKLFFPNTITMICSRAGVPTVLEDIILFDKGIIDT-----------------PNLARLQRMQEVRQGGLVYGVNAILEQLALSASRQEFAERQAL
          +K G LFFP+ IT +C  A  P ++ +  L + G ID                  P+ +R       R  G V      LEQ     S+QE   +Q  
Subjt:  WKKKVGKLFFPNTITMICSRAGVPTVLEDIILFDKGIIDT-----------------PNLARLQRMQEVRQGGLVYGVNAILEQLALSASRQEFAERQAL

Query:  TFWNYVKSRDANLKKALQENFSKPYPALPAFPEDLL
         FW Y K RD  LKKALQ NF++P P  PAFP+++L
Subjt:  TFWNYVKSRDANLKKALQENFSKPYPALPAFPEDLL

W9QTD9 Uncharacterized protein1.4e-1824.7Show/hide
Query:  PEFVSSVISQYKWQEFCPHLQEAVVPLVREFYAGLREESISMEVVRGKMVSYSSVDINRVYKIKAPLHPRGNDVIRNPSTKQMKDALKLVANKGGGDSCL
        P F++ VI Q+ W++FC H    +VPLVREFYA L + +     V+   V +++  IN ++                                       
Subjt:  PEFVSSVISQYKWQEFCPHLQEAVVPLVREFYAGLREESISMEVVRGKMVSYSSVDINRVYKIKAPLHPRGNDVIRNPSTKQMKDALKLVANKGGGDSCL

Query:  WKEKSRKAFLWITHHPALSEGEDEGLKFRNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDWWESRGSLEANSNFPRAAYNEMAVAPSNEQLSDAVREVG
                         L E  DE + F + V                                                       ++EQL   + EV 
Subjt:  WKEKSRKAFLWITHHPALSEGEDEGLKFRNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDWWESRGSLEANSNFPRAAYNEMAVAPSNEQLSDAVREVG

Query:  IEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGC-WKKKVGKLFFPNTITMICSRAGVP
        IEGA WQ+S     T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ +I + EI  C   +K G L+FP+ IT +  +A VP
Subjt:  IEGARWQLSKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGC-WKKKVGKLFFPNTITMICSRAGVP

Query:  TVLEDIILFDKGIIDTPNLARLQRMQEV
           ++ I+ + G I T +++R+ + + V
Subjt:  TVLEDIILFDKGIIDTPNLARLQRMQEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACGGCACGAGAAGGACGAAACCTTCGGGTTTCTCACCAGAGATCGTGGATCAAGGTACTTTCACTCAAACTCCTTCTTCCTTGACAATGTCAGCTACCTCGAGGGA
GAATCCGAGTTCATCTCAGCCAAGAAGGAGATGGGAGCCCAAAGACGTGCAACTCTTGAAGAAGAAGAAAGAAGGCATGATGAAGAAGAAGTCGTCCAAGCTGAGAATAG
CTCTCGGCAAGAAGAGGCTTCAATGGGTAACATTTCTGAACCTTCAACTAACCCTTCTTTGTCTCGCAGGAACAAGCAATTTGTTACGTATAGTGCAAGGAAGAAGAGTC
CCAAGAAGGTTGCACCCGAACAACCACTTGTTACCGAGCCCCTCAAGGAGATGGATGATGACCAAGTTCCTATCTCTGCAGCATCGAGGAGAAAGAGAAGAAGAGAGATC
AAAGCTGAACGGAGGACTAAAAAAAAATGACCCCATATTTGCCAAGAGGTCGAGGACAAGGTCCATGGACGCCTCTCCTGCAGCTCCTCCAACTGTCTCACCCACCAAGC
CAAAAGTCAAATCACCGAAGGCTCCATCTCCTAAAAATCCATTCCCTGAAGTCTTCAAAGATGTAAACTTTCAGGAAAGGATGGAGATAATGAGAAAGAGAGATTTCTTG
AATGAGAAAAGATTCTCTAACAGAGCAGGACCACTGCCAGAGTTCGTAAGCAGTGTTATCTCACAGTATAAGTGGCAGGAGTTTTGTCCTCACCTCCAGGAGGCTGTAGT
GCCTTTAGTTCGAGAGTTTTATGCCGGCCTGAGGGAGGAAAGTATCAGTATGGAGGTGGTGAGAGGCAAGATGGTCAGCTACTCTTCAGTAGACATCAACAGGGTGTACA
AAATCAAGGCACCCCTACATCCAAGAGGGAACGATGTCATTAGGAACCCTTCGACCAAGCAAATGAAAGATGCACTTAAATTGGTGGCCAACAAGGGAGGAGGAGATTCT
TGCCTGTGGAAGGAAAAGAGCAGGAAAGCTTTTCTTTGGATCACACATCACCCAGCTTTGTCAGAGGGTGAAGATGAAGGATTGAAATTCCGTAATTCCGTAAAGCGGAA
GCGTGATTGGGACCGATTCTGCGAACACCACCACTATGGCTACACCGGTATGACTCTTAGACTTCTAGAGGCAGGAGACTGGTGGGAGTCTAGAGGAAGCCTTGAGGCGA
ATTCTAATTTCCCCCGTGCGGCATATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTAGGTATTGAAGGGGCACGGTGGCAACTG
TCAAAAACAGAGAAGAGGACATTTCAGTCAGCTTATCTGAAGAGGGAGGCAAACACATGGATGGGGTTTATCAGACAACGATTGCTTCCAACAACTCACGACTCGACGGT
CTCACGGGAACGGGTTCTTCTAGCTTTTGCGATTTTGCGGTCTCTCAGCATTGACGTAGGAAAGATCATTGTTAATGAAATATTTGGATGTTGGAAGAAAAAGGTGGGGA
AGTTGTTTTTTCCGAACACGATCACTATGATATGCAGCAGGGCAGGAGTGCCCACGGTTCTAGAGGATATTATTCTGTTTGACAAGGGGATCATTGACACGCCTAACTTG
GCACGGCTCCAGCGTATGCAGGAGGTACGTCAAGGTGGGCTTGTCTACGGCGTCAACGCGATTTTAGAACAACTGGCACTTTCGGCCAGTAGGCAAGAGTTTGCCGAAAG
GCAAGCTTTGACCTTCTGGAACTATGTTAAAAGTCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCATATCCAGCCCTTCCAGCATTCCCTGAGG
ATTTATTGAACCCCTGGATCCCACCCCCACCGGTTCAAAGAGGAAAAGAGGATGAGGAAAATGAGCCGAGCCAAGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACGGCACGAGAAGGACGAAACCTTCGGGTTTCTCACCAGAGATCGTGGATCAAGGTACTTTCACTCAAACTCCTTCTTCCTTGACAATGTCAGCTACCTCGAGGGA
GAATCCGAGTTCATCTCAGCCAAGAAGGAGATGGGAGCCCAAAGACGTGCAACTCTTGAAGAAGAAGAAAGAAGGCATGATGAAGAAGAAGTCGTCCAAGCTGAGAATAG
CTCTCGGCAAGAAGAGGCTTCAATGGGTAACATTTCTGAACCTTCAACTAACCCTTCTTTGTCTCGCAGGAACAAGCAATTTGTTACGTATAGTGCAAGGAAGAAGAGTC
CCAAGAAGGTTGCACCCGAACAACCACTTGTTACCGAGCCCCTCAAGGAGATGGATGATGACCAAGTTCCTATCTCTGCAGCATCGAGGAGAAAGAGAAGAAGAGAGATC
AAAGCTGAACGGAGGACTAAAAAAAAATGACCCCATATTTGCCAAGAGGTCGAGGACAAGGTCCATGGACGCCTCTCCTGCAGCTCCTCCAACTGTCTCACCCACCAAGC
CAAAAGTCAAATCACCGAAGGCTCCATCTCCTAAAAATCCATTCCCTGAAGTCTTCAAAGATGTAAACTTTCAGGAAAGGATGGAGATAATGAGAAAGAGAGATTTCTTG
AATGAGAAAAGATTCTCTAACAGAGCAGGACCACTGCCAGAGTTCGTAAGCAGTGTTATCTCACAGTATAAGTGGCAGGAGTTTTGTCCTCACCTCCAGGAGGCTGTAGT
GCCTTTAGTTCGAGAGTTTTATGCCGGCCTGAGGGAGGAAAGTATCAGTATGGAGGTGGTGAGAGGCAAGATGGTCAGCTACTCTTCAGTAGACATCAACAGGGTGTACA
AAATCAAGGCACCCCTACATCCAAGAGGGAACGATGTCATTAGGAACCCTTCGACCAAGCAAATGAAAGATGCACTTAAATTGGTGGCCAACAAGGGAGGAGGAGATTCT
TGCCTGTGGAAGGAAAAGAGCAGGAAAGCTTTTCTTTGGATCACACATCACCCAGCTTTGTCAGAGGGTGAAGATGAAGGATTGAAATTCCGTAATTCCGTAAAGCGGAA
GCGTGATTGGGACCGATTCTGCGAACACCACCACTATGGCTACACCGGTATGACTCTTAGACTTCTAGAGGCAGGAGACTGGTGGGAGTCTAGAGGAAGCCTTGAGGCGA
ATTCTAATTTCCCCCGTGCGGCATATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTAGGTATTGAAGGGGCACGGTGGCAACTG
TCAAAAACAGAGAAGAGGACATTTCAGTCAGCTTATCTGAAGAGGGAGGCAAACACATGGATGGGGTTTATCAGACAACGATTGCTTCCAACAACTCACGACTCGACGGT
CTCACGGGAACGGGTTCTTCTAGCTTTTGCGATTTTGCGGTCTCTCAGCATTGACGTAGGAAAGATCATTGTTAATGAAATATTTGGATGTTGGAAGAAAAAGGTGGGGA
AGTTGTTTTTTCCGAACACGATCACTATGATATGCAGCAGGGCAGGAGTGCCCACGGTTCTAGAGGATATTATTCTGTTTGACAAGGGGATCATTGACACGCCTAACTTG
GCACGGCTCCAGCGTATGCAGGAGGTACGTCAAGGTGGGCTTGTCTACGGCGTCAACGCGATTTTAGAACAACTGGCACTTTCGGCCAGTAGGCAAGAGTTTGCCGAAAG
GCAAGCTTTGACCTTCTGGAACTATGTTAAAAGTCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCATATCCAGCCCTTCCAGCATTCCCTGAGG
ATTTATTGAACCCCTGGATCCCACCCCCACCGGTTCAAAGAGGAAAAGAGGATGAGGAAAATGAGCCGAGCCAAGAGGACTGA
Protein sequenceShow/hide protein sequence
MHGTRRTKPSGFSPEIVDQGTFTQTPSSLTMSATSRENPSSSQPRRRWEPKDVQLLKKKKEGMMKKKSSKLRIALGKKRLQWVTFLNLQLTLLCLAGTSNLLRIVQGRRV
PRRLHPNNHLLPSPSRRWMMTKFLSLQHRGEREEERSKLNGGLKKNDPIFAKRSRTRSMDASPAAPPTVSPTKPKVKSPKAPSPKNPFPEVFKDVNFQERMEIMRKRDFL
NEKRFSNRAGPLPEFVSSVISQYKWQEFCPHLQEAVVPLVREFYAGLREESISMEVVRGKMVSYSSVDINRVYKIKAPLHPRGNDVIRNPSTKQMKDALKLVANKGGGDS
CLWKEKSRKAFLWITHHPALSEGEDEGLKFRNSVKRKRDWDRFCEHHHYGYTGMTLRLLEAGDWWESRGSLEANSNFPRAAYNEMAVAPSNEQLSDAVREVGIEGARWQL
SKTEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIVNEIFGCWKKKVGKLFFPNTITMICSRAGVPTVLEDIILFDKGIIDTPNL
ARLQRMQEVRQGGLVYGVNAILEQLALSASRQEFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVQRGKEDEENEPSQED