CuGenDBv2

Spg034487 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg034487
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold4:13918982..13929906
RNA-Seq Expression: Spg034487
Synteny: Spg034487
Gene Ontology terms: NA
InterPro domains: NA
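
The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, it can be split into its parts as shown below. The names `GenomeLocation` and `parse_location` are hypothetical and not part of CuGenDB, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'scaffold4:13918982..13929906'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("scaffold4:13918982..13929906")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 10925 bp on scaffold4.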


Homology
GenBank top hits (e value, % identity, alignment)
KAE8676815.1 hypothetical protein F3Y22_tig00111582pilonHSYRG01273 [Hibiscus syriacus] | e value: 3.1e-20 | % identity: 24.13
Query:  YERFVNNLVRAKYLDMLKRDFLFERGF-------GDDLPHFLRVDITNHDWEQFCAKPEPANS--QVVHEFYANIDEEEGFQLNAVVREVWIEGAQWRLS
        +++F N   +A++ +   R   FE  F       G   P  + + +T   W++F   P   N+   VV       DE +    + ++ ++  E  +W   
Subjt:  YERFVNNLVRAKYLDMLKRDFLFERGF-------GDDLPHFLRVDITNHDWEQFCAKPEPANS--QVVHEFYANIDEEEGFQLNAVVREVWIEGAQWRLS

Query:  KTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMD
        +T + +     L+  A     F+K +L+ T+H+  VS  ++LL+ +IM S  IDVG+II  ++H C  KK   L FPN+IT LC++  V  +A D IL  
Subjt:  KTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMD

Query:  TPNLARL------------------QRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLLSL
          ++ R                   +++    Q      + +++E +    + +         ++ YVK RD  +    Q       + FP F D++LS 
Subjt:  TPNLARL------------------QRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLLSL

Query:  WIPPSPIEREEEDED
        +   + +E ++++ED
Subjt:  WIPPSPIEREEEDED

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus] | e value: 7.6e-19 | % identity: 22.28
Query:  LSYERFVNNLVRAKYLDMLKRDFLFERGF-------GDDLPHFLRVDITNHDWEQFCAKPEPANSQVVHEFYANI-------------------------
        +++++F N+  +A++ +   R+  FE GF       G   P  + + +    W +F   P   N+ +V EFYANI                         
Subjt:  LSYERFVNNLVRAKYLDMLKRDFLFERGF-------GDDLPHFLRVDITNHDWEQFCAKPEPANSQVVHEFYANI-------------------------

Query:  --------------DEEEGFQLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISS
                      +E +  + + V+ ++  E  +W   +T + +     L+  A     F+K +L+ T+H+  VS  ++LL+ +++ S  IDVG+II  
Subjt:  --------------DEEEGFQLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISS

Query:  EIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMDTPNLAR------------------LQRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQF
        ++H C  KK   L FPN+IT LC++  V  +A D IL     + R                   +++    +      + +++E +    +++       
Subjt:  EIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMDTPNLAR------------------LQRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQF

Query:  QTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLLSLWIPPSPIEREEEDEDASQKD
        + ++ YVK RDV +    Q       + FP FPD++L  +   +  E E +  D    D
Subjt:  QTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLLSLWIPPSPIEREEEDEDASQKD

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | e value: 2.4e-25 | % identity: 28.38
Query:  ERKTREEIGKGVAGAVAETEEAEPEKQRLSYERFVNNLVRAKYLDMLKRDFLFERGFGDDLPHFLRVDITNHDWEQFCAKPEPANSQVVHEFYAN-----
        ER +      G+     +  +A   +   +  R+ NN ++ + L+  K   L        LP   +V IT H+W+QFCA PE     +V EFYAN     
Subjt:  ERKTREEIGKGVAGAVAETEEAEPEKQRLSYERFVNNLVRAKYLDMLKRDFLFERGFGDDLPHFLRVDITNHDWEQFCAKPEPANSQVVHEFYAN-----

Query:  ---------------------------IDEEEGF-------QLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDK
                                   +DE   F        L  V+  V   GA+W +S     T   + L   A     F+K RLL TTH   VS+D+
Subjt:  ---------------------------IDEEEGF-------QLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDK

Query:  VLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMDTPNL-----ARLQR---TQEARQ---------------GG
        +LL+ +++   SI+VG++I SEI +C  +K G LFFP++IT LC+ A       +  L +T  +     AR+ +   T+  +Q               G 
Subjt:  VLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMDTPNL-----ARLQR---TQEARQ---------------GG

Query:  LVCDIPSI-----QEQLQLY--SSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLL
        ++  + ++     Q+++Q Y   S ++   +Q Q +W Y K RD  L++ALQ+NF++P   FP FP ++L
Subjt:  LVCDIPSI-----QEQLQLY--SSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii] | e value: 6.4e-18 | % identity: 34.39
Query:  FIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVS----------TSAEDVI------------LM
        F+K RLL TTH   VS+D++LL+++++   SI+VG++I SEI +C  +K G LFFP++IT LC+ A             T   D I            L 
Subjt:  FIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVS----------TSAEDVI------------LM

Query:  DTPNLARLQRTQEAR-QGGLVCDIPSI-----QEQLQLY--SSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLL
          P+ +R      +R  G ++  + ++     Q+++Q Y   S ++   +Q Q +W Y K RD  L++ALQ+NF++P   FP FP +LL
Subjt:  DTPNLARLQRTQEAR-QGGLVCDIPSI-----QEQLQLY--SSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | e value: 2.3e-23 | % identity: 34.63
Query:  EFYANIDEEEGFQLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRK
        EF  NI E E   L  V+  V   GA+W +S     T   + L   A     F+K RLL TTH  IVS+D++LL+ +++   SI+VG++I SEI +C  +
Subjt:  EFYANIDEEEGFQLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRK

Query:  KVGKLFFPNMITILCQRAGVSTSAE--------DVILM------------DTPNLARLQRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQFQTYWNY
        K G LFFP++IT LC+ A    + E        D I +              P+ +R      +R  G   D+    + L+   S+ E   +Q Q +W Y
Subjt:  KVGKLFFPNMITILCQRAGVSTSAE--------DVILM------------DTPNLARLQRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQFQTYWNY

Query:  VKMRDVTLRRALQSNFSKPYQAFPVFPDDLL
         K RD  L++ALQ+NF++P   FP FP ++L
Subjt:  VKMRDVTLRRALQSNFSKPYQAFPVFPDDLL

TrEMBL top hits (e value, % identity, alignment)
A0A2P5BCG4 Uncharacterized protein (Fragment) | e value: 1.2e-25 | % identity: 28.38
Query:  ERKTREEIGKGVAGAVAETEEAEPEKQRLSYERFVNNLVRAKYLDMLKRDFLFERGFGDDLPHFLRVDITNHDWEQFCAKPEPANSQVVHEFYAN-----
        ER +      G+     +  +A   +   +  R+ NN ++ + L+  K   L        LP   +V IT H+W+QFCA PE     +V EFYAN     
Subjt:  ERKTREEIGKGVAGAVAETEEAEPEKQRLSYERFVNNLVRAKYLDMLKRDFLFERGFGDDLPHFLRVDITNHDWEQFCAKPEPANSQVVHEFYAN-----

Query:  ---------------------------IDEEEGF-------QLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDK
                                   +DE   F        L  V+  V   GA+W +S     T   + L   A     F+K RLL TTH   VS+D+
Subjt:  ---------------------------IDEEEGF-------QLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDK

Query:  VLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMDTPNL-----ARLQR---TQEARQ---------------GG
        +LL+ +++   SI+VG++I SEI +C  +K G LFFP++IT LC+ A       +  L +T  +     AR+ +   T+  +Q               G 
Subjt:  VLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMDTPNL-----ARLQR---TQEARQ---------------GG

Query:  LVCDIPSI-----QEQLQLY--SSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLL
        ++  + ++     Q+++Q Y   S ++   +Q Q +W Y K RD  L++ALQ+NF++P   FP FP ++L
Subjt:  LVCDIPSI-----QEQLQLY--SSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLL

A0A2P5CEY2 Uncharacterized protein | e value: 3.1e-18 | % identity: 34.39
Query:  FIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVS----------TSAEDVI------------LM
        F+K RLL TTH   VS+D++LL+++++   SI+VG++I SEI +C  +K G LFFP++IT LC+ A             T   D I            L 
Subjt:  FIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVS----------TSAEDVI------------LM

Query:  DTPNLARLQRTQEAR-QGGLVCDIPSI-----QEQLQLY--SSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLL
          P+ +R      +R  G ++  + ++     Q+++Q Y   S ++   +Q Q +W Y K RD  L++ALQ+NF++P   FP FP +LL
Subjt:  DTPNLARLQRTQEAR-QGGLVCDIPSI-----QEQLQLY--SSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLL

A0A2P5DXM3 Uncharacterized protein | e value: 1.1e-23 | % identity: 34.63
Query:  EFYANIDEEEGFQLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRK
        EF  NI E E   L  V+  V   GA+W +S     T   + L   A     F+K RLL TTH  IVS+D++LL+ +++   SI+VG++I SEI +C  +
Subjt:  EFYANIDEEEGFQLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRK

Query:  KVGKLFFPNMITILCQRAGVSTSAE--------DVILM------------DTPNLARLQRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQFQTYWNY
        K G LFFP++IT LC+ A    + E        D I +              P+ +R      +R  G   D+    + L+   S+ E   +Q Q +W Y
Subjt:  KVGKLFFPNMITILCQRAGVSTSAE--------DVILM------------DTPNLARLQRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQFQTYWNY

Query:  VKMRDVTLRRALQSNFSKPYQAFPVFPDDLL
         K RD  L++ALQ+NF++P   FP FP ++L
Subjt:  VKMRDVTLRRALQSNFSKPYQAFPVFPDDLL

A0A6A2Y697 Reverse transcriptase domain-containing protein | e value: 1.5e-20 | % identity: 24.13
Query:  YERFVNNLVRAKYLDMLKRDFLFERGF-------GDDLPHFLRVDITNHDWEQFCAKPEPANS--QVVHEFYANIDEEEGFQLNAVVREVWIEGAQWRLS
        +++F N   +A++ +   R   FE  F       G   P  + + +T   W++F   P   N+   VV       DE +    + ++ ++  E  +W   
Subjt:  YERFVNNLVRAKYLDMLKRDFLFERGF-------GDDLPHFLRVDITNHDWEQFCAKPEPANS--QVVHEFYANIDEEEGFQLNAVVREVWIEGAQWRLS

Query:  KTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMD
        +T + +     L+  A     F+K +L+ T+H+  VS  ++LL+ +IM S  IDVG+II  ++H C  KK   L FPN+IT LC++  V  +A D IL  
Subjt:  KTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMD

Query:  TPNLARL------------------QRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLLSL
          ++ R                   +++    Q      + +++E +    + +         ++ YVK RD  +    Q       + FP F D++LS 
Subjt:  TPNLARL------------------QRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLLSL

Query:  WIPPSPIEREEEDED
        +   + +E ++++ED
Subjt:  WIPPSPIEREEEDED

A0A6A3BU96 Uncharacterized protein | e value: 3.7e-19 | % identity: 22.28
Query:  LSYERFVNNLVRAKYLDMLKRDFLFERGF-------GDDLPHFLRVDITNHDWEQFCAKPEPANSQVVHEFYANI-------------------------
        +++++F N+  +A++ +   R+  FE GF       G   P  + + +    W +F   P   N+ +V EFYANI                         
Subjt:  LSYERFVNNLVRAKYLDMLKRDFLFERGF-------GDDLPHFLRVDITNHDWEQFCAKPEPANSQVVHEFYANI-------------------------

Query:  --------------DEEEGFQLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISS
                      +E +  + + V+ ++  E  +W   +T + +     L+  A     F+K +L+ T+H+  VS  ++LL+ +++ S  IDVG+II  
Subjt:  --------------DEEEGFQLNAVVREVWIEGAQWRLSKTQKRTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISS

Query:  EIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMDTPNLAR------------------LQRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQF
        ++H C  KK   L FPN+IT LC++  V  +A D IL     + R                   +++    +      + +++E +    +++       
Subjt:  EIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMDTPNLAR------------------LQRTQEARQGGLVCDIPSIQEQLQLYSSRMECAERQF

Query:  QTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLLSLWIPPSPIEREEEDEDASQKD
        + ++ YVK RDV +    Q       + FP FPD++L  +   +  E E +  D    D
Subjt:  QTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLLSLWIPPSPIEREEEDEDASQKD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTGGCCGCAACTGGTACAGAATATTGAGATTCGCAGGATCTTCACAGAATCGTCCGAAAATCTCCAATCTGGTATCAAATCTCCAAAGATTGTGGATTATTACCTTCC
AAGAGAAGTTAATGATGAGCGACTTGAGGGAGCAAAATCTATGATCCAGCAAAGCTGTGTGAGATTGGTGTATGAGCGATCCTTCTGGGGTTGTGGGATGAGTCTTGAAC
TGTATGTGGAGACATACGGGGATCACTGTTCAAGAAACAGCCTGAAGGGTCTACAGAGGGATAAGGGATTTGATGTGGACTTTTCCTACCGCAAGCGCACGGGTCAAGGT
CCTACCCTTTCACTGGCCCGAAAGGGGTTTCTGTTTGATGGTTGGACCACAAACAGGTTGTTCATTAGAGGAGCACTGATACTTAAGGAGGTAGAGGTGAAATGCATGGA
GCATCTGGAGCCTCCACGTGTCACCCAGTGTAGATCCAACGACCTAGTCGTTAGCCAGCGTAGCGTGCATTTACCACGGGGCATTTTTTACCGTTGTGTTTGGCAAAAAA
AATTCTTTGAACTCCCAAATGGCCAAGACAAGAGAAAGAGGGAAAGGGAGGTTGAGGAAGAAGAAGTGTCGGTTTCGCCTGAGGCACTTAAAAAGAAAACCAAGAAGACA
AGAACACCGCAGGAGAAGGAAGCCAAACGACTGAGGAGACAGCAGCAGGCTGAAGCCACGGAAGTTCTGCAAAAGCAAGAAACTCAAGATCAGCAAGCAGGGGTGCAGGA
GGAGATCATTCTGATTCAACCGCCAGGCCGCCGCCGTAAGCAAAAGGCTGAGCGAATTAAGCGGGTAAAAACTGACACTTCATCTCCTCCCACCACTGAATCTGAGAAGG
AGAACACAGTGGCAGAGGTACAGGAAAATGTAGAGATTGAGAAAAAGACAGAGGAAGATGAGGCTGAGAGGAAGACTCGTGAAGAAATAGGTAAAGGAGTTGCAGGAGCA
GTGGCTGAAACAGAGGAGGCAGAGCCTGAGAAACAACGGTTGTCGTATGAACGCTTCGTCAACAACCTTGTTAGGGCAAAATACCTGGATATGCTGAAGCGAGATTTTCT
GTTTGAGAGAGGGTTTGGTGACGATCTACCGCACTTTTTGAGAGTAGACATCACAAACCATGACTGGGAGCAGTTTTGTGCCAAACCAGAGCCCGCGAACTCACAGGTAG
TGCATGAGTTCTACGCGAACATCGACGAGGAGGAAGGATTCCAATTGAATGCAGTTGTTCGTGAGGTGTGGATTGAAGGGGCCCAGTGGCGACTCTCGAAGACTCAAAAA
AGGACCTTCCAAACTGCCTATCTGAAGAGTGAAGCCAATACTGGGATGGGATTTATCAAGCAAAGGCTGCTTCTAACAACTCACGACTTCATAGTTTCTCGAGACAAAGT
TCTTCTGGTTTTTGCGATTATGAGATCACTGAGTATTGACGTCGGCAAGATCATCTCCAGCGAGATTCATAGTTGCTGGAGGAAAAAAGTGGGCAAACTATTTTTCCCCA
ACATGATCACGATATTATGCCAAAGGGCAGGGGTTTCCACAAGTGCTGAAGATGTAATTCTAATGGACACACCAAACTTGGCAAGGCTGCAGAGGACGCAGGAAGCTCGC
CAAGGTGGACTGGTGTGCGACATCCCCAGCATCCAAGAACAACTTCAATTGTATTCCAGCCGAATGGAGTGTGCTGAGAGGCAATTCCAAACGTATTGGAATTATGTTAA
AATGAGGGATGTCACGCTTAGGAGGGCTCTGCAATCAAACTTTTCTAAACCATATCAAGCCTTCCCCGTTTTTCCAGATGATCTGTTGAGCCTCTGGATCCCACCGTCGC
CGATTGAAAGAGAAGAGGAAGATGAGGATGCTAGTCAGAAGGATTAG
mRNA sequence
ATGTGGCCGCAACTGGTACAGAATATTGAGATTCGCAGGATCTTCACAGAATCGTCCGAAAATCTCCAATCTGGTATCAAATCTCCAAAGATTGTGGATTATTACCTTCC
AAGAGAAGTTAATGATGAGCGACTTGAGGGAGCAAAATCTATGATCCAGCAAAGCTGTGTGAGATTGGTGTATGAGCGATCCTTCTGGGGTTGTGGGATGAGTCTTGAAC
TGTATGTGGAGACATACGGGGATCACTGTTCAAGAAACAGCCTGAAGGGTCTACAGAGGGATAAGGGATTTGATGTGGACTTTTCCTACCGCAAGCGCACGGGTCAAGGT
CCTACCCTTTCACTGGCCCGAAAGGGGTTTCTGTTTGATGGTTGGACCACAAACAGGTTGTTCATTAGAGGAGCACTGATACTTAAGGAGGTAGAGGTGAAATGCATGGA
GCATCTGGAGCCTCCACGTGTCACCCAGTGTAGATCCAACGACCTAGTCGTTAGCCAGCGTAGCGTGCATTTACCACGGGGCATTTTTTACCGTTGTGTTTGGCAAAAAA
AATTCTTTGAACTCCCAAATGGCCAAGACAAGAGAAAGAGGGAAAGGGAGGTTGAGGAAGAAGAAGTGTCGGTTTCGCCTGAGGCACTTAAAAAGAAAACCAAGAAGACA
AGAACACCGCAGGAGAAGGAAGCCAAACGACTGAGGAGACAGCAGCAGGCTGAAGCCACGGAAGTTCTGCAAAAGCAAGAAACTCAAGATCAGCAAGCAGGGGTGCAGGA
GGAGATCATTCTGATTCAACCGCCAGGCCGCCGCCGTAAGCAAAAGGCTGAGCGAATTAAGCGGGTAAAAACTGACACTTCATCTCCTCCCACCACTGAATCTGAGAAGG
AGAACACAGTGGCAGAGGTACAGGAAAATGTAGAGATTGAGAAAAAGACAGAGGAAGATGAGGCTGAGAGGAAGACTCGTGAAGAAATAGGTAAAGGAGTTGCAGGAGCA
GTGGCTGAAACAGAGGAGGCAGAGCCTGAGAAACAACGGTTGTCGTATGAACGCTTCGTCAACAACCTTGTTAGGGCAAAATACCTGGATATGCTGAAGCGAGATTTTCT
GTTTGAGAGAGGGTTTGGTGACGATCTACCGCACTTTTTGAGAGTAGACATCACAAACCATGACTGGGAGCAGTTTTGTGCCAAACCAGAGCCCGCGAACTCACAGGTAG
TGCATGAGTTCTACGCGAACATCGACGAGGAGGAAGGATTCCAATTGAATGCAGTTGTTCGTGAGGTGTGGATTGAAGGGGCCCAGTGGCGACTCTCGAAGACTCAAAAA
AGGACCTTCCAAACTGCCTATCTGAAGAGTGAAGCCAATACTGGGATGGGATTTATCAAGCAAAGGCTGCTTCTAACAACTCACGACTTCATAGTTTCTCGAGACAAAGT
TCTTCTGGTTTTTGCGATTATGAGATCACTGAGTATTGACGTCGGCAAGATCATCTCCAGCGAGATTCATAGTTGCTGGAGGAAAAAAGTGGGCAAACTATTTTTCCCCA
ACATGATCACGATATTATGCCAAAGGGCAGGGGTTTCCACAAGTGCTGAAGATGTAATTCTAATGGACACACCAAACTTGGCAAGGCTGCAGAGGACGCAGGAAGCTCGC
CAAGGTGGACTGGTGTGCGACATCCCCAGCATCCAAGAACAACTTCAATTGTATTCCAGCCGAATGGAGTGTGCTGAGAGGCAATTCCAAACGTATTGGAATTATGTTAA
AATGAGGGATGTCACGCTTAGGAGGGCTCTGCAATCAAACTTTTCTAAACCATATCAAGCCTTCCCCGTTTTTCCAGATGATCTGTTGAGCCTCTGGATCCCACCGTCGC
CGATTGAAAGAGAAGAGGAAGATGAGGATGCTAGTCAGAAGGATTAG
Protein sequence
MWPQLVQNIEIRRIFTESSENLQSGIKSPKIVDYYLPREVNDERLEGAKSMIQQSCVRLVYERSFWGCGMSLELYVETYGDHCSRNSLKGLQRDKGFDVDFSYRKRTGQG
PTLSLARKGFLFDGWTTNRLFIRGALILKEVEVKCMEHLEPPRVTQCRSNDLVVSQRSVHLPRGIFYRCVWQKKFFELPNGQDKRKREREVEEEEVSVSPEALKKKTKKT
RTPQEKEAKRLRRQQQAEATEVLQKQETQDQQAGVQEEIILIQPPGRRRKQKAERIKRVKTDTSSPPTTESEKENTVAEVQENVEIEKKTEEDEAERKTREEIGKGVAGA
VAETEEAEPEKQRLSYERFVNNLVRAKYLDMLKRDFLFERGFGDDLPHFLRVDITNHDWEQFCAKPEPANSQVVHEFYANIDEEEGFQLNAVVREVWIEGAQWRLSKTQK
RTFQTAYLKSEANTGMGFIKQRLLLTTHDFIVSRDKVLLVFAIMRSLSIDVGKIISSEIHSCWRKKVGKLFFPNMITILCQRAGVSTSAEDVILMDTPNLARLQRTQEAR
QGGLVCDIPSIQEQLQLYSSRMECAERQFQTYWNYVKMRDVTLRRALQSNFSKPYQAFPVFPDDLLSLWIPPSPIEREEEDEDASQKD