; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021463 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021463
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold9:6687368..6689881
RNA-Seq ExpressionSpg021463
SyntenySpg021463
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]1.1e-1125.71Show/hide
Query:  EKIEEAETSSDSETDSDFEIKELDDDQEKM--EIMRKRDFLNEKGF-SNRAGTLRD--FVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAV
        +K   A+ SS +   S    K +D+  EK   E +  R+ + EKGF  + + TL    F++ +I    WQ FC HP + +VPLV+EFY  L+ +  +   
Subjt:  EKIEEAETSSDSETDSDFEIKELDDDQEKM--EIMRKRDFLNEKGF-SNRAGTLRD--FVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAV

Query:  V-------------RVEGVSDKGE-------DTSAKQFKARVDNVASL----------------HQKQTDP-------------TTHDNTISVERVMLLY
        V              V G+ ++ +       D   +Q K  +  +A L                H+ Q                +TH  TIS  R +LLY
Subjt:  V-------------RVEGVSDKGE-------DTSAKQFKARVDNVASL----------------HQKQTDP-------------TTHDNTISVERVMLLY

Query:  NIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASP-PQHTP
         ++ G  IN+G +I ++I +C  K  G L+F SLI++LC +  +  E  E R      +DL  I ++    +++ +K    +        P+ P   HT 
Subjt:  NIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASP-PQHTP

Query:  VSGPTPSSGAL-----AFTFRQLDRIE------DKLKTYWVYAKERDEAIREFY
         +    S   L      F  +Q +  E      ++L  +WVY+++RD A+++ +
Subjt:  VSGPTPSSGAL-----AFTFRQLDRIE------DKLKTYWVYAKERDEAIREFY

KAF4375842.1 hypothetical protein G4B88_026421 [Cannabis sativa]1.1e-1124.13Show/hide
Query:  MRKRDFLNEKGF---SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVG-LREESMSMAVVRVEGVSDKGEDTS----------------AKQ
        +R ++F  ++G        G++  ++ + I +  W + C  P   V  +V+EFY   L  E  +   VR   V     D +                 K 
Subjt:  MRKRDFLNEKGF---SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVG-LREESMSMAVVRVEGVSDKGEDTS----------------AKQ

Query:  FKARVDNVASLHQKQTD-----------------PTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKD
            +D       K+TD                 PT+HD+T+S ER+ +LY I+KG KIN+G +I +EI  C  ++ GKLFF  LIT+ C+   +    D
Subjt:  FKARVDNVASLHQKQTD-----------------PTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKD

Query:  EERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASPPQHTPVSGPTPSSGALAFTFRQLDRIEDKLKTYWVYAKERD
        E+       +    +   + + A ++   ST+                T  +   P    L+       ++ ++L+T+W Y +ERD
Subjt:  EERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASPPQHTPVSGPTPSSGALAFTFRQLDRIEDKLKTYWVYAKERD

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.4e-1427.02Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVRVEGVSDKGEDTSA--------KQFKARVDNV
        ++ R    EKGF    S   G L  F+ ++ITQ+ W++FCAHP++ +VPLVREFY  L +   +   VR   VS   E  +A         +    ++N+
Subjt:  MRKRDFLNEKGF----SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVRVEGVSDKGEDTSA--------KQFKARVDNV

Query:  AS-----------------------------------------LHQKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGS
                                                     +    PTTH  T+S +R++LL++++ G  IN+G +I  EI +C  +  G LFF S
Subjt:  AS-----------------------------------------LHQKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGS

Query:  LITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTS
        LIT+LC+  +     +EE+      ID   + ++ Q       +  +S
Subjt:  LITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTS

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.0e-1727.37Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVR-------------VEGVSDKGEDTS------
        ++ R    EKGF    S   G L  F+ ++ITQ+ W++FCAHP++ +VPLVREFY  L +   +   VR             V G+ D  ++ S      
Subjt:  MRKRDFLNEKGF----SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVR-------------VEGVSDKGEDTS------

Query:  -AKQFKARVDNVAS-----------------------------LHQKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGS
          +     ++ VA+                               + +  PTTH  T+S +R++LL++++ G  IN+G +I  EI +C  +  G LFF S
Subjt:  -AKQFKARVDNVAS-----------------------------LHQKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGS

Query:  LITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASPPQHTPVSGPTPSSGALAFTFRQ-----------LDRIED
        LIT+LC+  +     +EE+      ID   + ++    AQ     ST Q   P    PA+   +           AL     Q           L     
Subjt:  LITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASPPQHTPVSGPTPSSGALAFTFRQ-----------LDRIED

Query:  KLKTYWVYAKERDEAIREFYLSIAPSIAPVFPDFPQALLPQEEKESDDEDEDEDEENE
        + + +W Y+KERD A+++   +      P FP FPQ +L   + E + E  D+D  NE
Subjt:  KLKTYWVYAKERDEAIREFYLSIAPSIAPVFPDFPQALLPQEEKESDDEDEDEDEENE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]9.8e-1329.9Show/hide
Query:  FVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVR-------VEGV-------------SDKGEDTSAKQFKARVDNVA------------
        F+  +I Q+ WQ FCAHP++ +VPLVREFY  +         +R       VE +             S+  ED +  +    ++ VA            
Subjt:  FVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVR-------VEGV-------------SDKGEDTSAKQFKARVDNVA------------

Query:  -------SLH----------QKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKDEER
               SL+          + +  PTTH  T+S E V LLY+++ G  IN+G +I  EI +C  + +G LFF SLIT +C+  +     +EE+
Subjt:  -------SLH----------QKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKDEER

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)6.6e-1527.02Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVRVEGVSDKGEDTSA--------KQFKARVDNV
        ++ R    EKGF    S   G L  F+ ++ITQ+ W++FCAHP++ +VPLVREFY  L +   +   VR   VS   E  +A         +    ++N+
Subjt:  MRKRDFLNEKGF----SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVRVEGVSDKGEDTSA--------KQFKARVDNV

Query:  AS-----------------------------------------LHQKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGS
                                                     +    PTTH  T+S +R++LL++++ G  IN+G +I  EI +C  +  G LFF S
Subjt:  AS-----------------------------------------LHQKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGS

Query:  LITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTS
        LIT+LC+  +     +EE+      ID   + ++ Q       +  +S
Subjt:  LITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTS

A0A2P5BCG4 Uncharacterized protein (Fragment)4.9e-1827.37Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVR-------------VEGVSDKGEDTS------
        ++ R    EKGF    S   G L  F+ ++ITQ+ W++FCAHP++ +VPLVREFY  L +   +   VR             V G+ D  ++ S      
Subjt:  MRKRDFLNEKGF----SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVR-------------VEGVSDKGEDTS------

Query:  -AKQFKARVDNVAS-----------------------------LHQKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGS
          +     ++ VA+                               + +  PTTH  T+S +R++LL++++ G  IN+G +I  EI +C  +  G LFF S
Subjt:  -AKQFKARVDNVAS-----------------------------LHQKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGS

Query:  LITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASPPQHTPVSGPTPSSGALAFTFRQ-----------LDRIED
        LIT+LC+  +     +EE+      ID   + ++    AQ     ST Q   P    PA+   +           AL     Q           L     
Subjt:  LITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASPPQHTPVSGPTPSSGALAFTFRQ-----------LDRIED

Query:  KLKTYWVYAKERDEAIREFYLSIAPSIAPVFPDFPQALLPQEEKESDDEDEDEDEENE
        + + +W Y+KERD A+++   +      P FP FPQ +L   + E + E  D+D  NE
Subjt:  KLKTYWVYAKERDEAIREFYLSIAPSIAPVFPDFPQALLPQEEKESDDEDEDEDEENE

A0A2P5DAQ2 Uncharacterized protein4.7e-1329.9Show/hide
Query:  FVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVR-------VEGV-------------SDKGEDTSAKQFKARVDNVA------------
        F+  +I Q+ WQ FCAHP++ +VPLVREFY  +         +R       VE +             S+  ED +  +    ++ VA            
Subjt:  FVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAVVR-------VEGV-------------SDKGEDTSAKQFKARVDNVA------------

Query:  -------SLH----------QKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKDEER
               SL+          + +  PTTH  T+S E V LLY+++ G  IN+G +I  EI +C  + +G LFF SLIT +C+  +     +EE+
Subjt:  -------SLH----------QKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKDEER

A0A7J6FZ22 Uncharacterized protein5.2e-1224.13Show/hide
Query:  MRKRDFLNEKGF---SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVG-LREESMSMAVVRVEGVSDKGEDTS----------------AKQ
        +R ++F  ++G        G++  ++ + I +  W + C  P   V  +V+EFY   L  E  +   VR   V     D +                 K 
Subjt:  MRKRDFLNEKGF---SNRAGTLRDFVTKIITQYKWQEFCAHPQEVVVPLVREFYVG-LREESMSMAVVRVEGVSDKGEDTS----------------AKQ

Query:  FKARVDNVASLHQKQTD-----------------PTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKD
            +D       K+TD                 PT+HD+T+S ER+ +LY I+KG KIN+G +I +EI  C  ++ GKLFF  LIT+ C+   +    D
Subjt:  FKARVDNVASLHQKQTD-----------------PTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKD

Query:  EERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASPPQHTPVSGPTPSSGALAFTFRQLDRIEDKLKTYWVYAKERD
        E+       +    +   + + A ++   ST+                T  +   P    L+       ++ ++L+T+W Y +ERD
Subjt:  EERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASPPQHTPVSGPTPSSGALAFTFRQLDRIEDKLKTYWVYAKERD

W9RBS1 Uncharacterized protein5.2e-1225.71Show/hide
Query:  EKIEEAETSSDSETDSDFEIKELDDDQEKM--EIMRKRDFLNEKGF-SNRAGTLRD--FVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAV
        +K   A+ SS +   S    K +D+  EK   E +  R+ + EKGF  + + TL    F++ +I    WQ FC HP + +VPLV+EFY  L+ +  +   
Subjt:  EKIEEAETSSDSETDSDFEIKELDDDQEKM--EIMRKRDFLNEKGF-SNRAGTLRD--FVTKIITQYKWQEFCAHPQEVVVPLVREFYVGLREESMSMAV

Query:  V-------------RVEGVSDKGE-------DTSAKQFKARVDNVASL----------------HQKQTDP-------------TTHDNTISVERVMLLY
        V              V G+ ++ +       D   +Q K  +  +A L                H+ Q                +TH  TIS  R +LLY
Subjt:  V-------------RVEGVSDKGE-------DTSAKQFKARVDNVASL----------------HQKQTDP-------------TTHDNTISVERVMLLY

Query:  NIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASP-PQHTP
         ++ G  IN+G +I ++I +C  K  G L+F SLI++LC +  +  E  E R      +DL  I ++    +++ +K    +        P+ P   HT 
Subjt:  NIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLCKRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASP-PQHTP

Query:  VSGPTPSSGAL-----AFTFRQLDRIE------DKLKTYWVYAKERDEAIREFY
         +    S   L      F  +Q +  E      ++L  +WVY+++RD A+++ +
Subjt:  VSGPTPSSGAL-----AFTFRQLDRIE------DKLKTYWVYAKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATAGAACAAGAAGCAGGAAACCCACGGGTTTTTCGCCAGTCGTCGAGGAAGGAAGAACCATCATTCATACTCCATCATCTTCGACAATGTCGGCCACTCCAAGGGA
GAATCCAAGTGGTTCTCGACAAAGAAGACTCACGCCCACCGATTCTCTCCACCAGACGCAAAAACCTGCAGACAAACCTCTTAGAAAGCGCTCAAAGAATGTGGGAGAAA
GCTCTCGGCAAGGAAAGACTCAAGAAGGTAACGTTTCTGCCCCTTCTACTAACTCAACTTCTTCTTGCAGGGACAAGCCCTTTGTGACCTACCTAGCCAAGAAGAAAAGC
TCCAAGAAGGCTGAACCCGAGAAGCCTCTAGTCCTTGAGCCCTTGAAAACAGTAAAAATGTTGCCGGATGTGTTCGAAGACATGATCCAACAGAGAGACCTAGAAGATGA
GAAGAAAGCTGAGAAAAAGAAAGAAGAAGAAAGACAAGAGGCCGAGAGGGCCAGAGAAGCTGAAAAAGCCATAAAACAAGATCAAGAACTTGGGAGGTTGGCTGTTGAAT
TGAAACTCCTTGAGGAAGAAAAAGAAAGACGAGAGATTTTGAGAGAAGAAGAAAAACGAAGAAAAGAAGCTGAGGATTTTCTTGCAGCTTTTGAGCCACTGCACAAGGCT
CAAAAGCAACAAGCTGATGAGGCCTTTGATCCTTTGTTCGAGTATGATGTTAGAGGACCTCCACCAGCTGCTGAGAGCACATCTTCGGGAGAGAAAAAGGGTGAAGAGAA
AATTGAAGAGGCCGAGACCTCTAGTGATAGTGAGACAGATTCAGATTTTGAGATTAAGGAATTGGACGATGACCAAGAGAAGATGGAGATTATGAGAAAAAGAGACTTCC
TAAACGAAAAAGGATTCTCAAATAGAGCAGGAACTTTGCGAGACTTTGTAACCAAGATTATCACCCAATACAAATGGCAGGAGTTCTGTGCTCACCCTCAGGAGGTCGTG
GTGCCTCTAGTGCGAGAATTCTATGTTGGTTTGAGGGAAGAGAGCATGAGTATGGCGGTGGTGAGAGTGGAAGGAGTCTCAGACAAAGGTGAAGACACTAGTGCCAAGCA
ATTTAAAGCCAGAGTCGACAATGTGGCTTCACTTCATCAAAAACAGACTGATCCCACAACCCACGACAATACAATTTCAGTAGAGAGGGTTATGCTTCTCTACAACATTA
TGAAGGGGTTGAAGATAAACATTGGGAGCATCATTAGAGAAGAGATCCTTTCATGTGGAAGGAAGATAGCGGGGAAGTTGTTCTTTGGGTCACTTATAACCCAATTGTGT
AAGAGGGTGAAGATAGTCCCTGAAAAAGATGAGGAACGTCACTTTTTCAGGCCCACCATCGACTTGCCTCTGATTGGGAAGCTTCAACAGAACAACGCTCAAAGGAAGGA
CAAGGCTTCCACATCCCAAGCTATTCCACCACTAGGGTTGAATCCTGCTTCACCTCCTCAGCACACTCCAGTTTCAGGGCCTACACCATCATCAGGGGCACTTGCATTTA
CCTTCCGACAGCTAGACCGGATTGAAGACAAGTTGAAAACTTATTGGGTCTATGCTAAGGAAAGAGATGAAGCAATTAGAGAGTTTTACCTTTCTATCGCCCCTAGCATT
GCTCCTGTGTTCCCTGACTTCCCTCAAGCCCTGCTGCCACAAGAAGAGAAAGAATCTGACGATGAAGATGAAGATGAAGATGAAGAAAATGAAGAGGTTCCCTCAAATAA
GGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCATAGAACAAGAAGCAGGAAACCCACGGGTTTTTCGCCAGTCGTCGAGGAAGGAAGAACCATCATTCATACTCCATCATCTTCGACAATGTCGGCCACTCCAAGGGA
GAATCCAAGTGGTTCTCGACAAAGAAGACTCACGCCCACCGATTCTCTCCACCAGACGCAAAAACCTGCAGACAAACCTCTTAGAAAGCGCTCAAAGAATGTGGGAGAAA
GCTCTCGGCAAGGAAAGACTCAAGAAGGTAACGTTTCTGCCCCTTCTACTAACTCAACTTCTTCTTGCAGGGACAAGCCCTTTGTGACCTACCTAGCCAAGAAGAAAAGC
TCCAAGAAGGCTGAACCCGAGAAGCCTCTAGTCCTTGAGCCCTTGAAAACAGTAAAAATGTTGCCGGATGTGTTCGAAGACATGATCCAACAGAGAGACCTAGAAGATGA
GAAGAAAGCTGAGAAAAAGAAAGAAGAAGAAAGACAAGAGGCCGAGAGGGCCAGAGAAGCTGAAAAAGCCATAAAACAAGATCAAGAACTTGGGAGGTTGGCTGTTGAAT
TGAAACTCCTTGAGGAAGAAAAAGAAAGACGAGAGATTTTGAGAGAAGAAGAAAAACGAAGAAAAGAAGCTGAGGATTTTCTTGCAGCTTTTGAGCCACTGCACAAGGCT
CAAAAGCAACAAGCTGATGAGGCCTTTGATCCTTTGTTCGAGTATGATGTTAGAGGACCTCCACCAGCTGCTGAGAGCACATCTTCGGGAGAGAAAAAGGGTGAAGAGAA
AATTGAAGAGGCCGAGACCTCTAGTGATAGTGAGACAGATTCAGATTTTGAGATTAAGGAATTGGACGATGACCAAGAGAAGATGGAGATTATGAGAAAAAGAGACTTCC
TAAACGAAAAAGGATTCTCAAATAGAGCAGGAACTTTGCGAGACTTTGTAACCAAGATTATCACCCAATACAAATGGCAGGAGTTCTGTGCTCACCCTCAGGAGGTCGTG
GTGCCTCTAGTGCGAGAATTCTATGTTGGTTTGAGGGAAGAGAGCATGAGTATGGCGGTGGTGAGAGTGGAAGGAGTCTCAGACAAAGGTGAAGACACTAGTGCCAAGCA
ATTTAAAGCCAGAGTCGACAATGTGGCTTCACTTCATCAAAAACAGACTGATCCCACAACCCACGACAATACAATTTCAGTAGAGAGGGTTATGCTTCTCTACAACATTA
TGAAGGGGTTGAAGATAAACATTGGGAGCATCATTAGAGAAGAGATCCTTTCATGTGGAAGGAAGATAGCGGGGAAGTTGTTCTTTGGGTCACTTATAACCCAATTGTGT
AAGAGGGTGAAGATAGTCCCTGAAAAAGATGAGGAACGTCACTTTTTCAGGCCCACCATCGACTTGCCTCTGATTGGGAAGCTTCAACAGAACAACGCTCAAAGGAAGGA
CAAGGCTTCCACATCCCAAGCTATTCCACCACTAGGGTTGAATCCTGCTTCACCTCCTCAGCACACTCCAGTTTCAGGGCCTACACCATCATCAGGGGCACTTGCATTTA
CCTTCCGACAGCTAGACCGGATTGAAGACAAGTTGAAAACTTATTGGGTCTATGCTAAGGAAAGAGATGAAGCAATTAGAGAGTTTTACCTTTCTATCGCCCCTAGCATT
GCTCCTGTGTTCCCTGACTTCCCTCAAGCCCTGCTGCCACAAGAAGAGAAAGAATCTGACGATGAAGATGAAGATGAAGATGAAGAAAATGAAGAGGTTCCCTCAAATAA
GGAATAA
Protein sequenceShow/hide protein sequence
MHRTRSRKPTGFSPVVEEGRTIIHTPSSSTMSATPRENPSGSRQRRLTPTDSLHQTQKPADKPLRKRSKNVGESSRQGKTQEGNVSAPSTNSTSSCRDKPFVTYLAKKKS
SKKAEPEKPLVLEPLKTVKMLPDVFEDMIQQRDLEDEKKAEKKKEEERQEAERAREAEKAIKQDQELGRLAVELKLLEEEKERREILREEEKRRKEAEDFLAAFEPLHKA
QKQQADEAFDPLFEYDVRGPPPAAESTSSGEKKGEEKIEEAETSSDSETDSDFEIKELDDDQEKMEIMRKRDFLNEKGFSNRAGTLRDFVTKIITQYKWQEFCAHPQEVV
VPLVREFYVGLREESMSMAVVRVEGVSDKGEDTSAKQFKARVDNVASLHQKQTDPTTHDNTISVERVMLLYNIMKGLKINIGSIIREEILSCGRKIAGKLFFGSLITQLC
KRVKIVPEKDEERHFFRPTIDLPLIGKLQQNNAQRKDKASTSQAIPPLGLNPASPPQHTPVSGPTPSSGALAFTFRQLDRIEDKLKTYWVYAKERDEAIREFYLSIAPSI
APVFPDFPQALLPQEEKESDDEDEDEDEENEEVPSNKE