; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011671 (gene) of Snake gourd v1 genome

Gene IDTan0011671
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRibosomal RNA small subunit methyltransferase G
Genome locationLG09:67564579..67568165
RNA-Seq ExpressionTan0011671
SyntenyTan0011671
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035686.1 hypothetical protein SDJN02_02484 [Cucurbita argyrosperma subsp. argyrosperma]3.7e-11387.21Show/hide
Query:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKK
        MKKMKGA SQ+PS FDDSK RFKHQ+LLQDYHELEKETETAKRKL+MMKQKKMTL+AEVRFL+KRYEYL+KNQ  +D HSNGNPVQQKQ N QVANN KK
Subjt:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKK

Query:  GKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEEL
        GKN SRRRPAL PLP ISD NQKERI+R +DIP Q+STPIPVLDLNQKAKT RKKANQQNST V DLNQKERMYS RDASERTITPFFDLNQISMEEEEL
Subjt:  GKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEEL

Query:  QTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV
        QTHY+ LRADELKKSLLRG NDEQQNDIKISACR+VGDGPSRAGKRKISWQDQVALRV
Subjt:  QTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV

XP_022932147.1 uncharacterized protein LOC111438466 [Cucurbita moschata]9.6e-11487.6Show/hide
Query:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKK
        MKKMKGA SQ+PS FDDSK RFKHQ+LLQDYHELEKETETAKRKL+MMKQKKMTL+AEVRFL+KRYEYL+KNQ  +D HSNGNPVQQKQ N QVANN KK
Subjt:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKK

Query:  GKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEEL
        GKN SRRRPALQPLP ISD NQKERI+R +DIP Q+STPIPVLDLNQKAKT RKKANQQNST V DLNQKERMYS RDASERTITPFFDLNQISMEEEEL
Subjt:  GKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEEL

Query:  QTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV
        QTHY+ LRADELKKSLLRG NDEQQNDIKISACR+VGDGPSRAGKRKISWQDQVALRV
Subjt:  QTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV

XP_022965187.1 uncharacterized protein LOC111465117 [Cucurbita maxima]7.6e-11186.82Show/hide
Query:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKK
        MKKMKGA SQ+PS FDDSK RFKHQ+LLQDYHELEKETETAKRKL+MMKQKKMTL+AEVRFLRKRYEYL+KNQ  +D HSNGNPVQQK  + QVANN KK
Subjt:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKK

Query:  GKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEEL
        GKN SRRRPALQPLP ISD NQKERI+R +DIP Q+STPIPVLDLNQKAKT RKKANQQNST V DLNQKERMYS RDASERTITPFFDLNQISMEEEEL
Subjt:  GKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEEL

Query:  QTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV
        QTHY+ LRADELKKSLLRG NDEQQNDIKISACR+VG+GPSRAGKRKISWQDQVALRV
Subjt:  QTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV

XP_023531576.1 uncharacterized protein LOC111793773 [Cucurbita pepo subsp. pepo]7.6e-11186.49Show/hide
Query:  MKKMKGAVSQYPSG-FDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNK
        MKKMKGA SQ+PS  FDDSK RFKHQ+LLQDYHELEKETETAKRKL+MMKQKKMTL+AEVRFLRKRYEYL+KNQ  +D HSNGNPVQQKQ N QVANN K
Subjt:  MKKMKGAVSQYPSG-FDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNK

Query:  KGKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEE
        KGKN SRRRPALQPLP ISD NQKERI+R +DIP Q+ST IPVLDLNQKAKT RKKANQQNST V DLNQKER+YS RDASERTITPFFDLNQISMEEEE
Subjt:  KGKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEE

Query:  LQTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV
        LQT+Y+ LRADELKKSLLRG NDEQQNDIKISACR+VGDGPSRAGKRKISWQDQVALRV
Subjt:  LQTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV

XP_038879755.1 uncharacterized protein LOC120071506 isoform X2 [Benincasa hispida]3.4e-11185.38Show/hide
Query:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQ-SASDHHSNGNPVQQKQFNNQVANNNK
        MKKMKG VSQYP  F+DSKTRFKHQSLLQDY +LEKET T KRKL+MMK KKMTL+AEVRFLRKRYEYL+KNQ S +DH+SN  PVQQKQ NNQVANNNK
Subjt:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQ-SASDHHSNGNPVQQKQFNNQVANNNK

Query:  KGKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKT-SRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEE
        KGKN +RRR  LQPLP ISD NQKERID+G+D+PLQNSTPIPVLDLNQKAKT SRKKANQQNST V DLNQKERMYS RDASER ITPFFDLNQIS+EEE
Subjt:  KGKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKT-SRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEE

Query:  ELQTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV
        ELQT+YEPLR DELKKSLLRG NDEQQNDIKISACRS+GDGPSRAGKRKISWQDQVALRV
Subjt:  ELQTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV

TrEMBL top hitse value%identityAlignment
A0A0A0LW13 Uncharacterized protein1.4e-10583.46Show/hide
Query:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQ-SASDHHSNGNPVQQKQFNNQVANNNK
        MKKMKG VSQYP  ++DSKTRFKHQSLLQDYH+LEKET T KRKL+MMKQKKMTL+AEVRFLRKRYEYL+KNQ S  DH+SN   VQQKQ+ NQVANNNK
Subjt:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQ-SASDHHSNGNPVQQKQFNNQVANNNK

Query:  KGKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKT-SRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEE
        KGKN  RRR AL+PLP ISD NQKERI    D+PLQNSTPIPVLDLNQKAKT SRKKA+Q NST V DLNQKERM S RDASER ITPFFDLNQIS+EEE
Subjt:  KGKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKT-SRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEE

Query:  ELQTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV
        ELQTHYEPLR DELKKSLLRG NDEQQNDIKISACRS+GDGPSRAGKRKISWQDQVALRV
Subjt:  ELQTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV

A0A1S3B8I5 uncharacterized protein LOC1034873441.4e-10583.46Show/hide
Query:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQ-SASDHHSNGNPVQQKQFNNQVANNNK
        MKKMKG VSQYP  ++DSKTRFKHQSLLQDYH+LEKET T KRKL+MMKQKKMTL+AEVRFLRKRYEYL+KNQ S  DH+SNG  VQQKQ  NQVANNNK
Subjt:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQ-SASDHHSNGNPVQQKQFNNQVANNNK

Query:  KGKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKT-SRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEE
        K KN  RRR ALQPLP ISD NQKERI    D+P+QNSTPIPVLDLNQKAKT SRKKANQ NS  V DLNQKERM S RDASER ITPFFDLNQIS+EEE
Subjt:  KGKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKT-SRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEE

Query:  ELQTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV
        ELQTHYEPLR DELKKSLLRG NDEQQNDIKISACRS+GDGPSRAGKRKISWQDQVALRV
Subjt:  ELQTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV

A0A5A7T0V7 Uncharacterized protein1.4e-10583.46Show/hide
Query:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQ-SASDHHSNGNPVQQKQFNNQVANNNK
        MKKMKG VSQYP  ++DSKTRFKHQSLLQDYH+LEKET T KRKL+MMKQKKMTL+AEVRFLRKRYEYL+KNQ S  DH+SNG  VQQKQ  NQVANNNK
Subjt:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQ-SASDHHSNGNPVQQKQFNNQVANNNK

Query:  KGKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKT-SRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEE
        K KN  RRR ALQPLP ISD NQKERI    D+P+QNSTPIPVLDLNQKAKT SRKKANQ NS  V DLNQKERM S RDASER ITPFFDLNQIS+EEE
Subjt:  KGKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKT-SRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEE

Query:  ELQTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV
        ELQTHYEPLR DELKKSLLRG NDEQQNDIKISACRS+GDGPSRAGKRKISWQDQVALRV
Subjt:  ELQTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV

A0A6J1EVU7 uncharacterized protein LOC1114384664.7e-11487.6Show/hide
Query:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKK
        MKKMKGA SQ+PS FDDSK RFKHQ+LLQDYHELEKETETAKRKL+MMKQKKMTL+AEVRFL+KRYEYL+KNQ  +D HSNGNPVQQKQ N QVANN KK
Subjt:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKK

Query:  GKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEEL
        GKN SRRRPALQPLP ISD NQKERI+R +DIP Q+STPIPVLDLNQKAKT RKKANQQNST V DLNQKERMYS RDASERTITPFFDLNQISMEEEEL
Subjt:  GKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEEL

Query:  QTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV
        QTHY+ LRADELKKSLLRG NDEQQNDIKISACR+VGDGPSRAGKRKISWQDQVALRV
Subjt:  QTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV

A0A6J1HJN9 uncharacterized protein LOC1114651173.7e-11186.82Show/hide
Query:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKK
        MKKMKGA SQ+PS FDDSK RFKHQ+LLQDYHELEKETETAKRKL+MMKQKKMTL+AEVRFLRKRYEYL+KNQ  +D HSNGNPVQQK  + QVANN KK
Subjt:  MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKK

Query:  GKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEEL
        GKN SRRRPALQPLP ISD NQKERI+R +DIP Q+STPIPVLDLNQKAKT RKKANQQNST V DLNQKERMYS RDASERTITPFFDLNQISMEEEEL
Subjt:  GKNVSRRRPALQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEEL

Query:  QTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV
        QTHY+ LRADELKKSLLRG NDEQQNDIKISACR+VG+GPSRAGKRKISWQDQVALRV
Subjt:  QTHYEPLRADELKKSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30630.1 unknown protein5.2e-1732.74Show/hide
Query:  ELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKKGKNVSRRRPALQPLPPISDKNQKERIDRGVDI
        ELEKE E  +++LEM+KQK++TL +EVRFLR+RYE+L ++Q+        +P   +   +      +K     +++  ++   P  D   K  I    + 
Subjt:  ELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKKGKNVSRRRPALQPLPPISDKNQKERIDRGVDI

Query:  PLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEELQTHYEPLRADELKKSLLRGANDEQQNDIKISA
           N   +   DL++K K SR          + DLN       E + S     P FDLNQIS EEEE + + E + A+ +K ++L     +   + K+  
Subjt:  PLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEELQTHYEPLRADELKKSLLRGANDEQQNDIKISA

Query:  CRSVGDGPSRAGKRKISWQDQVALRV
        C  V    +RA KRK++WQD VAL V
Subjt:  CRSVGDGPSRAGKRKISWQDQVALRV

AT5G57910.1 unknown protein2.1e-2133.86Show/hide
Query:  FDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKKGKNVSRRRPALQPL
        F+D K RF+H SL+QDY EL  ETE  +++L+ ++++K TL+AEVRFLR+RY +L ++Q          P + K+     +N  KK +            
Subjt:  FDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKKGKNVSRRRPALQPL

Query:  PPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEE----LQTHYEPLRAD
          +S  N+ E   + V +P                                DLN  E+ + E   S +   P FDLNQIS EEE+    +  + E  R +
Subjt:  PPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEE----LQTHYEPLRAD

Query:  ELK--KSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQV-ALRV
        E    K L+  + + QQ D+K S+CR+ G+G   + KRKISWQD V ALRV
Subjt:  ELK--KSLLRGANDEQQNDIKISACRSVGDGPSRAGKRKISWQDQV-ALRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGATGAAAGGGGCTGTCTCTCAATACCCTTCTGGGTTTGACGATTCCAAGACCAGATTCAAGCATCAGAGTCTCCTTCAAGATTATCACGAATTGGAGAAGGA
AACAGAAACTGCTAAGAGGAAATTGGAGATGATGAAGCAAAAGAAGATGACCCTGGTTGCAGAAGTCCGGTTCTTGAGGAAAAGATATGAGTACTTAATCAAGAACCAGT
CAGCAAGCGACCATCATTCAAATGGCAATCCAGTACAGCAGAAACAATTTAACAATCAAGTGGCTAACAACAATAAGAAGGGGAAGAATGTTTCGAGAAGAAGACCCGCG
TTGCAACCCCTCCCGCCGATCTCTGATAAAAACCAAAAGGAAAGAATCGACAGAGGAGTCGATATTCCTCTGCAGAATTCTACTCCAATTCCTGTCCTTGACTTAAACCA
GAAGGCAAAGACTTCTAGGAAGAAAGCCAATCAACAGAATTCAACACAGGTTATTGACTTGAACCAGAAGGAAAGAATGTACAGTGAGAGAGATGCTAGCGAGAGAACCA
TCACTCCATTTTTTGACTTGAACCAAATTTCGATGGAGGAAGAGGAATTGCAGACACATTACGAGCCACTGCGAGCTGATGAGCTGAAGAAAAGCCTTCTTCGAGGTGCG
AACGATGAGCAGCAAAATGATATCAAGATTTCAGCGTGCAGGAGTGTTGGAGATGGTCCGAGTCGAGCTGGTAAAAGAAAGATTTCATGGCAAGACCAGGTGGCTTTAAG
GGTTTGA
mRNA sequenceShow/hide mRNA sequence
CCCAATACTAACTGGAGAGGAGGATATGCTCATCTGCTGGCTACTACAACAAAACCTTCACCATTCCATTCATTTCATTGGATGGAAAAAAGGCAAAGATCAAAAAAAGA
AAAAAGGAAAAAAAAAAAAAAAGTGGGCACTCTCTGTTGAATTTACCACTCTACCCCCACACCAAATCAGAAGCACTTTGGGCAACTTGGTCTTTGTCTCTGACGTGAAG
CCAAACCCATTTGGCCTTCCTACTTTACCTTCTCCATCTATGAAAAAGCAAGCTATATGGGTTTGAGACATTGAGTTGAGTTGAGGGTTAAAAAGTAAAGAAAAGGGCAA
AAACAATTTTCAGAGAGAGAGAGAGGAGGGGTATTTTAGTGATTTCACATAAATTGACCTGCTAAAATAATAGTAATAGTAATGGAAAATGGGGGAAAGCATCTTCCTTT
CTTTCTGATTCATGAATATGCAGAAGAAAAGATCATGATGAAAGAAACCAACATCACACAGCCAAACCGCGGTCCTGCTCCTGATCCTGGTTGTGTTCCTGTTCCTGCCC
CATCACAACAACAACAAAAAAAAGCCCTATAAAATAAGGTACATTTCCAATTCAACAAAACCAACCCTCTACACTGCAACAGAAACCTTCTCCCTTTAGCCAAAAATAGG
GGAAAAAAATTTCCTAACCCTTAAAACCCATCGGAAGCCAGAATCAAGCCCTCAAATTTATCTATAGGCAAAAATAAAATACCATCAGCCATCACGAGTCTTCCTTTTCT
TGTGTCTGTCTGTCTCTCTCTCTCTAGAAGTCTAGATTTCTCTGTCCTTGCAGAGAAAATCAAAGGGGAAGAAACAGAGAAACTCCACGCAGCCAATCAATTCTTGCTTT
CTTTCTCTCTCATTGACACCCTTTAATGGCCCAGCTGGACCAAAATTTACGTAGAGAAGAAAAACAAAAAAGAAAGAAAGAAAGAAAAGAAAGCGAAGATGGATAGATAT
CAGATAGATGTCTTCTAATGTCCTCCAACCAAACCCACCTCACAAAGAAACACACAAACCTTCTTTTCTCAACTCGCATTAAAAATGGCTCTCTCTTCCCTGAATTGCCC
TTTCTGCAACCCCCAAAAAATCCCCATCTTCATCACTGTCTCTGAAAAGAACAATATTTTTCACCGAGTTATGTGGGCTTTTTTGGGTTTTAGTTGGCTCGGTCAGAGCT
TCACAGAGAAAACAGGAAACCAAGAACCTTCACCCCTCTCTTCCTTCCCATTTCCAATTCCAATTCCAATTCCAATTCCCATCTATTTCTCTTTCTCCCCCACCTTTCTC
TTTCTCTTTCTTCTTCTTTGTCCTTTGCCTCTTCACCCACTTTCTCAATTTTCTTCCCCCACCTGTCTTTCTTCTTCATTACCCATCTTTCTCTCCCCCCATCTAACTTT
TCCCCTCTTTGCCCTTCTGGGTTTCTGATTTTTCTTCCTTCACATCTATGAAGAAGATGAAAGGGGCTGTCTCTCAATACCCTTCTGGGTTTGACGATTCCAAGACCAGA
TTCAAGCATCAGAGTCTCCTTCAAGATTATCACGAATTGGAGAAGGAAACAGAAACTGCTAAGAGGAAATTGGAGATGATGAAGCAAAAGAAGATGACCCTGGTTGCAGA
AGTCCGGTTCTTGAGGAAAAGATATGAGTACTTAATCAAGAACCAGTCAGCAAGCGACCATCATTCAAATGGCAATCCAGTACAGCAGAAACAATTTAACAATCAAGTGG
CTAACAACAATAAGAAGGGGAAGAATGTTTCGAGAAGAAGACCCGCGTTGCAACCCCTCCCGCCGATCTCTGATAAAAACCAAAAGGAAAGAATCGACAGAGGAGTCGAT
ATTCCTCTGCAGAATTCTACTCCAATTCCTGTCCTTGACTTAAACCAGAAGGCAAAGACTTCTAGGAAGAAAGCCAATCAACAGAATTCAACACAGGTTATTGACTTGAA
CCAGAAGGAAAGAATGTACAGTGAGAGAGATGCTAGCGAGAGAACCATCACTCCATTTTTTGACTTGAACCAAATTTCGATGGAGGAAGAGGAATTGCAGACACATTACG
AGCCACTGCGAGCTGATGAGCTGAAGAAAAGCCTTCTTCGAGGTGCGAACGATGAGCAGCAAAATGATATCAAGATTTCAGCGTGCAGGAGTGTTGGAGATGGTCCGAGT
CGAGCTGGTAAAAGAAAGATTTCATGGCAAGACCAGGTGGCTTTAAGGGTTTGAGTTCATTTTTTATCTCTAATGTCACTTAGAACTATATTTGGGTCAAGGGCTTCAGG
CATTTCCTCAGTTAAATGTAATTCCTTCTTTGGTTCTGTATTTTTCTATGTAAAGGGAAAGTAGTATTTGTACATGTTAATACATTTAAAAAAGAATGATGGTGAATATC
GAGAAGAATCAAAGTTGCAAATCCTTCGTTCTATTGTCTAGTTTTTTTTCTGAATTTGGCAATGGCTTAAAAAGAATAGGATCTGTTCTTTAAAGCCAGGCTCTGATTC
Protein sequenceShow/hide protein sequence
MKKMKGAVSQYPSGFDDSKTRFKHQSLLQDYHELEKETETAKRKLEMMKQKKMTLVAEVRFLRKRYEYLIKNQSASDHHSNGNPVQQKQFNNQVANNNKKGKNVSRRRPA
LQPLPPISDKNQKERIDRGVDIPLQNSTPIPVLDLNQKAKTSRKKANQQNSTQVIDLNQKERMYSERDASERTITPFFDLNQISMEEEELQTHYEPLRADELKKSLLRGA
NDEQQNDIKISACRSVGDGPSRAGKRKISWQDQVALRV