CuGenDBv2

Tan0009488 (gene) of Snake gourd v1 genome

Gene ID: Tan0009488
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Genome location: LG08:7558531..7562056
RNA-Seq Expression: Tan0009488
Synteny: Tan0009488
Gene Ontology terms: NA
InterPro domains: NA
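
The genome location string in this record can be split into its parts programmatically. The following is a minimal Python sketch, not part of CuGenDB itself; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("LG08:7558531..7562056")

# Assumption: 1-based inclusive coordinates, so span = end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 3526 bp on LG08.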


Homology
GenBank top hits (e value, % identity, alignment)
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus]; e value 2.9e-94; identity 71.15%
Query:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK
        MKKARK VA +S A ALFE S +GIKH+ LLQDYEEL NETE MK+KLLIAKRKK  L  EVRFL HRYE LK QPAN  PKV  + PRN+E++PP  KK
Subjt:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK

Query:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL
        EKSSRKR+ SL PLA AHD+NQRG IYNG+EA+ R SQS F +N KS+  CS+KEV +++SFP FDQKERVY  HE AA RNMTPVFDLNQISR EEEEL
Subjt:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL

Query:  QACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA
        QA F+P+R E +P+NIF R+EHDAKNS+L++SSMCRN  NGSN++GKRKISWQDQVALRA
Subjt:  QACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo]; e value 1.0e-94; identity 72.41%
Query:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK
        MKKARK VA DS A ALFE S +GIKH+ LLQDY+EL NETE +K+KLLIAKRKK+ L  EVRFL HRYE LKNQPAN  PKV  +L RN+E+RPP+ KK
Subjt:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK

Query:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL
        EKSSRKR+ SL PLA AHDLNQRG IYNG+EA+ R SQS F +N KS+  CS+KEV ++NSFP FDQKERVY  HE AA RNMTPVFDLNQISR EEEE+
Subjt:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL

Query:  QACFEPLR-TEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA
        QA FEPLR  E + +NIF R+EHDAKNSDL++SSMCRN  NGSN++GKRKISWQDQVALRA
Subjt:  QACFEPLR-TEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia]; e value 1.2e-87; identity 70.23%
Query:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK
        MKKARK  AMD +A ALFE   IG KH  LLQDYE+L+N TE MKE+LLIAKRKKS L AEVRFL HRYEFLKNQ  N+ PK  LE P+N EIRPP AKK
Subjt:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK

Query:  EKSSRKRDPSLNPL--AWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEE
        EKSS+KR+ SL  L  A A DLNQRG IY+GMEA  R S+ VFH+N K  RMCS+ EV++HNS PIF+ KE +Y  HE AA RNMTPVFDLNQISR EEE
Subjt:  EKSSRKRDPSLNPL--AWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEE

Query:  ELQACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA
        ELQA FEP R E  P+N F R+E+D KNSDLMIS MCRNVG+GSN++GKRKISWQDQVALRA
Subjt:  ELQACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus]; e value 1.5e-77; identity 70.97%
Query:  MKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKKEKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHI
        MK+KLLIAKRKK  L  EVRFL HRYE LK QPAN  PKV  + PRN+E++PP  KKEKSSRKR+ SL PLA AHD+NQRG IYNG+EA+ R SQS F +
Subjt:  MKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKKEKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHI

Query:  NPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEELQACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSN
        N KS+  CS+KEV +++SFP FDQKERVY  HE AA RNMTPVFDLNQISR EEEELQA F+P+R E +P+NIF R+EHDAKNS+L++SSMCRN  NGSN
Subjt:  NPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEELQACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSN

Query:  KSGKRKISWQDQVALRA
        ++GKRKISWQDQVALRA
Subjt:  KSGKRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida]; e value 6.3e-97; identity 75%
Query:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK
        MKKARK +A+DSEA ALFE S IG+KH+ LLQDYEEL+NETE MKEKLLIAKRKK  L AEVRFL HRYE LKN+PANT PKVA ELP ++EI PP+ KK
Subjt:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK

Query:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL
         KSSRK + SL PLA AHDLNQRG IYNGMEA  R SQS F+IN K SRMCS+KEV I +S PIFDQKERVY  HE   +RNMTPVFDLNQISR EEEEL
Subjt:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL

Query:  QACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA
        QA FEPLR E + +N+F+R+EHDAKNSDL++SSMCRN GNGSN +GKRKISWQDQVALRA
Subjt:  QACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LH29 Uncharacterized protein; e value 1.4e-94; identity 71.15%
Query:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK
        MKKARK VA +S A ALFE S +GIKH+ LLQDYEEL NETE MK+KLLIAKRKK  L  EVRFL HRYE LK QPAN  PKV  + PRN+E++PP  KK
Subjt:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK

Query:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL
        EKSSRKR+ SL PLA AHD+NQRG IYNG+EA+ R SQS F +N KS+  CS+KEV +++SFP FDQKERVY  HE AA RNMTPVFDLNQISR EEEEL
Subjt:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL

Query:  QACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA
        QA F+P+R E +P+NIF R+EHDAKNS+L++SSMCRN  NGSN++GKRKISWQDQVALRA
Subjt:  QACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC103493683; e value 4.9e-95; identity 72.41%
Query:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK
        MKKARK VA DS A ALFE S +GIKH+ LLQDY+EL NETE +K+KLLIAKRKK+ L  EVRFL HRYE LKNQPAN  PKV  +L RN+E+RPP+ KK
Subjt:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK

Query:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL
        EKSSRKR+ SL PLA AHDLNQRG IYNG+EA+ R SQS F +N KS+  CS+KEV ++NSFP FDQKERVY  HE AA RNMTPVFDLNQISR EEEE+
Subjt:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL

Query:  QACFEPLR-TEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA
        QA FEPLR  E + +NIF R+EHDAKNSDL++SSMCRN  NGSN++GKRKISWQDQVALRA
Subjt:  QACFEPLR-TEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein; e value 4.9e-95; identity 72.41%
Query:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK
        MKKARK VA DS A ALFE S +GIKH+ LLQDY+EL NETE +K+KLLIAKRKK+ L  EVRFL HRYE LKNQPAN  PKV  +L RN+E+RPP+ KK
Subjt:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK

Query:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL
        EKSSRKR+ SL PLA AHDLNQRG IYNG+EA+ R SQS F +N KS+  CS+KEV ++NSFP FDQKERVY  HE AA RNMTPVFDLNQISR EEEE+
Subjt:  EKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEEL

Query:  QACFEPLR-TEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA
        QA FEPLR  E + +NIF R+EHDAKNSDL++SSMCRN  NGSN++GKRKISWQDQVALRA
Subjt:  QACFEPLR-TEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC111024696; e value 5.8e-88; identity 70.23%
Query:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK
        MKKARK  AMD +A ALFE   IG KH  LLQDYE+L+N TE MKE+LLIAKRKKS L AEVRFL HRYEFLKNQ  N+ PK  LE P+N EIRPP AKK
Subjt:  MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKK

Query:  EKSSRKRDPSLNPL--AWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEE
        EKSS+KR+ SL  L  A A DLNQRG IY+GMEA  R S+ VFH+N K  RMCS+ EV++HNS PIF+ KE +Y  HE AA RNMTPVFDLNQISR EEE
Subjt:  EKSSRKRDPSLNPL--AWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEE

Query:  ELQACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA
        ELQA FEP R E  P+N F R+E+D KNSDLMIS MCRNVG+GSN++GKRKISWQDQVALRA
Subjt:  ELQACFEPLRTEVDPRNIFTRTEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA

A0A6J1KKM1 uncharacterized protein LOC111494012; e value 1.1e-73; identity 66.39%
Query:  IGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKKEKSSRKRDPSLNPLAWAHDLNQ
        I I H  LLQDY ELQNETE MKEKLLI K+KKS L AEVRFL H+YE LKN P  T PKV  +LP+N++IRPPV+KKE  SRKR+        A +LNQ
Subjt:  IGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKKEKSSRKRDPSLNPLAWAHDLNQ

Query:  RGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEELQACFEPLRTEVDP---RNIFTR
        RG I +GMEAT R ++SV ++N K SRMCS+KE++I + FPI  QKERVY  HEVA   NMTPVFDLNQISR EEEELQ  FEP+RTE +    +NI +R
Subjt:  RGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEELQACFEPLRTEVDP---RNIFTR

Query:  TEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA
        +E DAKNSDLM+SSMCRNVGNGSN++GKRKISWQD+VALRA
Subjt:  TEHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G30630.1 unknown protein; e value 5.1e-12; identity 29.87%
Query:  ELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLK-NQPANTLPKVALELPRNIEIRPPVAKKEKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATP
        EL+ E E  +++L + K+K+  L +EVRFL  RYE LK +Q   T P++ L L  +  +  P  +K    RK+   +       DL  +  I N  EA  
Subjt:  ELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLK-NQPANTLPKVALELPRNIEIRPPVAKKEKSSRKRDPSLNPLAWAHDLNQRGAIYNGMEATP

Query:  RISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEELQACFEPLRTEVDPRNIFTRTEHDA----KNSDLM
            S   ++ K  R      +    S P  + +          +  +  P FDLNQISREEEE           EV+  ++      +A    + SDL 
Subjt:  RISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEELQACFEPLRTEVDPRNIFTRTEHDA----KNSDLM

Query:  IS---SMCRNVGNGSNKSGKRKISWQDQVAL
        +     +C +V    N++ KRK++WQD VAL
Subjt:  IS---SMCRNVGNGSNKSGKRKISWQDQVAL

AT5G57910.1 unknown protein; e value 1.3e-18; identity 31.47%
Query:  FEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKKEKSSRKRDPSLNPLAWA
        FE  K+  +H  L+QDY EL  ETE M+++L   + +K+ L AEVRFL  RY  L+      + KV               ++    +K    ++P    
Subjt:  FEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKKEKSSRKRDPSLNPLAWA

Query:  HDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEELQACFEPLRTEVDPRNIF
                  N  EA  +                       H S P  +  E+ +   + +  R + P+FDLNQIS EEE+E +A        VD     
Subjt:  HDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEELQACFEPLRTEVDPRNIF

Query:  TRTEHDAKNSDLMISSM-----------CRNVGNGSNKSGKRKISWQDQVA
        TR E       LMISS+           CRN GNGSN   KRKISWQD VA
Subjt:  TRTEHDAKNSDLMISSM-----------CRNVGNGSNKSGKRKISWQDQVA


Sequences
CDS sequence
ATGAAGAAGGCTCGAAAAAGGGTGGCTATGGATTCTGAGGCGGGTGCTCTGTTCGAGGGCTCGAAAATCGGAATCAAGCATCGGGGGCTCTTGCAGGATTACGAGGAGTT
GCAGAACGAAACAGAAACCATGAAGGAGAAATTACTGATCGCGAAGCGAAAGAAGTCGATCCTTTCGGCTGAAGTTCGATTTTTGAGTCATAGATATGAATTCTTGAAGA
ACCAGCCTGCAAACACCCTACCAAAGGTTGCGCTCGAGCTGCCACGAAACATTGAAATCAGACCTCCCGTCGCGAAGAAAGAAAAAAGTTCTCGAAAACGAGACCCTTCT
TTGAATCCCCTTGCTTGGGCTCATGATTTAAACCAAAGGGGAGCAATCTACAATGGGATGGAAGCCACCCCTCGAATATCTCAGTCAGTTTTTCACATAAACCCGAAGTC
ATCAAGGATGTGCAGCGAGAAGGAAGTTGCAATACACAATTCTTTTCCCATTTTTGACCAGAAAGAGAGAGTATACATACCACATGAAGTTGCTGCCACCAGAAACATGA
CCCCGGTTTTCGATCTTAACCAGATCTCGAGGGAGGAAGAAGAAGAATTGCAAGCTTGTTTCGAACCGCTGAGAACGGAGGTGGATCCGAGGAATATCTTTACAAGAACC
GAACACGATGCGAAGAACAGTGACTTGATGATATCATCAATGTGTAGGAATGTGGGCAATGGCTCAAACAAATCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTTGC
TTTAAGAGCATGA
mRNA sequence
GAGAGATGGATTTTGAGAGAGAAGGAAGAGGGAGAGGGGGGGAGAGAGAGAGAGAGATGGGTTTTGAGAGAGAATGGTTTTGAGAGAAAAGGGGAGGAGAGATTTGGAAG
AGAGAGGGAAAATATGGGTTTTGAGAGAGAGATGAGACAGAGAGATTGGTAGAAAATGTCCATTAATTTGATTAGAAAGAGAGAGTGGAAATTTGCAAAAAGAAAGAGGA
GAAAGCAAGAAATGGCCACAAACAGCATAATCTCCAACCAAACAGCATTAACTCAGAAGAGTCAGAACAGAGAAGGAAAAAGAAAAAAGCATCATCAACAACATATCCTT
TTTCCTCTTATCCAATCCAATCTCTTTTCTATTTCTCTCTCTTCCCTTTCAAAAAAAAAAAATCATTCCCTCAATTCACTGAATTCTTGTATTCAGATTTTTCACCCTCA
CCCATCAAATACACCAACAACAAGAATCCCCCTTACCCTTTCTCTCTCCATCTCTTCAATGGCCCAGTTGGACCAAACCAATTTTTATGCCCAACAAATTCGACAATCAA
CCAGAGAAAATGGTTACATTTCTCTCTCTTCCAAAACAAGTTCCCTGAATTCTCCAGTGATTTGGGCTTTTTGCGTCTCTTCTCTTCTAGGTCAAGACCTTCATCAACCA
AACGCCAAAACCCTCCACTCTTTTCGTCTCTTTTTTCTTCTCCGCCATTGTTTTTCCTTCTTCCTCTTGATTCTCTTTTCTGGGTCCCCTCTTCTTTCTGCGATCGCCCT
TTTTTCCCCTTTCCAATCGGATGAAGAAGGCTCGAAAAAGGGTGGCTATGGATTCTGAGGCGGGTGCTCTGTTCGAGGGCTCGAAAATCGGAATCAAGCATCGGGGGCTC
TTGCAGGATTACGAGGAGTTGCAGAACGAAACAGAAACCATGAAGGAGAAATTACTGATCGCGAAGCGAAAGAAGTCGATCCTTTCGGCTGAAGTTCGATTTTTGAGTCA
TAGATATGAATTCTTGAAGAACCAGCCTGCAAACACCCTACCAAAGGTTGCGCTCGAGCTGCCACGAAACATTGAAATCAGACCTCCCGTCGCGAAGAAAGAAAAAAGTT
CTCGAAAACGAGACCCTTCTTTGAATCCCCTTGCTTGGGCTCATGATTTAAACCAAAGGGGAGCAATCTACAATGGGATGGAAGCCACCCCTCGAATATCTCAGTCAGTT
TTTCACATAAACCCGAAGTCATCAAGGATGTGCAGCGAGAAGGAAGTTGCAATACACAATTCTTTTCCCATTTTTGACCAGAAAGAGAGAGTATACATACCACATGAAGT
TGCTGCCACCAGAAACATGACCCCGGTTTTCGATCTTAACCAGATCTCGAGGGAGGAAGAAGAAGAATTGCAAGCTTGTTTCGAACCGCTGAGAACGGAGGTGGATCCGA
GGAATATCTTTACAAGAACCGAACACGATGCGAAGAACAGTGACTTGATGATATCATCAATGTGTAGGAATGTGGGCAATGGCTCAAACAAATCAGGAAAAAGGAAGATC
TCATGGCAAGATCAAGTTGCTTTAAGAGCATGAAGTTATTCTCTCTTATTGAAAGAGCCGCTGAGGATATCCTTCATTTGAAGAGGTGGGAAAACTTGGCCTCCTTGAAA
TTCCATTATTATTATTGTTGTTTTATAATCCACACTCCGCTCAAGTGAGAAAATTGTGACACTATAACATTTTAGGGTGGAAAATTTGAAGGATATCTAATGTCTTAAAT
AGGTGATGATTGTGATGGAAACGTGTAATCTTAAACACATCTGAAATTACAGATTGTCCATCTTAAGAAAATCCATGACGAGTAATTCTATCTTAGTGGATTGTTCAAAA
ATTGTGAACGACAATTGTTTACTATATTTTGTTTGAAAGCG
Protein sequence
MKKARKRVAMDSEAGALFEGSKIGIKHRGLLQDYEELQNETETMKEKLLIAKRKKSILSAEVRFLSHRYEFLKNQPANTLPKVALELPRNIEIRPPVAKKEKSSRKRDPS
LNPLAWAHDLNQRGAIYNGMEATPRISQSVFHINPKSSRMCSEKEVAIHNSFPIFDQKERVYIPHEVAATRNMTPVFDLNQISREEEEELQACFEPLRTEVDPRNIFTRT
EHDAKNSDLMISSMCRNVGNGSNKSGKRKISWQDQVALRA
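
The CDS and protein sequences listed for this gene can be checked against each other by translating the CDS. The following is a minimal Python sketch, assuming the standard nuclear genetic code; the `translate` helper is hypothetical and is not a CuGenDB tool. To keep the example short it uses only the first 30 and last 18 nucleotides of the CDS shown above rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGAAGGCTCGAAAAAGGGTGGCTATG"  # first 30 nt of the CDS
cds_end = "GTTGCTTTAAGAGCATGA"                # last 18 nt, including the stop codon

print(translate(cds_start))  # MKKARKRVAM
print(translate(cds_end))    # VALRA
```

Both fragments agree with the start (`MKKARKRVAM`) and end (`VALRA`) of the protein sequence listed above; passing the full CDS to `translate` would extend the same check to the whole protein.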