CuGenDBv2

Lsi11G006880 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi11G006880
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Unknown protein
Genome location: chr11:7261788..7264789
RNA-Seq Expression: Lsi11G006880
Synteny: Lsi11G006880
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus] (e value: 1.9e-112; % identity: 78.91)
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK +A +S ACALF+SSM+GIKHQSLLQDYEELHNETEAMK+KLLIAKRKK TLL EV                 RFLRHRYE LK QPANIQPKV
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT
        GF+ PRNLE++PP +KKEKSSRKREASLKPLAQ HD+NQRGGIYNG+EA SRK QSFFD NQKS  CSKKEV +++SFP FDQKERVYRAHE AAN NMT
Subjt:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT

Query:  PVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQISREEEELQAG KP R+EDEPKNI  RSEHDAKNS+LV+SSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo] (e value: 7.3e-112; % identity: 79.35)
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK +A DS ACALF++SM+GIKHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLL EV                 RFLRHRYE LKNQPANIQPKV
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT
        GF+L RNLE+RPPI+KKEKSSRKREASLKPLAQ HDLNQRGGIYNG+EA SRK QSFFD NQKS  CSKKEV ++NSFP FDQKERVYRAHE AAN NMT
Subjt:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT

Query:  PVFDLNQISREEEELQAGSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQISREEEE+QAG +P R +EDE KNI  RSEHDAKNSDLV+SSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISREEEELQAGSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia] (e value: 7.1e-91; % identity: 68.95)
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK  A+D +A ALF++ MIG KH  LLQDYE+L N TE MKE+LLIAKRKK+TLLAEV                 RFLRHRYEFLKNQ  N QPK 
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFELPRNLEIRPPIIKKEKSSRKREASLKPL--AQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTN
        G E P+N EIRPP  KKEKSS+KREASLK L  AQ  DLNQRGGIY+GMEA SRK +  F  NQK RMCS  EV++HNS PIF+ KE +YR HE AA+ N
Subjt:  GFELPRNLEIRPPIIKKEKSSRKREASLKPL--AQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTN

Query:  MTPVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        MTPVFDLNQISREEEELQAG +PPRME+ PKN   RSE+D KNSDL+IS MCRN G+GSNRAGKRKISWQDQVALRA
Subjt:  MTPVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus] (e value: 1.4e-91; % identity: 78.02)
Query:  MKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGI
        MK+KLLIAKRKK TLL EV                 RFLRHRYE LK QPANIQPKVGF+ PRNLE++PP +KKEKSSRKREASLKPLAQ HD+NQRGGI
Subjt:  MKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGI

Query:  YNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSD
        YNG+EA SRK QSFFD NQKS  CSKKEV +++SFP FDQKERVYRAHE AAN NMTPVFDLNQISREEEELQAG KP R+EDEPKNI  RSEHDAKNS+
Subjt:  YNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSD

Query:  LVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        LV+SSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  LVISSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida] (e value: 1.4e-110; % identity: 79.64)
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK +AVDSEACALF++SMIG+KHQSLLQDYEEL NETEAMKEKLLIAKRKK TLLAEV                 RFLRHRYE LKN+PAN QPKV
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT
         FELP +LEI PPI KK KSSRK EASLKPLA+ HDLNQRGGIYNGMEAPSRK QSFF+ NQKSRMCSKKEVTI +S PIFDQKERVYR HE   + NMT
Subjt:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT

Query:  PVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQISREEEELQAG +P RMEDE KN+ SRSEHDAKNSDLV+SSMCRNDGNGSN AGKRKISWQDQVALRA
Subjt:  PVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LH29 Uncharacterized protein (e value: 9.4e-113; % identity: 78.91)
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK +A +S ACALF+SSM+GIKHQSLLQDYEELHNETEAMK+KLLIAKRKK TLL EV                 RFLRHRYE LK QPANIQPKV
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT
        GF+ PRNLE++PP +KKEKSSRKREASLKPLAQ HD+NQRGGIYNG+EA SRK QSFFD NQKS  CSKKEV +++SFP FDQKERVYRAHE AAN NMT
Subjt:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT

Query:  PVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQISREEEELQAG KP R+EDEPKNI  RSEHDAKNS+LV+SSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC103493683 (e value: 3.6e-112; % identity: 79.35)
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK +A DS ACALF++SM+GIKHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLL EV                 RFLRHRYE LKNQPANIQPKV
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT
        GF+L RNLE+RPPI+KKEKSSRKREASLKPLAQ HDLNQRGGIYNG+EA SRK QSFFD NQKS  CSKKEV ++NSFP FDQKERVYRAHE AAN NMT
Subjt:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT

Query:  PVFDLNQISREEEELQAGSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQISREEEE+QAG +P R +EDE KNI  RSEHDAKNSDLV+SSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISREEEELQAGSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein (e value: 3.6e-112; % identity: 79.35)
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK +A DS ACALF++SM+GIKHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLL EV                 RFLRHRYE LKNQPANIQPKV
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT
        GF+L RNLE+RPPI+KKEKSSRKREASLKPLAQ HDLNQRGGIYNG+EA SRK QSFFD NQKS  CSKKEV ++NSFP FDQKERVYRAHE AAN NMT
Subjt:  GFELPRNLEIRPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMT

Query:  PVFDLNQISREEEELQAGSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQISREEEE+QAG +P R +EDE KNI  RSEHDAKNSDLV+SSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISREEEELQAGSKPPR-MEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC111024696 (e value: 3.5e-91; % identity: 68.95)
Query:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK  A+D +A ALF++ MIG KH  LLQDYE+L N TE MKE+LLIAKRKK+TLLAEV                 RFLRHRYEFLKNQ  N QPK 
Subjt:  MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFELPRNLEIRPPIIKKEKSSRKREASLKPL--AQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTN
        G E P+N EIRPP  KKEKSS+KREASLK L  AQ  DLNQRGGIY+GMEA SRK +  F  NQK RMCS  EV++HNS PIF+ KE +YR HE AA+ N
Subjt:  GFELPRNLEIRPPIIKKEKSSRKREASLKPL--AQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTN

Query:  MTPVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
        MTPVFDLNQISREEEELQAG +PPRME+ PKN   RSE+D KNSDL+IS MCRN G+GSNRAGKRKISWQDQVALRA
Subjt:  MTPVFDLNQISREEEELQAGSKPPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A6J1KKM1 uncharacterized protein LOC111494012 (e value: 4.2e-81; % identity: 68.48)
Query:  MIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKKEKSS
        MI I H  LLQDY EL NETEAMKEKLLI K+KK+TLLAEV                 RFLRH+YE LKN P   QPKVGF+LP+NL+IRPP+ KKE  S
Subjt:  MIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKKEKSS

Query:  RKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKP
        RKREA         +LNQRGGI +GMEA +RK +S  + NQKSRMCSKKE++I + FPI  QKERVYRAHEVA NTNMTPVFDLNQISREEEELQ G +P
Subjt:  RKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKP

Query:  PRMEDEP---KNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
         R EDE    KNICSRSE DAKNSDL++SSMCRN GNGSNRAGKRKISWQD+VALRA
Subjt:  PRMEDEP---KNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G30630.1 unknown protein (e value: 2.1e-11; % identity: 28.92)
Query:  ELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLK-NQPANIQPKV-------GFELPRNLEIRPPIIKKEKSSRKREAS
        EL  E E  +++L + K+K+ TL +EV                 RFLR RYE LK +Q     P++       G E+PR          K    RK+++ 
Subjt:  ELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLK-NQPANIQPKV-------GFELPRNLEIRPPIIKKEKSSRKREAS

Query:  LKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKPPRMEDE
        ++      DL  +  I N  EA +    S     ++ R      +T   S P  + +          + T+  P FDLNQISREEEE +        E  
Subjt:  LKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSKPPRMEDE

Query:  PKNICSRSEHDAKNSDLVIS---SMCRNDGNGSNRAGKRKISWQDQVAL
               +  D + SDL +     +C +     NRA KRK++WQD VAL
Subjt:  PKNICSRSEHDAKNSDLVIS---SMCRNDGNGSNRAGKRKISWQDQVAL

AT5G57910.1 unknown protein (e value: 1.1e-15; % identity: 32.57)
Query:  FQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK
        F+   +  +H SL+QDY ELH ETEAM+++L   + +KATL+AEV                 RFLR RY  L+                    +P  IKK
Subjt:  FQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFELPRNLEIRPPIIKK

Query:  EKSSRKREASLKPLAQVHDLNQRGGIYNGME-APSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEE-EL
         + S                   GG    +E +PS K ++                T H S P  +  E+ +   + +    + P+FDLNQIS EEE E 
Subjt:  EKSSRKREASLKPLAQVHDLNQRGGIYNGME-APSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEE-EL

Query:  QA-GSKPPRMEDEPKNICSR---SEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVA
        +A  +       E  N C R   S  + +  D+  SS CRN GNGSN   KRKISWQD VA
Subjt:  QA-GSKPPRMEDEPKNICSR---SEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVA


Sequences
CDS sequence
ATGAAGAAAGCTCGAAAACGGATGGCTGTGGATTCAGAGGCATGTGCTCTGTTCCAGAGCTCGATGATTGGGATCAAACATCAAAGTCTCTTGCAGGATTACGAGGAGTT
GCATAACGAAACAGAAGCCATGAAGGAGAAACTGCTGATCGCAAAGCGGAAAAAGGCGACCCTTTTGGCTGAAGTTCGGAATTTGTTGATGTTGTTAACTGAGTCATGTT
TATTTCTCAACTCCTTTAGATTTTTGAGGCATAGATATGAATTCTTGAAGAACCAGCCTGCAAACATCCAACCGAAGGTTGGTTTCGAGCTGCCACGGAACCTTGAAATC
AGACCTCCCATCATTAAGAAAGAAAAGAGTTCTCGAAAAAGAGAAGCTTCTTTGAAACCCCTTGCTCAGGTTCATGACTTAAACCAAAGGGGAGGAATCTACAATGGGAT
GGAAGCCCCTTCTCGAAAACCTCAGTCGTTTTTCGACACAAACCAGAAGTCAAGGATGTGCAGCAAGAAGGAAGTCACTATACACAATTCTTTTCCTATTTTTGACCAGA
AAGAGAGAGTATACAGAGCACATGAAGTTGCTGCCAACACGAACATGACCCCAGTTTTCGACCTTAACCAGATCTCGAGAGAGGAGGAAGAACTGCAGGCTGGTTCCAAA
CCACCGAGAATGGAGGACGAGCCAAAGAATATCTGTTCAAGAAGCGAACACGATGCGAAGAACAGTGACTTGGTGATATCATCAATGTGTAGGAATGATGGTAATGGATC
AAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCCTTAAGAGCATGA
mRNA sequence
AAGAAATTGGGGTATTGCAATTTTTGCAAGAAATGTCCATTAAAGAGAGAGAAAGAGAATCTTTTTGGATTGATTGATTTAATAGAAGAAAAGACCTTAAAAGCAAAGCA
AAGCAAAGCACTATCCACAAACATCTTCTGCAAAACCCCATTAAACAGTTAACATCCACAAACAGAAAATTCATTCCCTTTTCTCTTTCTTCTCTCAGAAAAAAAGGAAA
AAAAAAAAGAAAAAAATCACTCCCACAATTCACTGAATATTTGTACTCAAAGATTTTTCACCCTCCACCCATCAATGGAATCCCTTTTCTATATCTCATCAATGGCCCAG
TTGGACCAAACCAATTTTTTTGCCCAACAAATTCCTAAATCTATCAGAGAAAATTGGCTAAATTTTTCTTCTCTCTCTCTCTCTCAACTTGCCAGAATCGTCCTGTGATT
TGGGCTTTTTTTGCCTTCTTTCTTAGTTGTCTTCTAGGTCAAGACTTAACCAAAAAAACCCTCCATTCTTTTCTCCGCCATTGTTTTTCTTTCTTTCTCTTCATCCTCTT
TCTGGGTCTCTTCTTTTTTCTGCCATCGTTCCGGTTTTTCTCTTTCAATCGGATGAAGAAAGCTCGAAAACGGATGGCTGTGGATTCAGAGGCATGTGCTCTGTTCCAGA
GCTCGATGATTGGGATCAAACATCAAAGTCTCTTGCAGGATTACGAGGAGTTGCATAACGAAACAGAAGCCATGAAGGAGAAACTGCTGATCGCAAAGCGGAAAAAGGCG
ACCCTTTTGGCTGAAGTTCGGAATTTGTTGATGTTGTTAACTGAGTCATGTTTATTTCTCAACTCCTTTAGATTTTTGAGGCATAGATATGAATTCTTGAAGAACCAGCC
TGCAAACATCCAACCGAAGGTTGGTTTCGAGCTGCCACGGAACCTTGAAATCAGACCTCCCATCATTAAGAAAGAAAAGAGTTCTCGAAAAAGAGAAGCTTCTTTGAAAC
CCCTTGCTCAGGTTCATGACTTAAACCAAAGGGGAGGAATCTACAATGGGATGGAAGCCCCTTCTCGAAAACCTCAGTCGTTTTTCGACACAAACCAGAAGTCAAGGATG
TGCAGCAAGAAGGAAGTCACTATACACAATTCTTTTCCTATTTTTGACCAGAAAGAGAGAGTATACAGAGCACATGAAGTTGCTGCCAACACGAACATGACCCCAGTTTT
CGACCTTAACCAGATCTCGAGAGAGGAGGAAGAACTGCAGGCTGGTTCCAAACCACCGAGAATGGAGGACGAGCCAAAGAATATCTGTTCAAGAAGCGAACACGATGCGA
AGAACAGTGACTTGGTGATATCATCAATGTGTAGGAATGATGGTAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCCTTAAGAGCATGA
AGTTTTCTCTTATTGAAAGAACCATTCAGGATCTTCATTTGCATAGATGGGAAAACTGAGATTCCTTGAAATTCCATTATTATTATTATTATTATTATTTTATATTTCAC
ACTCGGCTCTAAGTGAGAAAAATTTGTGACGCCATCACGGTAAGCTTGTGTAGAAAGTCATTACAAACAAACTTTATAGGACTATACTTGAAGGATATCTAATGTCGTAG
CAAGTCCGAGGCAACCATTGTGATTGAAACTTATGAAGAATGTCGATTTCTCCATCTTAGGGACCCTATCTTTGTGAATTATTTAGGACATGATGAAATAAAAATGTGTA
CCATATTTGTTGG
Protein sequence
MKKARKRMAVDSEACALFQSSMIGIKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLAEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFELPRNLEI
RPPIIKKEKSSRKREASLKPLAQVHDLNQRGGIYNGMEAPSRKPQSFFDTNQKSRMCSKKEVTIHNSFPIFDQKERVYRAHEVAANTNMTPVFDLNQISREEEELQAGSK
PPRMEDEPKNICSRSEHDAKNSDLVISSMCRNDGNGSNRAGKRKISWQDQVALRA
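
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `translate`, `CODON_TABLE`, and `cds_start` are illustrative and not part of the database record. Only the first 21 bases of the CDS are used here; pasting the full CDS (line breaks are ignored) should work the same way.

```python
# Sketch: translate a CDS using the standard genetic code (assumed: NCBI table 1).
# All names below are hypothetical helpers, not part of CuGenDBv2.

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the CDS sequence shown in this record.
cds_start = "ATGAAGAAAGCTCGAAAACGG"
print(translate(cds_start))  # MKKARKR
```

The result matches the first seven residues of the listed protein sequence (MKKARKR).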