; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039719 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039719
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationscaffold10:46808315..46810408
RNA-Seq ExpressionSpg039719
SyntenySpg039719
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142767.1 uncharacterized protein LOC111012805 [Momordica charantia]4.9e-4240.34Show/hide
Query:  KEEDSQRDGQRRVEDEDIEELIGQMEPPFTDEIMEGE-------------VPHKFKVEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRF
        + E S++   ++ +  D+EEL+ Q + PFT+EIM  +                K +VEG  D V+L A I G++DE L  S G+  P T+ E ++RAQR+
Subjt:  KEEDSQRDGQRRVEDEDIEELIGQMEPPFTDEIMEGE-------------VPHKFKVEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRF

Query:  ISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRVEDSGRERHGHPSANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRS
        +SA E   SK               +E  GKR    D  RER G      R   E +D   + +P  KF +YT  TVPLEQVL  I +  LLK PERM +
Subjt:  ISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRVEDSGRERHGHPSANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRS

Query:  DPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDRGKRPTPTDQGKGGNNPPLEIRTILGGPSGGESNRKRKAAVREA
           +R++ +YC+FH DHGH T++C  L++E+E LI  GYLKE+V++    P  T  G+   +P  EIRTI+GGP   ES RKRK  VREA
Subjt:  DPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDRGKRPTPTDQGKGGNNPPLEIRTILGGPSGGESNRKRKAAVREA

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.3e-3931.11Show/hide
Query:  RKRARAVSKAEQGPKQREQKLSQWL--KEEDSQRDGQRRVEDEDIEELIGQMEPPFTDEIMEGEVPHKFKVEGYDDGVALAAVISGLQDERLLNSIGESQ
        R  AR++S   Q  K+   + S W   ++  +     R+ E E + E + + +              + KV    D  A+   ++ L DE L   +GE  
Subjt:  RKRARAVSKAEQGPKQREQKLSQWL--KEEDSQRDGQRRVEDEDIEELIGQMEPPFTDEIMEGEVPHKFKVEGYDDGVALAAVISGLQDERLLNSIGESQ

Query:  PRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRVEDSGRERHGHPSANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAI
        P T+VE + +A++ I  +ELL++K    E +     D+ K  + ++R+ +   R++    SA+      ++  P R+ P   + RYT  T+P+ ++L  I
Subjt:  PRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRVEDSGRERHGHPSANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAI

Query:  HDT---NLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFV-QDRGKRPTPTDQGKGGNNPPLE------IRTILGGPSG
         ++    LLKRPE++R D ++RN+ KYC FH DHGH T  C +L+ +IE LI++GY K+FV + R       ++ K    PP        I TI GGP+G
Subjt:  HDT---NLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFV-QDRGKRPTPTDQGKGGNNPPLE------IRTILGGPSG

Query:  GESNRKRKAAVREAWIE--------PTEQVTFGEG-------------------------------------------SQTVTRMINFLVVDCIPAYNAI
        G+S  KRK   REA  E        PT  +TFG+                                            +  VT+M  F+V+D   AYNAI
Subjt:  GESNRKRKAAVREAWIE--------PTEQVTFGEG-------------------------------------------SQTVTRMINFLVVDCIPAYNAI

Query:  LGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE
         GRP +H  +AV ST HQVLK+ T + VG V GE
Subjt:  LGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE

XP_023916366.1 uncharacterized protein LOC112027956 [Quercus suber]3.2e-4132.58Show/hide
Query:  VEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRVEDSGRERHGHPSANGRGRAEI
        V+  DD + LAA  +G+  +  ++ + E  P+T  E +  AQ F++AE+ + +K+ +R  R V      + ++G R +    GR +            + 
Subjt:  VEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRVEDSGRERHGHPSANGRGRAEI

Query:  KDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDRGKRPTPTDQ
        KD   +A P A+  +YTPL +PLEQVL  I D   LK PE+MR DP++RNRSKYC FH DHGH T EC  L+ +IE LIR+G LK F+    K      +
Subjt:  KDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDRGKRPTPTDQ

Query:  GKGGNNPPL-EIRTILGGPSGGESNRKRKAAVREAW----------------------------------------------------------------
         +  + PPL EIR I+GG S G+S+  +KA ++E                                                                  
Subjt:  GKGGNNPPL-EIRTILGGPSGGESNRKRKAAVREAW----------------------------------------------------------------

Query:  ---------------------------IEPTEQVTF----GEGSQTVTRMINFLVVDCIPAYNAILGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE
                                   ++P   VT     G   + +T  +NFLVVDC  +YNAI+GRPTL+  KAV STYH  +KFPTE GVGEV G+
Subjt:  ---------------------------IEPTEQVTF----GEGSQTVTRMINFLVVDCIPAYNAILGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]8.4e-4230.22Show/hide
Query:  KVEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRV---EDSGRERHGHPSANGRG
        +V+GYDDG+AL+ ++ GL+  +L  S+ +  P +Y E + RA+++ +AEE  K++ +E   +G S   + K+D  + RRV   + S R     P      
Subjt:  KVEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRV---EDSGRERHGHPSANGRG

Query:  RAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQD-----R
        R+E++D   R    ++F  +T L  P EQ+L  + +  L + P  M+++P RRN +KYC FH DHGH T EC +L+++IE+L+R+G L+E+V++     +
Subjt:  RAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQD-----R

Query:  GKRPTPTDQGKG-----GNNPPLEIRTILGGPSGGESNRKRKAAVREAWIEP-------TEQ--------------------------------------
         ++P  + + KG      +    ++  I GGP+ G+S + RK   R+A  EP       T Q                                      
Subjt:  GKRPTPTDQGKG-----GNNPPLEIRTILGGPSGGESNRKRKAAVREAWIEP-------TEQ--------------------------------------

Query:  -------------------------------------------------------VTFGEGSQTVTRMINFLVVDCIPAYNAILGRPTLHGLKAVASTYH
                                                               V+FG+    VT M+NF+VVD   +YNA+LGRPTL+ LKA  S YH
Subjt:  -------------------------------------------------------VTFGEGSQTVTRMINFLVVDCIPAYNAILGRPTLHGLKAVASTYH

Query:  QVLKFPTEDGVGEVHGE
          LKFPTE GVG V GE
Subjt:  QVLKFPTEDGVGEVHGE

XP_030963320.1 uncharacterized protein LOC115984434 [Quercus lobata]1.7e-4235.96Show/hide
Query:  VEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHKED--RGKRRRVEDSGRERHGHPSANGRGRA
        V+  DD + LAA  +G+  +  ++ + E +P+T  E +  AQ F++AE+ + +K+ +R  +  +   RH E   R K+ R ED                 
Subjt:  VEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHKED--RGKRRRVEDSGRERHGHPSANGRGRA

Query:  EIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDRGKRPTPT
          KD   +A P A+  +YTPL +P +QVL  I D   LK  E+M+ DP++ NR+KYC FH DHGH T EC  L+ +IE LIR+G L+ F+    K     
Subjt:  EIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDRGKRPTPT

Query:  DQGKGGNNPPL-EIRTILGGPSGGESNRK-------------------------RKAAVREAW-------IEPTEQVTF----GEGSQTVTRMINFLVVD
         + +  + PPL EIR I+GG S  +S+R+                         R   V           ++P   VT     G   Q VT+ ++FLVVD
Subjt:  DQGKGGNNPPL-EIRTILGGPSGGESNRK-------------------------RKAAVREAW-------IEPTEQVTF----GEGSQTVTRMINFLVVD

Query:  CIPAYNAILGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE
        C  +YNAI+GRPTL+  K V STYH  +KFPTE GVG+V G+
Subjt:  CIPAYNAILGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE

TrEMBL top hitse value%identityAlignment
A0A2N9H694 Ribonuclease H3.8e-4034.48Show/hide
Query:  VEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELL--------KSKQEERESRGVSVYD-RHKEDRGKRRRVEDSGRERHGHPS
        V+G DD V L A ISGLQ    L S+ +  P T  E M  AQR ++ EE L        K ++ E   R    +D R K  R + RR ED          
Subjt:  VEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELL--------KSKQEERESRGVSVYD-RHKEDRGKRRRVEDSGRERHGHPS

Query:  ANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDR
         NGRG  E            +FN +TPL  P++ +   I +   LK P ++ +DPD+R R KYC FH DHGH T +C  L+ +IE LI++G L+ FV +R
Subjt:  ANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDR

Query:  GKRPTPTDQGKGGNNPPL----------EIRTILGG-PSGGESNRKRKAAVRE-----------------------------------AWIEPTE-----
        G+R     QG     PP+          EI  I GG  +GG S   RKA  R+                                   A + P E     
Subjt:  GKRPTPTDQGKGGNNPPL----------EIRTILGG-PSGGESNRKRKAAVRE-----------------------------------AWIEPTE-----

Query:  --------------QVTFGEGSQTVTRMINFLVVDCIPAYNAILGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE
                      Q+  G   +  T+ ++FLVVDC  AYN I+GRPTL+ L+AV STYH +++FPTE+G+GE+ G+
Subjt:  --------------QVTFGEGSQTVTRMINFLVVDCIPAYNAILGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE

A0A2N9J1E3 Ribonuclease H3.8e-4034.48Show/hide
Query:  VEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELL--------KSKQEERESRGVSVYD-RHKEDRGKRRRVEDSGRERHGHPS
        V+G DD V L A ISGLQ    L S+ +  P T  E M  AQR ++ EE L        K ++ E   R    +D R K  R + RR ED          
Subjt:  VEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELL--------KSKQEERESRGVSVYD-RHKEDRGKRRRVEDSGRERHGHPS

Query:  ANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDR
         NGRG  E            +FN +TPL  P++ +   I +   LK P ++ +DPD+R R KYC FH DHGH T +C  L+ +IE LI++G L+ FV +R
Subjt:  ANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDR

Query:  GKRPTPTDQGKGGNNPPL----------EIRTILGG-PSGGESNRKRKAAVRE-----------------------------------AWIEPTE-----
        G+R     QG     PP+          EI  I GG  +GG S   RKA  R+                                   A + P E     
Subjt:  GKRPTPTDQGKGGNNPPL----------EIRTILGG-PSGGESNRKRKAAVRE-----------------------------------AWIEPTE-----

Query:  --------------QVTFGEGSQTVTRMINFLVVDCIPAYNAILGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE
                      Q+  G   +  T+ ++FLVVDC  AYN I+GRPTL+ L+AV STYH +++FPTE+G+GE+ G+
Subjt:  --------------QVTFGEGSQTVTRMINFLVVDCIPAYNAILGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE

A0A6J1CNT2 uncharacterized protein LOC1110128052.4e-4240.34Show/hide
Query:  KEEDSQRDGQRRVEDEDIEELIGQMEPPFTDEIMEGE-------------VPHKFKVEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRF
        + E S++   ++ +  D+EEL+ Q + PFT+EIM  +                K +VEG  D V+L A I G++DE L  S G+  P T+ E ++RAQR+
Subjt:  KEEDSQRDGQRRVEDEDIEELIGQMEPPFTDEIMEGE-------------VPHKFKVEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRF

Query:  ISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRVEDSGRERHGHPSANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRS
        +SA E   SK               +E  GKR    D  RER G      R   E +D   + +P  KF +YT  TVPLEQVL  I +  LLK PERM +
Subjt:  ISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRVEDSGRERHGHPSANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRS

Query:  DPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDRGKRPTPTDQGKGGNNPPLEIRTILGGPSGGESNRKRKAAVREA
           +R++ +YC+FH DHGH T++C  L++E+E LI  GYLKE+V++    P  T  G+   +P  EIRTI+GGP   ES RKRK  VREA
Subjt:  DPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDRGKRPTPTDQGKGGNNPPLEIRTILGGPSGGESNRKRKAAVREA

A0A6J1DZB9 uncharacterized protein LOC1110249046.5e-4031.11Show/hide
Query:  RKRARAVSKAEQGPKQREQKLSQWL--KEEDSQRDGQRRVEDEDIEELIGQMEPPFTDEIMEGEVPHKFKVEGYDDGVALAAVISGLQDERLLNSIGESQ
        R  AR++S   Q  K+   + S W   ++  +     R+ E E + E + + +              + KV    D  A+   ++ L DE L   +GE  
Subjt:  RKRARAVSKAEQGPKQREQKLSQWL--KEEDSQRDGQRRVEDEDIEELIGQMEPPFTDEIMEGEVPHKFKVEGYDDGVALAAVISGLQDERLLNSIGESQ

Query:  PRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRVEDSGRERHGHPSANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAI
        P T+VE + +A++ I  +ELL++K    E +     D+ K  + ++R+ +   R++    SA+      ++  P R+ P   + RYT  T+P+ ++L  I
Subjt:  PRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRVEDSGRERHGHPSANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAI

Query:  HDT---NLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFV-QDRGKRPTPTDQGKGGNNPPLE------IRTILGGPSG
         ++    LLKRPE++R D ++RN+ KYC FH DHGH T  C +L+ +IE LI++GY K+FV + R       ++ K    PP        I TI GGP+G
Subjt:  HDT---NLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFV-QDRGKRPTPTDQGKGGNNPPLE------IRTILGGPSG

Query:  GESNRKRKAAVREAWIE--------PTEQVTFGEG-------------------------------------------SQTVTRMINFLVVDCIPAYNAI
        G+S  KRK   REA  E        PT  +TFG+                                            +  VT+M  F+V+D   AYNAI
Subjt:  GESNRKRKAAVREAWIE--------PTEQVTFGEG-------------------------------------------SQTVTRMINFLVVDCIPAYNAI

Query:  LGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE
         GRP +H  +AV ST HQVLK+ T + VG V GE
Subjt:  LGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE

A0A7J0DSP8 Integrase catalytic domain-containing protein8.5e-4034.46Show/hide
Query:  EDEDIEELIGQMEPPFTDEIMEGEVPHKFKVEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHK
        E+E ++E + +    F   I+E E P         D V + A++ GL+   L +S+ ++ P T     ++A ++I+AEEL ++K+  R        D HK
Subjt:  EDEDIEELIGQMEPPFTDEIMEGEVPHKFKVEGYDDGVALAAVISGLQDERLLNSIGESQPRTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHK

Query:  EDRGKRRRVEDSGRERHGHPSANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQ
              RR +     R+  P    R      +   R  P+       PL  P+ QVL+ I     +K P ++++DP +RNR+KYC FH DHGH T +C Q
Subjt:  EDRGKRRRVEDSGRERHGHPSANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPERMRSDPDRRNRSKYCMFHGDHGHTTRECIQ

Query:  LRDEIETLIREGYLKEFVQDRGKRPTPTDQGKGGNNPPL-EIRTILGG-PSGGESNRKRKAAVREAWIEPTEQV--------------TFGEGSQTVTRM
        L+++I  LI+ GYL++++ DR   P   ++  G N P    I+TI GG  SGG S   RK   R A     E+V              TFG        +
Subjt:  LRDEIETLIREGYLKEFVQDRGKRPTPTDQGKGGNNPPL-EIRTILGG-PSGGESNRKRKAAVREAWIEPTEQV--------------TFGEGSQTVTRM

Query:  -----INFLVVDCIPAYNAILGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE
             ++F+VVDC   YNAILGRPTL G+KA+ STYH  +KFP+  G+GEV G+
Subjt:  -----INFLVVDCIPAYNAILGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACACCAAGATCAACCGATGGCTGACGAACCGAATCCCCAAGTTCAACTCCAGGCCCAGGAAATCGAGATCGCAGCAATTAAGGGGAGGATGAACGAGATGGGGCA
GAATCTGACTGAAATCCTCAGTCTGTTGAAAAAGCCCGAGTCTGTAAGACCCGAGGAAGAGCATCTGCGCAGAGATCCCAAAAAGGGCAAGGAAATAGCAGACGAGGGGG
TAGGGGATTCGGAAAGCGTGACGAGTCGAATGTACCATCCGGGGAATGATCGGGTCCGGAGAGAAGCAGGACCGAGCCGTCAGAAGCTCCGTGGGAATTTGCCATCAAAG
TCAATGTCGGGCTTGTATGCAGAGAATGATGGCAGGAAAAGGGCCCGAGCGGTGTCCAAGGCCGAGCAGGGCCCGAAGCAACGAGAGCAGAAGCTGTCCCAGTGGTTGAA
AGAAGAAGACAGCCAACGCGACGGCCAGAGGAGAGTTGAGGATGAAGACATAGAGGAGCTGATCGGACAGATGGAGCCGCCCTTCACCGACGAGATAATGGAAGGGGAAG
TACCCCACAAATTCAAGGTGGAGGGGTACGATGATGGTGTCGCCCTAGCAGCCGTTATCTCAGGATTACAGGATGAGAGGTTGCTGAATTCAATTGGAGAAAGCCAGCCG
CGGACATACGTGGAGTTCATGACTCGGGCACAAAGGTTTATAAGCGCCGAAGAACTATTGAAATCTAAACAGGAAGAAAGGGAGAGTCGGGGAGTTTCGGTGTACGACCG
ACATAAAGAGGACAGAGGAAAAAGGCGTCGGGTAGAGGACAGCGGCCGGGAGCGACATGGACACCCCTCGGCCAATGGCCGAGGCCGAGCAGAAATCAAAGATTTGCCAG
GCAGGGCTGAGCCGAAAGCCAAGTTCAACAGGTATACGCCACTAACAGTTCCGCTGGAACAGGTGTTAGCCGCAATACATGACACAAATCTGTTGAAGCGCCCAGAAAGG
ATGAGGTCGGATCCAGACAGGAGAAATAGAAGTAAATATTGCATGTTCCATGGGGACCACGGCCACACGACCCGAGAATGCATACAGTTGAGGGATGAAATAGAAACCCT
AATTCGTGAAGGCTATCTCAAGGAATTCGTACAAGACAGAGGGAAAAGGCCAACACCAACAGATCAAGGTAAAGGAGGCAACAACCCCCCACTGGAGATAAGGACCATCC
TTGGAGGACCCTCGGGGGGGGAGTCAAATAGAAAGCGGAAAGCGGCAGTTCGAGAGGCTTGGATAGAGCCAACAGAGCAAGTGACCTTCGGGGAAGGGTCACAGACGGTA
ACTCGAATGATCAACTTCCTGGTGGTGGACTGCATTCCAGCATATAATGCAATTCTGGGACGACCAACTTTGCATGGACTTAAGGCCGTAGCCTCCACATATCATCAAGT
CCTGAAATTTCCGACTGAAGATGGTGTCGGGGAAGTGCACGGGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAACACCAAGATCAACCGATGGCTGACGAACCGAATCCCCAAGTTCAACTCCAGGCCCAGGAAATCGAGATCGCAGCAATTAAGGGGAGGATGAACGAGATGGGGCA
GAATCTGACTGAAATCCTCAGTCTGTTGAAAAAGCCCGAGTCTGTAAGACCCGAGGAAGAGCATCTGCGCAGAGATCCCAAAAAGGGCAAGGAAATAGCAGACGAGGGGG
TAGGGGATTCGGAAAGCGTGACGAGTCGAATGTACCATCCGGGGAATGATCGGGTCCGGAGAGAAGCAGGACCGAGCCGTCAGAAGCTCCGTGGGAATTTGCCATCAAAG
TCAATGTCGGGCTTGTATGCAGAGAATGATGGCAGGAAAAGGGCCCGAGCGGTGTCCAAGGCCGAGCAGGGCCCGAAGCAACGAGAGCAGAAGCTGTCCCAGTGGTTGAA
AGAAGAAGACAGCCAACGCGACGGCCAGAGGAGAGTTGAGGATGAAGACATAGAGGAGCTGATCGGACAGATGGAGCCGCCCTTCACCGACGAGATAATGGAAGGGGAAG
TACCCCACAAATTCAAGGTGGAGGGGTACGATGATGGTGTCGCCCTAGCAGCCGTTATCTCAGGATTACAGGATGAGAGGTTGCTGAATTCAATTGGAGAAAGCCAGCCG
CGGACATACGTGGAGTTCATGACTCGGGCACAAAGGTTTATAAGCGCCGAAGAACTATTGAAATCTAAACAGGAAGAAAGGGAGAGTCGGGGAGTTTCGGTGTACGACCG
ACATAAAGAGGACAGAGGAAAAAGGCGTCGGGTAGAGGACAGCGGCCGGGAGCGACATGGACACCCCTCGGCCAATGGCCGAGGCCGAGCAGAAATCAAAGATTTGCCAG
GCAGGGCTGAGCCGAAAGCCAAGTTCAACAGGTATACGCCACTAACAGTTCCGCTGGAACAGGTGTTAGCCGCAATACATGACACAAATCTGTTGAAGCGCCCAGAAAGG
ATGAGGTCGGATCCAGACAGGAGAAATAGAAGTAAATATTGCATGTTCCATGGGGACCACGGCCACACGACCCGAGAATGCATACAGTTGAGGGATGAAATAGAAACCCT
AATTCGTGAAGGCTATCTCAAGGAATTCGTACAAGACAGAGGGAAAAGGCCAACACCAACAGATCAAGGTAAAGGAGGCAACAACCCCCCACTGGAGATAAGGACCATCC
TTGGAGGACCCTCGGGGGGGGAGTCAAATAGAAAGCGGAAAGCGGCAGTTCGAGAGGCTTGGATAGAGCCAACAGAGCAAGTGACCTTCGGGGAAGGGTCACAGACGGTA
ACTCGAATGATCAACTTCCTGGTGGTGGACTGCATTCCAGCATATAATGCAATTCTGGGACGACCAACTTTGCATGGACTTAAGGCCGTAGCCTCCACATATCATCAAGT
CCTGAAATTTCCGACTGAAGATGGTGTCGGGGAAGTGCACGGGGAATAG
Protein sequenceShow/hide protein sequence
MEHQDQPMADEPNPQVQLQAQEIEIAAIKGRMNEMGQNLTEILSLLKKPESVRPEEEHLRRDPKKGKEIADEGVGDSESVTSRMYHPGNDRVRREAGPSRQKLRGNLPSK
SMSGLYAENDGRKRARAVSKAEQGPKQREQKLSQWLKEEDSQRDGQRRVEDEDIEELIGQMEPPFTDEIMEGEVPHKFKVEGYDDGVALAAVISGLQDERLLNSIGESQP
RTYVEFMTRAQRFISAEELLKSKQEERESRGVSVYDRHKEDRGKRRRVEDSGRERHGHPSANGRGRAEIKDLPGRAEPKAKFNRYTPLTVPLEQVLAAIHDTNLLKRPER
MRSDPDRRNRSKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVQDRGKRPTPTDQGKGGNNPPLEIRTILGGPSGGESNRKRKAAVREAWIEPTEQVTFGEGSQTV
TRMINFLVVDCIPAYNAILGRPTLHGLKAVASTYHQVLKFPTEDGVGEVHGE