CuGenDBv2

Spg020643 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg020643
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Ribonuclease H
Genome location: scaffold9:15803793..15808165
RNA-Seq Expression: Spg020643
Synteny: Spg020643
Gene Ontology terms: NA
InterPro domains: NA
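
The genome location above is written as `scaffold:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page), the locus span can be derived as follows. The helper name `parse_location` is hypothetical.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold9:15803793..15808165")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(scaffold, span)  # scaffold9 4373
```

Under that assumption the locus covers 4373 bp on scaffold9.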


Homology
GenBank top hits
XP_022142767.1 uncharacterized protein LOC111012805 [Momordica charantia]; e value: 2.2e-33; % identity: 36.17
Query:  GPSCKRIRRSSPQKLGSGKHVEYNDRRKSEARTGPRAEQDQRGRERELSRWLKEEDSYWDSQRRTENEDIE---------GNEVLKVEGYDDGVALTVVI
        G S    R S P  L  GK     DR +S  +       +Q+ +  +L   L + DS +  +   E    E           E L+VEG  D V+L   I
Subjt:  GPSCKRIRRSSPQKLGSGKHVEYNDRRKSEARTGPRAEQDQRGRERELSRWLKEEDSYWDSQRRTENEDIE---------GNEVLKVEGYDDGVALTVVI

Query:  SGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRGRAEAKDLRGRAEPKAKFD
         G++DE L    GK  P  + E +++AQRY+SA E   SK+E         SN  R D+ + R  D     R E+     R R+  KD      P  KF+
Subjt:  SGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRGRAEAKDLRGRAEPKAKFD

Query:  RYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKPAQTESGKGGNNPPLEIRT
        +YT  T  LEQVL  I++  LLK PER+ +   +R++ +YC+FH DHGH T++C  L++E+E LI  GYLKE+V     ++P  T++G+   +P  EIRT
Subjt:  RYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKPAQTESGKGGNNPPLEIRT

Query:  ILGGPTGGESSRKRKAAIREAHMEPGEQE
        I+GGP   ES RKRK  +REA     + E
Subjt:  ILGGPTGGESSRKRKAAIREAHMEPGEQE

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]; e value: 5.7e-29; % identity: 35.97
Query:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSK--QEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNG
        E LKV    D  A+   ++ L DE L   +G+  P  +VE + +A++ I  +ELL++K  + E++     LS        + R+AD +SR   ++ S++ 
Subjt:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSK--QEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNG

Query:  RGRAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHD
          R E + L         ++RYT  T  + ++L  I+++    LLKRPE+LR D ++RN+ KYC FH DHGH T  C +L+ +IE LI++GY K+FVG  
Subjt:  RGRAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHD

Query:  MGKKPAQTESGKGGNNPPLE------IRTILGGPTGGESSRKRKAAIREAHME
              + E  K    PP        I TI GGP GG+S  KRK   REA  E
Subjt:  MGKKPAQTESGKGGNNPPLE------IRTILGGPTGGESSRKRKAAIREAHME

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]; e value: 3.9e-38; % identity: 40.32
Query:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRG
        E L+VEG  D V+L   +SG++DE L    GK  P  + E +++AQRY+SA E   SK+E                   G++ D     + E+S    +G
Subjt:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRG

Query:  -RAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKK
         R E +D   + +P  KF++YTP T  +EQVL  I+D  LLK PER+++   +R++ +YC+FH DHGH T++C  L++E+E LIR GYLKE+V     ++
Subjt:  -RAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKK

Query:  PAQTESGKGGNNPPLEIRTILGGPTGGESSRKRKAAIREAHMEPGEQE
        P  T++G+   +P  EIRTI+GGP   ES RKRKA +REA     + E
Subjt:  PAQTESGKGGNNPPLEIRTILGGPTGGESSRKRKAAIREAHMEPGEQE

XP_022159192.1 uncharacterized protein LOC111025612 [Momordica charantia]; e value: 8.0e-31; % identity: 38.94
Query:  ISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRGRAEAKDLRGRAEPKAKF
        +SG++DE L    GK  P  + E +++AQ+Y+SA+E    K+E                  +G++AD + R R  +   + +   + + + G+ +   KF
Subjt:  ISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRGRAEAKDLRGRAEPKAKF

Query:  DRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKPAQTESGKGGNNPPLEIR
        ++YTP T  LEQVL  I+D  LLK PE +++ P++R++ +YC+FH DHGH T +C  L++E+E LIR+GYLKE+V  D+   P +    K   +P  EIR
Subjt:  DRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKPAQTESGKGGNNPPLEIR

Query:  TILGGPTGGESSRKRKAAIREAHMEP
        TI+GG T  ES RKRKA++REA   P
Subjt:  TILGGPTGGESSRKRKAAIREAHMEP

XP_022159368.1 uncharacterized protein LOC111025785 [Momordica charantia]; e value: 1.0e-33; % identity: 39.68
Query:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRG
        E L++E   D V+L   +SG++DE L    GK  P  + E +++AQRY+SA E   SK+E          +  R D+ + R  D     R E+     R 
Subjt:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRG

Query:  RAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKP
        R+  KD      P  KF++YTP T  LEQVL  I+D  LLK PER++    +R++ +YC+FH DH H T++   L++E+E LIR GYL+E+V     ++P
Subjt:  RAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKP

Query:  AQTESGKGGNNPPLEIRTILGGPTGGESSRKRKAAIREAHMEPGEQE
          T++G+   +P  EIRTI+GGP   ES+RKRKA +REA     + E
Subjt:  AQTESGKGGNNPPLEIRTILGGPTGGESSRKRKAAIREAHMEPGEQE

TrEMBL top hits
A0A6J1CNT2 uncharacterized protein LOC111012805; e value: 1.1e-33; % identity: 36.17
Query:  GPSCKRIRRSSPQKLGSGKHVEYNDRRKSEARTGPRAEQDQRGRERELSRWLKEEDSYWDSQRRTENEDIE---------GNEVLKVEGYDDGVALTVVI
        G S    R S P  L  GK     DR +S  +       +Q+ +  +L   L + DS +  +   E    E           E L+VEG  D V+L   I
Subjt:  GPSCKRIRRSSPQKLGSGKHVEYNDRRKSEARTGPRAEQDQRGRERELSRWLKEEDSYWDSQRRTENEDIE---------GNEVLKVEGYDDGVALTVVI

Query:  SGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRGRAEAKDLRGRAEPKAKFD
         G++DE L    GK  P  + E +++AQRY+SA E   SK+E         SN  R D+ + R  D     R E+     R R+  KD      P  KF+
Subjt:  SGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRGRAEAKDLRGRAEPKAKFD

Query:  RYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKPAQTESGKGGNNPPLEIRT
        +YT  T  LEQVL  I++  LLK PER+ +   +R++ +YC+FH DHGH T++C  L++E+E LI  GYLKE+V     ++P  T++G+   +P  EIRT
Subjt:  RYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKPAQTESGKGGNNPPLEIRT

Query:  ILGGPTGGESSRKRKAAIREAHMEPGEQE
        I+GGP   ES RKRK  +REA     + E
Subjt:  ILGGPTGGESSRKRKAAIREAHMEPGEQE

A0A6J1DWY0 uncharacterized protein LOC111025293; e value: 1.9e-38; % identity: 40.32
Query:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRG
        E L+VEG  D V+L   +SG++DE L    GK  P  + E +++AQRY+SA E   SK+E                   G++ D     + E+S    +G
Subjt:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRG

Query:  -RAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKK
         R E +D   + +P  KF++YTP T  +EQVL  I+D  LLK PER+++   +R++ +YC+FH DHGH T++C  L++E+E LIR GYLKE+V     ++
Subjt:  -RAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKK

Query:  PAQTESGKGGNNPPLEIRTILGGPTGGESSRKRKAAIREAHMEPGEQE
        P  T++G+   +P  EIRTI+GGP   ES RKRKA +REA     + E
Subjt:  PAQTESGKGGNNPPLEIRTILGGPTGGESSRKRKAAIREAHMEPGEQE

A0A6J1DYL6 uncharacterized protein LOC111025785; e value: 4.9e-34; % identity: 39.68
Query:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRG
        E L++E   D V+L   +SG++DE L    GK  P  + E +++AQRY+SA E   SK+E          +  R D+ + R  D     R E+     R 
Subjt:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRG

Query:  RAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKP
        R+  KD      P  KF++YTP T  LEQVL  I+D  LLK PER++    +R++ +YC+FH DH H T++   L++E+E LIR GYL+E+V     ++P
Subjt:  RAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKP

Query:  AQTESGKGGNNPPLEIRTILGGPTGGESSRKRKAAIREAHMEPGEQE
          T++G+   +P  EIRTI+GGP   ES+RKRKA +REA     + E
Subjt:  AQTESGKGGNNPPLEIRTILGGPTGGESSRKRKAAIREAHMEPGEQE

A0A6J1DZ52 uncharacterized protein LOC111025612; e value: 3.9e-31; % identity: 38.94
Query:  ISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRGRAEAKDLRGRAEPKAKF
        +SG++DE L    GK  P  + E +++AQ+Y+SA+E    K+E                  +G++AD + R R  +   + +   + + + G+ +   KF
Subjt:  ISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRGRAEAKDLRGRAEPKAKF

Query:  DRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKPAQTESGKGGNNPPLEIR
        ++YTP T  LEQVL  I+D  LLK PE +++ P++R++ +YC+FH DHGH T +C  L++E+E LIR+GYLKE+V  D+   P +    K   +P  EIR
Subjt:  DRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKPAQTESGKGGNNPPLEIR

Query:  TILGGPTGGESSRKRKAAIREAHMEP
        TI+GG T  ES RKRKA++REA   P
Subjt:  TILGGPTGGESSRKRKAAIREAHMEP

A0A6J1DZB9 uncharacterized protein LOC111024904; e value: 2.8e-29; % identity: 35.97
Query:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSK--QEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNG
        E LKV    D  A+   ++ L DE L   +G+  P  +VE + +A++ I  +ELL++K  + E++     LS        + R+AD +SR   ++ S++ 
Subjt:  EVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSK--QEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNG

Query:  RGRAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHD
          R E + L         ++RYT  T  + ++L  I+++    LLKRPE+LR D ++RN+ KYC FH DHGH T  C +L+ +IE LI++GY K+FVG  
Subjt:  RGRAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHD

Query:  MGKKPAQTESGKGGNNPPLE------IRTILGGPTGGESSRKRKAAIREAHME
              + E  K    PP        I TI GGP GG+S  KRK   REA  E
Subjt:  MGKKPAQTESGKGGNNPPLE------IRTILGGPTGGESSRKRKAAIREAHME

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences

CDS sequence
ATGAGTCGGCCTCGGCCCATTTCCGAGGCTGACCAGGGCCAAAGGCCCGGAGATCATTTGAAGTCCGGCATCGACGAAAACCCTAAGGGGAAATCTATAAAAAGGGAGAC
CGCACACGCATTCAGGTCACGTTTCTTCCCGCTCATCTACAAATTCACTGTTGACTGTCACGTGAAGCGAGGGCATCAAGACCAGCCAGCGACCGACGAAGTTCCGCTCC
AAGCCCAAGAAACCGAGATTGCAGCGATCAAAGGAAGGATGAACGAGATGGGGCAGAACTTGACGGAGATCCTTAGTCTATTAAGGAGGCCCGAGTCGGTAAGGCGCGAG
GAAGAGCACGTGCGAAGAGACCCCAAGAAGGGTAAAGGGATAGCGGACGAAGAAGTAGGGGACTCAGAAAGTGTAACTAGCCGAGTACACCGTCCAGAGGATGGTGAAAC
TCGAAAAGAGGCTGGACCCAGTTGCAAAAGGATTCGCAGGAGTTCTCCACAGAAACTAGGGTCAGGTAAGCATGTGGAATATAATGACAGGAGAAAGTCGGAGGCTCGGA
CAGGTCCCAGGGCCGAGCAGGACCAGAGGGGGCGAGAGCGGGAGCTGTCCAGGTGGCTGAAAGAGGAGGACAGTTATTGGGACTCCCAAAGAAGAACAGAGAACGAAGAC
ATAGAAGGCAACGAGGTTTTGAAGGTAGAAGGTTATGACGACGGAGTCGCCCTGACTGTAGTGATCTCGGGTTTGCAGGATGAGAGGTTGCTCAACTTGATTGGCAAAAG
CCAACCACGAATGTATGTGGAGTTCATGACCCAAGCACAGAGGTACATCAGTGCCAAGGAGTTGCTCAAGTCCAAGCAGGAAGAAAGAGAGAGTCGAGGAGTTTCTTTAT
CCAACCGGCATCGAGAGGATCGGGCAAAGGGGCGCCAGGCCGATGATAGAAGCCGAGGTAGACATGAGCAATCCTCGACCAATGGTCGAGGCCGAGCAGAAGCCAAGGAT
CTGCGGGGCCGTGCAGAGCCGAAAGCTAAGTTCGACAGGTATACCCCACTAACGGCTTCACTTGAACAGGTTTTGGCCGCGATACAGGATACGAACCTGCTAAAACGTCC
GGAAAGGCTGAGGTCGGACCCAGACAGGAGAAACCGGAACAAGTATTGCATGTTCCATGGAGACCACGGTCACACAACTCGGGAGTGCATACAGCTAAGGGATGAAATAG
AAACCCTAATTCGAGAGGGTTACCTCAAGGAGTTCGTGGGACATGATATGGGGAAGAAGCCAGCGCAGACAGAGTCAGGCAAGGGGGGCAACAACCCACCATTGGAGATA
CGAACTATTCTTGGGGGACCCACCGGGGGAGAATCGAGCAGGAAGCGAAAAGCCGCGATTCGAGAAGCACATATGGAGCCTGGAGAGCAAGAGAGGCTTGGGAATTTCCT
CAGCCTCCCCAAGCTTGCCCCAAAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAG
AGTGTACGCACCCCTGGGATGACTCCAAGACGCTTGAAGAGAGGCCAGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCTTGGGATGACTCC
AAGACGCTTAAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTACCCCAGAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGCTTGAAGAGAGGCCTGGGAA
TTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCC
GAGAAGAGTGTACGCACCTCTGGGATGACTCCAAGACGCTTGAAGAGAGACCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGAT
GACTCCAAGATGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCTCAGGAGAGTGTACACACCCCTGGGATGACTCCAAGATGCTTGAAGAAAGGCC
TTGGAATTTCCTCAGCCTCCCCAAGCGAAATGCTCGACCTCATGCCAAGGCCGAGTGTTGGGCTCCTAGACTTGCGGATGATGGTGTTCTTGGTAATCTCCAATCTCTCT
ATACTACTCGGCTTCCAAGGAAAGTTGGAATGAATTTGACTTGGGCGGCTATCTCCTTTTATAGAGGTGGTAGGAGGCTTGAATTAGGTCAACTTGGACACGTGTAG

mRNA sequence
ATGAGTCGGCCTCGGCCCATTTCCGAGGCTGACCAGGGCCAAAGGCCCGGAGATCATTTGAAGTCCGGCATCGACGAAAACCCTAAGGGGAAATCTATAAAAAGGGAGAC
CGCACACGCATTCAGGTCACGTTTCTTCCCGCTCATCTACAAATTCACTGTTGACTGTCACGTGAAGCGAGGGCATCAAGACCAGCCAGCGACCGACGAAGTTCCGCTCC
AAGCCCAAGAAACCGAGATTGCAGCGATCAAAGGAAGGATGAACGAGATGGGGCAGAACTTGACGGAGATCCTTAGTCTATTAAGGAGGCCCGAGTCGGTAAGGCGCGAG
GAAGAGCACGTGCGAAGAGACCCCAAGAAGGGTAAAGGGATAGCGGACGAAGAAGTAGGGGACTCAGAAAGTGTAACTAGCCGAGTACACCGTCCAGAGGATGGTGAAAC
TCGAAAAGAGGCTGGACCCAGTTGCAAAAGGATTCGCAGGAGTTCTCCACAGAAACTAGGGTCAGGTAAGCATGTGGAATATAATGACAGGAGAAAGTCGGAGGCTCGGA
CAGGTCCCAGGGCCGAGCAGGACCAGAGGGGGCGAGAGCGGGAGCTGTCCAGGTGGCTGAAAGAGGAGGACAGTTATTGGGACTCCCAAAGAAGAACAGAGAACGAAGAC
ATAGAAGGCAACGAGGTTTTGAAGGTAGAAGGTTATGACGACGGAGTCGCCCTGACTGTAGTGATCTCGGGTTTGCAGGATGAGAGGTTGCTCAACTTGATTGGCAAAAG
CCAACCACGAATGTATGTGGAGTTCATGACCCAAGCACAGAGGTACATCAGTGCCAAGGAGTTGCTCAAGTCCAAGCAGGAAGAAAGAGAGAGTCGAGGAGTTTCTTTAT
CCAACCGGCATCGAGAGGATCGGGCAAAGGGGCGCCAGGCCGATGATAGAAGCCGAGGTAGACATGAGCAATCCTCGACCAATGGTCGAGGCCGAGCAGAAGCCAAGGAT
CTGCGGGGCCGTGCAGAGCCGAAAGCTAAGTTCGACAGGTATACCCCACTAACGGCTTCACTTGAACAGGTTTTGGCCGCGATACAGGATACGAACCTGCTAAAACGTCC
GGAAAGGCTGAGGTCGGACCCAGACAGGAGAAACCGGAACAAGTATTGCATGTTCCATGGAGACCACGGTCACACAACTCGGGAGTGCATACAGCTAAGGGATGAAATAG
AAACCCTAATTCGAGAGGGTTACCTCAAGGAGTTCGTGGGACATGATATGGGGAAGAAGCCAGCGCAGACAGAGTCAGGCAAGGGGGGCAACAACCCACCATTGGAGATA
CGAACTATTCTTGGGGGACCCACCGGGGGAGAATCGAGCAGGAAGCGAAAAGCCGCGATTCGAGAAGCACATATGGAGCCTGGAGAGCAAGAGAGGCTTGGGAATTTCCT
CAGCCTCCCCAAGCTTGCCCCAAAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAG
AGTGTACGCACCCCTGGGATGACTCCAAGACGCTTGAAGAGAGGCCAGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCTTGGGATGACTCC
AAGACGCTTAAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTACCCCAGAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGCTTGAAGAGAGGCCTGGGAA
TTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCC
GAGAAGAGTGTACGCACCTCTGGGATGACTCCAAGACGCTTGAAGAGAGACCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGAT
GACTCCAAGATGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCTCAGGAGAGTGTACACACCCCTGGGATGACTCCAAGATGCTTGAAGAAAGGCC
TTGGAATTTCCTCAGCCTCCCCAAGCGAAATGCTCGACCTCATGCCAAGGCCGAGTGTTGGGCTCCTAGACTTGCGGATGATGGTGTTCTTGGTAATCTCCAATCTCTCT
ATACTACTCGGCTTCCAAGGAAAGTTGGAATGAATTTGACTTGGGCGGCTATCTCCTTTTATAGAGGTGGTAGGAGGCTTGAATTAGGTCAACTTGGACACGTGTAG

Protein sequence
MSRPRPISEADQGQRPGDHLKSGIDENPKGKSIKRETAHAFRSRFFPLIYKFTVDCHVKRGHQDQPATDEVPLQAQETEIAAIKGRMNEMGQNLTEILSLLRRPESVRRE
EEHVRRDPKKGKGIADEEVGDSESVTSRVHRPEDGETRKEAGPSCKRIRRSSPQKLGSGKHVEYNDRRKSEARTGPRAEQDQRGRERELSRWLKEEDSYWDSQRRTENED
IEGNEVLKVEGYDDGVALTVVISGLQDERLLNLIGKSQPRMYVEFMTQAQRYISAKELLKSKQEERESRGVSLSNRHREDRAKGRQADDRSRGRHEQSSTNGRGRAEAKD
LRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDMGKKPAQTESGKGGNNPPLEI
RTILGGPTGGESSRKRKAAIREAHMEPGEQERLGNFLSLPKLAPKECTHPWDDSKTLEERPGNFLSLPKLAPEECTHPWDDSKTLEERPGNFLSLPKLAPEECTHPWDDS
KTLKERPGNFLSLPKLTPEECTHPWDDSKTLEERPGNFLSLPKLAPEECTHPWDDSKTLEERPGNFLSLPKLAREECTHLWDDSKTLEERPGNFLSLPKLAPEECTHPWD
DSKMLEERPGNFLSLPKLASGECTHPWDDSKMLEERPWNFLSLPKRNARPHAKAECWAPRLADDGVLGNLQSLYTTRLPRKVGMNLTWAAISFYRGGRRLELGQLGHV
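
The protein sequence above should correspond to a translation of the CDS. The following is a minimal sketch for checking that, assuming the standard genetic code (the page does not state which translation table was used). The helper `translate` and the constant names are hypothetical; the two test strings are the first 18 and the last 9 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G codon order.
# Assumption: this record uses the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons with ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


print(translate("ATGAGTCGGCCTCGGCCC"))  # MSRPRP
print(translate("CACGTGTAG"))           # HV*
```

Both fragments agree with the start (`MSRPRP`) and end (`HV`, followed by a stop codon) of the protein sequence shown above. To check the full record, join the CDS lines into one string before calling `translate`.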