; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036209 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036209
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H
Genome locationscaffold5:41967209..41969434
RNA-Seq ExpressionSpg036209
SyntenySpg036209
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFZ03258.1 hypothetical protein Acr_15g0018660 [Actinidia rufa]8.1e-4036.82Show/hide
Query:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR
        GL+   L +S+ ++ P+T     ++A +YI+AEEL ++K      RG    + H+      RRA+ R   R+++     R    +   R R  P+     
Subjt:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR

Query:  YTSLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPL-EIRT
           L   + QVL+ I+    +K P ++++DP +RNRNKYC F+ DHGH T +  QL+++I  LI+ GYL++++    R  P   E   G N P    I+T
Subjt:  YTSLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPL-EIRT

Query:  ILGG-PTGGESSRKRKAAIREAHMEPGEQAVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMALK
        I GG  +GG S+  RK   R A     E+  T    F+VVDC   YN+ILG+PTL  +KA+ STYH  +KFPT  G+GEV+G+Q+ +R+C+  A+K
Subjt:  ILGG-PTGGESSRKRKAAIREAHMEPGEQAVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMALK

XP_022144467.1 uncharacterized protein LOC111014147 [Momordica charantia]1.5e-3838.1Show/hide
Query:  IQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLEIRTILGGPTGGESSRKRK
        I+D  LLK PER+++   +R++++YC+FH  HGH T++C  L++E+E LIR GYLKE+V     + P  T+ G+   +P  EIRTI+GGP   ES RKRK
Subjt:  IQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLEIRTILGGPTGGESSRKRK

Query:  AAIREAH----------------------------------MEPGE----------------------------------QAVTRMISFLVVDCVPAYNA
        A +REA                                   M+ GE                                  ++VT+M+  LVV+   +YNA
Subjt:  AAIREAH----------------------------------MEPGE----------------------------------QAVTRMISFLVVDCVPAYNA

Query:  ILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMALKNIDK
        ILGRPT+H L+A+ STYHQ +KFPT  GVGE++GEQR SRECY+ ++++ D+
Subjt:  ILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMALKNIDK

XP_022158257.1 uncharacterized protein LOC111024791 [Momordica charantia]3.1e-3938.36Show/hide
Query:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR
        GL DE L   +GE    T+ E + +A++ I  +ELL+ K    E +     D+ +  + K RRAE +S+   ++ S++  GRAE             ++R
Subjt:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR

Query:  YTSLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLE-
        YT  T ++ ++L  I++T    LLKRP++LR D ++RN++KYC FH DHGH T  C +L+ +IE LI++G  K+FVG  R     + E  K    PP + 
Subjt:  YTSLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLE-

Query:  -----IRTILGGPTGGESSRKRKAAIREAHMEPGEQAVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQR
             I TI GG +GG+S  KRK   RE+  +        M  F+V+D   AYNAI GRP +H  +AV ST HQVLK+ T  GVG VR  ++
Subjt:  -----IRTILGGPTGGESSRKRKAAIREAHMEPGEQAVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQR

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.4e-4436.15Show/hide
Query:  LQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDRY
        L DE L   +GE  P T+VE + +A++ I  +ELL++K    E +     D+ +  + K R+A+ +SR +   SSA+   R E + L         ++RY
Subjt:  LQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDRY

Query:  TSLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLE--
        TS T  + ++L  I+++    LLKRPE+LR D ++RN+ KYC FH DHGH T  C +L+ +IE LI++GY K+FVG  R     + E  K    PP    
Subjt:  TSLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLE--

Query:  ----IRTILGGPTGGESSRKRKAAIREAHMEP---------------------------------------------------------GEQA--VTRMI
            I TI GGP GG+S  KRK   REA  E                                                          G+ A  VT+M 
Subjt:  ----IRTILGGPTGGESSRKRKAAIREAHMEP---------------------------------------------------------GEQA--VTRMI

Query:  SFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMALKNIDKKAKAAPTSGNGRGRTSEG-LDHPME
         F+V+D   AYNAI GRP +H  +AV ST HQVLK+ T   VG VRGEQ+TSRECY  ALK     A  A      RG+  E   D P E
Subjt:  SFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMALKNIDKKAKAAPTSGNGRGRTSEG-LDHPME

XP_030963320.1 uncharacterized protein LOC115984434 [Quercus lobata]2.8e-4034.32Show/hide
Query:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGR-AEPKAKFD
        G+  +  ++ + E +P+T  E +  AQ +++AE+ + +K+ +R  +  ++  RH E   +G R +              +GR E K+ R R A P A+  
Subjt:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGR-AEPKAKFD

Query:  RYTSLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLEIRT
        +YT L    +QVL  I+D   LK  E+++ DP++ NRNKYC FH DHGH T EC  L+ +IE LIR+G L+ F+G D +    + +  +    P  EIR 
Subjt:  RYTSLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLEIRT

Query:  ILGGPTGGESSRK-----------------RKAAIREAHMEPGE---------------------------QAVTRMISFLVVDCVPAYNAILGRPTLHG
        I+GG +  +SSR+                 ++  +    + P                             Q VT+ +SFLVVDC  +YNAI+GRPTL+ 
Subjt:  ILGGPTGGESSRK-----------------RKAAIREAHMEPGE---------------------------QAVTRMISFLVVDCVPAYNAILGRPTLHG

Query:  LKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAL
         K V STYH  +KFPTE GVG+V+G+Q  +RECY   L
Subjt:  LKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAL

TrEMBL top hitse value%identityAlignment
A0A2N9F5L1 RNase H domain-containing protein3.9e-4033.06Show/hide
Query:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR
        GL+    L  + +  P T  E M +A ++++AE+ L++  +    R   + DR                 + E +        E  E +    P  KF  
Subjt:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR

Query:  YTSLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPA-------QTEPGKGGNNP
        +T L   ++++L  IQD   L+ P ++RSDP+ R +N YC FH DHGH T++C+ L +++ETLIR+G L+++V      RPA       Q EP +    P
Subjt:  YTSLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPA-------QTEPGKGGNNP

Query:  PLEIRTILGGP-TGGESSRKRKAAIREAH-----------------------------MEPGEQA------------------------------VTRMI
          EIRTI+GGP +GG S   R+A  R+AH                              +P + A                              V++ +
Subjt:  PLEIRTILGGP-TGGESSRKRKAAIREAH-----------------------------MEPGEQA------------------------------VTRMI

Query:  SFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAL
         FLVV+C  AYNAI+GRPTL+ L+AV STYH +LKFPTE G+GEVRG+Q  +RECY ++L
Subjt:  SFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAL

A0A2N9GRM8 Uncharacterized protein7.9e-4137.65Show/hide
Query:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR
        GL+    L  + +  P T  E M +A ++++AE+ L++  +    R               + AEDR   + E S        E  E +    P  KF  
Subjt:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR

Query:  YTSLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPA-------QTEPGKGGNNP
        +T L   ++++L  IQD   L+ P ++RSDP+ R +N YC FH DHGH T EC+ L+++IETLIR+G L+++V      RPA       Q EP + G  P
Subjt:  YTSLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPA-------QTEPGKGGNNP

Query:  PLEIRTILGGP-TGGESSRKRKAAIREAH-----MEPGE--QAVTRMISFLVVDC----------------VPAYNAILGRPTLHGLKAVASTYHQVLKF
          EIRTI+GGP +GG S   RKA  R+ H       P +  +   ++ISF   D                 + AYNAI+GRPTL+ L+AV STYH +LKF
Subjt:  PLEIRTILGGP-TGGESSRKRKAAIREAH-----MEPGE--QAVTRMISFLVVDC----------------VPAYNAILGRPTLHGLKAVASTYHQVLKF

Query:  PTEEGVGEVRGEQRTSRECYFMAL
        PTE G+GEVRG+Q  +RECY  +L
Subjt:  PTEEGVGEVRGEQRTSRECYFMAL

A0A6J1DWS1 uncharacterized protein LOC1110247911.5e-3938.36Show/hide
Query:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR
        GL DE L   +GE    T+ E + +A++ I  +ELL+ K    E +     D+ +  + K RRAE +S+   ++ S++  GRAE             ++R
Subjt:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR

Query:  YTSLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLE-
        YT  T ++ ++L  I++T    LLKRP++LR D ++RN++KYC FH DHGH T  C +L+ +IE LI++G  K+FVG  R     + E  K    PP + 
Subjt:  YTSLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLE-

Query:  -----IRTILGGPTGGESSRKRKAAIREAHMEPGEQAVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQR
             I TI GG +GG+S  KRK   RE+  +        M  F+V+D   AYNAI GRP +H  +AV ST HQVLK+ T  GVG VR  ++
Subjt:  -----IRTILGGPTGGESSRKRKAAIREAHMEPGEQAVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQR

A0A6J1DZB9 uncharacterized protein LOC1110249046.9e-4536.15Show/hide
Query:  LQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDRY
        L DE L   +GE  P T+VE + +A++ I  +ELL++K    E +     D+ +  + K R+A+ +SR +   SSA+   R E + L         ++RY
Subjt:  LQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDRY

Query:  TSLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLE--
        TS T  + ++L  I+++    LLKRPE+LR D ++RN+ KYC FH DHGH T  C +L+ +IE LI++GY K+FVG  R     + E  K    PP    
Subjt:  TSLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLE--

Query:  ----IRTILGGPTGGESSRKRKAAIREAHMEP---------------------------------------------------------GEQA--VTRMI
            I TI GGP GG+S  KRK   REA  E                                                          G+ A  VT+M 
Subjt:  ----IRTILGGPTGGESSRKRKAAIREAHMEP---------------------------------------------------------GEQA--VTRMI

Query:  SFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMALKNIDKKAKAAPTSGNGRGRTSEG-LDHPME
         F+V+D   AYNAI GRP +H  +AV ST HQVLK+ T   VG VRGEQ+TSRECY  ALK     A  A      RG+  E   D P E
Subjt:  SFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMALKNIDKKAKAAPTSGNGRGRTSEG-LDHPME

A0A7J0FZ98 Retrotrans_gag domain-containing protein3.9e-4036.82Show/hide
Query:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR
        GL+   L +S+ ++ P+T     ++A +YI+AEEL ++K      RG    + H+      RRA+ R   R+++     R    +   R R  P+     
Subjt:  GLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELLKSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDR

Query:  YTSLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPL-EIRT
           L   + QVL+ I+    +K P ++++DP +RNRNKYC F+ DHGH T +  QL+++I  LI+ GYL++++    R  P   E   G N P    I+T
Subjt:  YTSLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPL-EIRT

Query:  ILGG-PTGGESSRKRKAAIREAHMEPGEQAVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMALK
        I GG  +GG S+  RK   R A     E+  T    F+VVDC   YN+ILG+PTL  +KA+ STYH  +KFPT  G+GEV+G+Q+ +R+C+  A+K
Subjt:  ILGG-PTGGESSRKRKAAIREAHMEPGEQAVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMALK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCACCAAGATCAACCGGTAACTGACGAACTGAATCCCCAAGTTCGACTCCAGGCCCAGGAAGCCGAGATCGCAACAATTAAGGGGAGGATGAAAGAGATGGGGCA
GAATTTGACTGAAATCCTCAGTCTGTTGAAGAAACCCGAGTCTATAAGGCCTGCGGAAGAGCATGTACGCAGAGACCCCAAGAAGGGTAAAGGAATAGCAGATGAGGATG
GAGATTCGGAAAGTGTGACTAGTCGAATGCACTGTCCAGGGGATGACCAGACCCGGAGGGAAGCCGGACCCAGTCACAAGAGGGTTCGCAGGAATTCACCACTGAAATCA
GTGCCAGATATGCGTACAGAGGACAATGACAGGAAAAAGTTGGAGGCTCGGGCAAGATCCAGGGCCGAGCAAGACCAGAGGGGGCGAGAGCGGGAGCTGTCCAAGTGGCT
GAGGGAGGAGGACAGCCATCGAGGCTACCAAAAAAGAACGGAGAACGAAGACATAGAAGGGTTGATCGGGCAAATGGAGCCGCCCTTCACTGATGAGATAATGGGGGGGG
GAGGTTTGCAGGATGAAAGGTTGCTCAACTCGATCGGCGAAAGCCAGCCGCGAACATATGTGGAGTTCATGACTCAAGCGCAGAGGTACATCAGCGCCGAGGAGCTGCTT
AAGTCCAAGCAGGAAGAGAGAGAGAGCCGAGGGGTTTCTTCATCCGACCGACATCGAGAGGATCGGGCAAAGGGGCGTCGAGCCGAGGATAGAAGCCGAGGCCGACATGA
ACAGTCCTCGGCCAATGGCCGAGGCCGAGCAGAAGCCAAGGAGCTGCGAGGCCGTGCGGAGCCAAAAGCCAAATTTGACAGGTATACCTCGCTAACAGCTTCGCTTGAGC
AGGTTTTGGCCGCGATACAGGATACGAACCTGCTAAAACGTCCAGAAAGGCTGAGGTCGGACCCAGACAGGAGAAACCGGAACAAGTATTGCATGTTCCATGGAGACCAC
GGTCACACAACTCGGGAGTGCATACAGCTAAGGGATGAAATAGAAACCCTAATTCGAGAGGGTTACCTCAAGGAGTTCGTGGGACATGATAGGAGGAAGAGGCCAGCGCA
GACAGAGCCAGGCAAGGGGGGCAACAACCCACCGTTGGAGATACGAACTATTCTTGGGGGACCTACCGGGGGAGAATCGAGCAGGAAGCGAAAAGCTGCGATTCGAGAAG
CACATATGGAGCCCGGAGAGCAAGCAGTTACTAGAATGATAAGCTTCCTAGTAGTGGACTGTGTCCCAGCATACAACGCAATCTTGGGACGACCAACCCTACATGGGCTC
AAAGCAGTAGCCTCAACCTACCATCAAGTCCTGAAGTTCCCAACCGAAGAAGGCGTAGGAGAGGTACGAGGGGAGCAGAGGACATCGAGAGAGTGCTATTTCATGGCGCT
CAAGAATATAGACAAAAAGGCCAAGGCAGCGCCCACCTCGGGAAATGGCCGAGGCCGAACGTCCGAGGGGTTGGATCACCCAATGGAGCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCACCAAGATCAACCGGTAACTGACGAACTGAATCCCCAAGTTCGACTCCAGGCCCAGGAAGCCGAGATCGCAACAATTAAGGGGAGGATGAAAGAGATGGGGCA
GAATTTGACTGAAATCCTCAGTCTGTTGAAGAAACCCGAGTCTATAAGGCCTGCGGAAGAGCATGTACGCAGAGACCCCAAGAAGGGTAAAGGAATAGCAGATGAGGATG
GAGATTCGGAAAGTGTGACTAGTCGAATGCACTGTCCAGGGGATGACCAGACCCGGAGGGAAGCCGGACCCAGTCACAAGAGGGTTCGCAGGAATTCACCACTGAAATCA
GTGCCAGATATGCGTACAGAGGACAATGACAGGAAAAAGTTGGAGGCTCGGGCAAGATCCAGGGCCGAGCAAGACCAGAGGGGGCGAGAGCGGGAGCTGTCCAAGTGGCT
GAGGGAGGAGGACAGCCATCGAGGCTACCAAAAAAGAACGGAGAACGAAGACATAGAAGGGTTGATCGGGCAAATGGAGCCGCCCTTCACTGATGAGATAATGGGGGGGG
GAGGTTTGCAGGATGAAAGGTTGCTCAACTCGATCGGCGAAAGCCAGCCGCGAACATATGTGGAGTTCATGACTCAAGCGCAGAGGTACATCAGCGCCGAGGAGCTGCTT
AAGTCCAAGCAGGAAGAGAGAGAGAGCCGAGGGGTTTCTTCATCCGACCGACATCGAGAGGATCGGGCAAAGGGGCGTCGAGCCGAGGATAGAAGCCGAGGCCGACATGA
ACAGTCCTCGGCCAATGGCCGAGGCCGAGCAGAAGCCAAGGAGCTGCGAGGCCGTGCGGAGCCAAAAGCCAAATTTGACAGGTATACCTCGCTAACAGCTTCGCTTGAGC
AGGTTTTGGCCGCGATACAGGATACGAACCTGCTAAAACGTCCAGAAAGGCTGAGGTCGGACCCAGACAGGAGAAACCGGAACAAGTATTGCATGTTCCATGGAGACCAC
GGTCACACAACTCGGGAGTGCATACAGCTAAGGGATGAAATAGAAACCCTAATTCGAGAGGGTTACCTCAAGGAGTTCGTGGGACATGATAGGAGGAAGAGGCCAGCGCA
GACAGAGCCAGGCAAGGGGGGCAACAACCCACCGTTGGAGATACGAACTATTCTTGGGGGACCTACCGGGGGAGAATCGAGCAGGAAGCGAAAAGCTGCGATTCGAGAAG
CACATATGGAGCCCGGAGAGCAAGCAGTTACTAGAATGATAAGCTTCCTAGTAGTGGACTGTGTCCCAGCATACAACGCAATCTTGGGACGACCAACCCTACATGGGCTC
AAAGCAGTAGCCTCAACCTACCATCAAGTCCTGAAGTTCCCAACCGAAGAAGGCGTAGGAGAGGTACGAGGGGAGCAGAGGACATCGAGAGAGTGCTATTTCATGGCGCT
CAAGAATATAGACAAAAAGGCCAAGGCAGCGCCCACCTCGGGAAATGGCCGAGGCCGAACGTCCGAGGGGTTGGATCACCCAATGGAGCACTGA
Protein sequenceShow/hide protein sequence
MEHQDQPVTDELNPQVRLQAQEAEIATIKGRMKEMGQNLTEILSLLKKPESIRPAEEHVRRDPKKGKGIADEDGDSESVTSRMHCPGDDQTRREAGPSHKRVRRNSPLKS
VPDMRTEDNDRKKLEARARSRAEQDQRGRERELSKWLREEDSHRGYQKRTENEDIEGLIGQMEPPFTDEIMGGGGLQDERLLNSIGESQPRTYVEFMTQAQRYISAEELL
KSKQEERESRGVSSSDRHREDRAKGRRAEDRSRGRHEQSSANGRGRAEAKELRGRAEPKAKFDRYTSLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDH
GHTTRECIQLRDEIETLIREGYLKEFVGHDRRKRPAQTEPGKGGNNPPLEIRTILGGPTGGESSRKRKAAIREAHMEPGEQAVTRMISFLVVDCVPAYNAILGRPTLHGL
KAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMALKNIDKKAKAAPTSGNGRGRTSEGLDHPMEH