; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg017540 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg017540
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H
Genome locationscaffold4:42782010..42784808
RNA-Seq ExpressionSpg017540
SyntenySpg017540
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142767.1 uncharacterized protein LOC111012805 [Momordica charantia]6.4e-4134.81Show/hide
Query:  KEEDGHRDSQRRTENEDIEGLIGDMEPPFTDEIM----GGEVPHKF---------------------------KDERLLNSIGESQPRTYVEFMTRAQR-
        + E   +   ++ +  D+E L+   + PFT+EIM      E  H +                           +DE L  S G+  P T+ E ++RAQR 
Subjt:  KEEDGHRDSQRRTENEDIEGLIGDMEPPFTDEIM----GGEVPHKF---------------------------KDERLLNSIGESQPRTYVEFMTRAQR-

Query:  ------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTNLLRRPEKLRSDPDRRNRNKYCMF---H
                      G R ++K   R+         R E  +   + +P  +F +YT  T PLEQVL+ I +  LL+ PE++ +   +R++ +YC+F   H
Subjt:  ------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTNLLRRPEKLRSDPDRRNRNKYCMF---H

Query:  GHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIREAQQESGEQGMYSLLLDENSPKLEFTEK
        GHAT++C  L++E+E LI  GYLKE+V     + P     G+   +P  EIRT++GGP   ESGRKRK  +REA+    +  +Y +        +EF+E 
Subjt:  GHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIREAQQESGEQGMYSLLLDENSPKLEFTEK

Query:  VAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLS
         A  + HPHNDALV+TL IAN +VH ILVDGGSSAD++S
Subjt:  VAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLS

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]1.5e-5038.1Show/hide
Query:  KDERLLNSIGESQPRTYVEFMTRAQR-------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTN
        +DE L  S G+  P T+ E ++RAQR               G R + K   R+         R E  +   + +P  +F +YTP T P+EQVL+ I D  
Subjt:  KDERLLNSIGESQPRTYVEFMTRAQR-------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTN

Query:  LLRRPEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIRE
        LL+ PE++++   +R++ +YC+F   HGHAT++C  L++E+E LIR GYLKE+V     + P     G+   +P  EIRT++GGP   ESGRKRK+ +RE
Subjt:  LLRRPEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIRE

Query:  AQQESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTPLVGFGGEKLEERPR
        A+    +  +Y          +EF+E  A  + HPHNDALV+ L IAN KVH +LVDGGSSAD++S TA+ AM L    L+ S  PLVGFG E++     
Subjt:  AQQESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTPLVGFGGEKLEERPR

Query:  DSLGLPKLPQKSVRTPGMDTTLEERLTHALNLPKSP
               +P+  +  P    TLE R +   ++   P
Subjt:  DSLGLPKLPQKSVRTPGMDTTLEERLTHALNLPKSP

XP_022159368.1 uncharacterized protein LOC111025785 [Momordica charantia]6.4e-4140.23Show/hide
Query:  KDERLLNSIGESQPRTYVEFMTRAQR-------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTN
        +DE L  S G+  P T+ E ++RAQR               G R ++K   R+         R E  +   + +P  +F +YTP T PLEQVL+ I D  
Subjt:  KDERLLNSIGESQPRTYVEFMTRAQR-------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTN

Query:  LLRRPEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIRE
        LL+ PE+++    +R++ +YC+F   H HAT++   L++E+E LIR GYL+E+V     + P     G+   +P  EIRT++GGP   ES RKRK+ +RE
Subjt:  LLRRPEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIRE

Query:  AQQESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLS
        A+    +  +Y       S  +EF+E  A  + HPHNDALV+TL IAN KVH ILVDGGSSAD++S
Subjt:  AQQESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLS

XP_030932439.1 uncharacterized protein LOC115958186 [Quercus lobata]1.9e-4039.8Show/hide
Query:  LNSIGESQPRTYVEFMTRAQR--GKGHRVEEKGRSRQEHFSAN-----------GRGRPENNEPRGR-AEPKARFYRYTPLTAPLEQVLVAIHDTNLLRR
        ++ + E +P+T  E +  AQ        +  K R R E   AN            +GR E+ + R R   P AR  +YTPL  PL+QVL+ + D   L+ 
Subjt:  LNSIGESQPRTYVEFMTRAQR--GKGHRVEEKGRSRQEHFSAN-----------GRGRPENNEPRGR-AEPKARFYRYTPLTAPLEQVLVAIHDTNLLRR

Query:  PEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIREAQ--
        PEK++ DP++RNRNKYC F   HGH T EC  L+ +IE LIR+G L+ F+G +     L     +    P  EIR ++GG S  +S R RK+ ++  Q  
Subjt:  PEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIREAQ--

Query:  QESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTPLVGFGGEKLEERPRDS
        Q SG         D +   + FTE+ A  I HPH+DA+V+TL IA+     +LVD GSSAD+L   AF  MKLG DRL    +PLVGFGG K++  P  +
Subjt:  QESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTPLVGFGGEKLEERPRDS

Query:  LGLP
        + LP
Subjt:  LGLP

XP_030936700.1 uncharacterized protein LOC115961955 [Quercus lobata]7.6e-4240.82Show/hide
Query:  LNSIGESQPRTYVEFMTRAQR--GKGHRVEEKGRSRQEHFSAN-----------GRGRPENNEPRGR-AEPKARFYRYTPLTAPLEQVLVAIHDTNLLRR
        ++ + E +P+T  E +  AQ        +  K R + E   AN            +GR E+ + R R A P  R  +YTPL  PL+QVL+ I D   L+ 
Subjt:  LNSIGESQPRTYVEFMTRAQR--GKGHRVEEKGRSRQEHFSAN-----------GRGRPENNEPRGR-AEPKARFYRYTPLTAPLEQVLVAIHDTNLLRR

Query:  PEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIREAQ--
        PEK++ DP++RNRNKYC F   HGH T EC  L+ +IE LIR+G L+ F+G +     L     +    P  EIR ++GG S  +S R RK+ ++  Q  
Subjt:  PEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIREAQ--

Query:  QESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTPLVGFGGEKLE
        Q SG         D +   + FTE+ A  I HPH+DA+V+TL IA+     +LVD GSSAD+L   AF  MKLG DRLRP  +PLVGFGG K++
Subjt:  QESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTPLVGFGGEKLE

TrEMBL top hitse value%identityAlignment
A0A6J1CNT2 uncharacterized protein LOC1110128053.1e-4134.81Show/hide
Query:  KEEDGHRDSQRRTENEDIEGLIGDMEPPFTDEIM----GGEVPHKF---------------------------KDERLLNSIGESQPRTYVEFMTRAQR-
        + E   +   ++ +  D+E L+   + PFT+EIM      E  H +                           +DE L  S G+  P T+ E ++RAQR 
Subjt:  KEEDGHRDSQRRTENEDIEGLIGDMEPPFTDEIM----GGEVPHKF---------------------------KDERLLNSIGESQPRTYVEFMTRAQR-

Query:  ------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTNLLRRPEKLRSDPDRRNRNKYCMF---H
                      G R ++K   R+         R E  +   + +P  +F +YT  T PLEQVL+ I +  LL+ PE++ +   +R++ +YC+F   H
Subjt:  ------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTNLLRRPEKLRSDPDRRNRNKYCMF---H

Query:  GHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIREAQQESGEQGMYSLLLDENSPKLEFTEK
        GHAT++C  L++E+E LI  GYLKE+V     + P     G+   +P  EIRT++GGP   ESGRKRK  +REA+    +  +Y +        +EF+E 
Subjt:  GHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIREAQQESGEQGMYSLLLDENSPKLEFTEK

Query:  VAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLS
         A  + HPHNDALV+TL IAN +VH ILVDGGSSAD++S
Subjt:  VAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLS

A0A6J1D8C9 uncharacterized protein LOC1110183001.1e-3844.24Show/hide
Query:  PLEQVLVAIHDTNLLRRPEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGA-NPPLEIRTLLGGPS
        PLEQVL+ I    LL+ PE++ +   +R++ +YC+F   H HAT++C  L+ E++ LI+ GYLKE+V     + P     G+  + +P  EIRT++GGP 
Subjt:  PLEQVLVAIHDTNLLRRPEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGA-NPPLEIRTLLGGPS

Query:  GGESGRKRKSAIREAQQESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTP
          E GRKRK++IRE +    +  +Y   + +   K+EF+E  A  + HPHND LV+TL IANAKVH ILVDGGSSAD++S TA+ AM LG    + S   
Subjt:  GGESGRKRKSAIREAQQESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTP

Query:  LVGFGGEKLEERPRDSL
        LV F GE++    R  L
Subjt:  LVGFGGEKLEERPRDSL

A0A6J1DWY0 uncharacterized protein LOC1110252935.7e-5138.39Show/hide
Query:  KDERLLNSIGESQPRTYVEFMTRAQR-------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTN
        +DE L  S G+  P T+ E ++RAQR               G R + K   R+         R E  +   + +P  +F +YTP T P+EQVL+ I D  
Subjt:  KDERLLNSIGESQPRTYVEFMTRAQR-------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTN

Query:  LLRRPEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIRE
        LL+ PE++++   +R++ +YC+F   HGHAT++C  L++E+E LIR GYLKE+V     + P     G+   +P  EIRT++GGP   ESGRKRK+ +RE
Subjt:  LLRRPEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIRE

Query:  AQQESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTPLVGFGGEKLEERPR
        A+    +  +Y          +EF+E  A  + HPHNDALV+ L IAN KVH +LVDGGSSAD+LS TA+ AM L    L+ S  PLVGFG E++     
Subjt:  AQQESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTPLVGFGGEKLEERPR

Query:  DSLGLPKLPQKSVRTPGMDTTLEERLTHALNLPKSP
               +P+  +  P    TLE R +   ++   P
Subjt:  DSLGLPKLPQKSVRTPGMDTTLEERLTHALNLPKSP

A0A6J1DY78 uncharacterized protein LOC1110252912.9e-3949.46Show/hide
Query:  DPDRR-NRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIREAQQESGEQG
        DP  + N+ +YC+F   HGHAT++C  L++E+E L R GYLKE+V ++           +   +P  EIRT++GGP   ESGRKRK+ +REA+   G+  
Subjt:  DPDRR-NRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIREAQQESGEQG

Query:  MYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTPLVGFGGEKL
        +Y   +   S  +EF+E  A  + HPHNDALV+TL IAN KVH ILVDGGSSAD++S TA+ AM LG   L+ SL PLVGFGGE++
Subjt:  MYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLSTTAFDAMKLGSDRLRPSLTPLVGFGGEKL

A0A6J1DYL6 uncharacterized protein LOC1110257853.1e-4140.23Show/hide
Query:  KDERLLNSIGESQPRTYVEFMTRAQR-------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTN
        +DE L  S G+  P T+ E ++RAQR               G R ++K   R+         R E  +   + +P  +F +YTP T PLEQVL+ I D  
Subjt:  KDERLLNSIGESQPRTYVEFMTRAQR-------------GKGHRVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTN

Query:  LLRRPEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIRE
        LL+ PE+++    +R++ +YC+F   H HAT++   L++E+E LIR GYL+E+V     + P     G+   +P  EIRT++GGP   ES RKRK+ +RE
Subjt:  LLRRPEKLRSDPDRRNRNKYCMF---HGHATRECRQLRDEIEALIREGYLKEFVGNNIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIRE

Query:  AQQESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLS
        A+    +  +Y       S  +EF+E  A  + HPHNDALV+TL IAN KVH ILVDGGSSAD++S
Subjt:  AQQESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCACGAAAATCAGCCAGTAATAGACGAGGCGCATCCCCTGGTCCGGCTCCAAGCCCAAGAGACTGAGATTGCAGCAATTAAGGGGAGGATGAACGAGATGGGGCA
GAACTTGGCCGAAATCCTTAATTTGCTGAAGAAGCCCGAGTCTGTGGAGCACGGGGAAGAGCATCTGCGCAGAGATCCCAAGAAGGGTAAAGGAGTAGCGGATGAGGAGG
TAGGAGATTCGGAGAGTGTAACCAGCCGGATGCACCATCCAGTAGATGGTCGAACCCGGAAAGAGGTTGGACCCAGCCACAAAAGGATCCGCAGAAATTCGCCGCCGGAA
CCGGCGCCAGGTATGTATACAGGGAATAATGACAGGAAAAAGTTGGAGGCTCGGGCAAGGTCCGAGACCGAGCAGGGCCAAAAGGGGCGAGAGCGGGAGCTATCAAAGTG
GCTGAAAGAGGAAGACGGCCATCGCGACTCCCAAAGAAGAACTGAAAATGAAGACATCGAAGGGCTAATTGGAGATATGGAACCACCCTTCACCGACGAAATAATGGGAG
GGGAGGTGCCTCATAAATTTAAGGATGAAAGACTGCTCAACTCGATCGGTGAGAGCCAGCCACGAACATACGTGGAGTTCATGACCCGAGCACAAAGAGGAAAAGGGCAT
CGGGTCGAAGAGAAAGGTCGAAGTCGACAAGAGCACTTCTCGGCCAATGGCCGAGGCCGACCAGAGAACAATGAGCCTCGGGGCCGTGCGGAACCAAAAGCTAGATTTTA
CAGGTATACACCACTAACAGCTCCACTTGAACAGGTCTTGGTCGCAATACATGACACAAACCTGCTAAGACGCCCAGAAAAATTAAGGTCGGACCCAGACAGGAGGAATC
GAAACAAGTACTGCATGTTCCACGGTCACGCAACTCGGGAATGCAGACAGTTGAGGGACGAGATAGAAGCCCTAATCCGAGAAGGTTACCTCAAGGAGTTTGTGGGAAAT
AACATAGGCAAGAGGCCATTGCCAGCAAATCAAGGTAAGGGCGGTGCCAACCCACCGCTCGAGATACGAACACTTTTAGGAGGACCATCTGGAGGAGAGTCAGGCAGAAA
GCGAAAGTCCGCAATTCGAGAGGCACAGCAAGAGTCCGGAGAGCAAGGTATGTACTCACTCCTACTCGATGAAAACTCACCAAAGTTGGAGTTTACAGAGAAAGTGGCCG
CGGGGATACGTCATCCGCACAATGACGCGCTGGTGGTCACCCTAACGATTGCCAACGCAAAAGTTCACTGGATCCTCGTTGATGGGGGGAGTTCCGCTGATGTTCTCTCA
ACCACTGCGTTCGACGCCATGAAGCTGGGAAGCGATCGCCTGAGGCCGAGCCTCACGCCGTTGGTGGGATTTGGAGGAGAAAAACTTGAAGAGAGGCCCAGGGACTCCCT
AGGCCTCCCCAAGCTTCCCCAAAAGAGTGTACGCACCCCTGGGATGGACACAACACTTGAAGAGAGGTTGACGCATGCCCTCAACCTCCCCAAGTCCCCAACTAAGTGCA
GACACCCTTGGGATGGAAAGCAAGCAAGCAAACAGCACAGAGTAGCTAGTCGGCCTCATGCTAAGGCTGAGGCCGACCATCCAAAGGCCAAGGCCAAACTCCACCCTCGA
AGGCCTAGCCCTCAGGCACGCATCCAAAGTAGCATGACTAGTTCGAGTTCTCAGTCGCTTCACTTTTTAGCACAGCCGAAGCGCTCAAAGCCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGCACGAAAATCAGCCAGTAATAGACGAGGCGCATCCCCTGGTCCGGCTCCAAGCCCAAGAGACTGAGATTGCAGCAATTAAGGGGAGGATGAACGAGATGGGGCA
GAACTTGGCCGAAATCCTTAATTTGCTGAAGAAGCCCGAGTCTGTGGAGCACGGGGAAGAGCATCTGCGCAGAGATCCCAAGAAGGGTAAAGGAGTAGCGGATGAGGAGG
TAGGAGATTCGGAGAGTGTAACCAGCCGGATGCACCATCCAGTAGATGGTCGAACCCGGAAAGAGGTTGGACCCAGCCACAAAAGGATCCGCAGAAATTCGCCGCCGGAA
CCGGCGCCAGGTATGTATACAGGGAATAATGACAGGAAAAAGTTGGAGGCTCGGGCAAGGTCCGAGACCGAGCAGGGCCAAAAGGGGCGAGAGCGGGAGCTATCAAAGTG
GCTGAAAGAGGAAGACGGCCATCGCGACTCCCAAAGAAGAACTGAAAATGAAGACATCGAAGGGCTAATTGGAGATATGGAACCACCCTTCACCGACGAAATAATGGGAG
GGGAGGTGCCTCATAAATTTAAGGATGAAAGACTGCTCAACTCGATCGGTGAGAGCCAGCCACGAACATACGTGGAGTTCATGACCCGAGCACAAAGAGGAAAAGGGCAT
CGGGTCGAAGAGAAAGGTCGAAGTCGACAAGAGCACTTCTCGGCCAATGGCCGAGGCCGACCAGAGAACAATGAGCCTCGGGGCCGTGCGGAACCAAAAGCTAGATTTTA
CAGGTATACACCACTAACAGCTCCACTTGAACAGGTCTTGGTCGCAATACATGACACAAACCTGCTAAGACGCCCAGAAAAATTAAGGTCGGACCCAGACAGGAGGAATC
GAAACAAGTACTGCATGTTCCACGGTCACGCAACTCGGGAATGCAGACAGTTGAGGGACGAGATAGAAGCCCTAATCCGAGAAGGTTACCTCAAGGAGTTTGTGGGAAAT
AACATAGGCAAGAGGCCATTGCCAGCAAATCAAGGTAAGGGCGGTGCCAACCCACCGCTCGAGATACGAACACTTTTAGGAGGACCATCTGGAGGAGAGTCAGGCAGAAA
GCGAAAGTCCGCAATTCGAGAGGCACAGCAAGAGTCCGGAGAGCAAGGTATGTACTCACTCCTACTCGATGAAAACTCACCAAAGTTGGAGTTTACAGAGAAAGTGGCCG
CGGGGATACGTCATCCGCACAATGACGCGCTGGTGGTCACCCTAACGATTGCCAACGCAAAAGTTCACTGGATCCTCGTTGATGGGGGGAGTTCCGCTGATGTTCTCTCA
ACCACTGCGTTCGACGCCATGAAGCTGGGAAGCGATCGCCTGAGGCCGAGCCTCACGCCGTTGGTGGGATTTGGAGGAGAAAAACTTGAAGAGAGGCCCAGGGACTCCCT
AGGCCTCCCCAAGCTTCCCCAAAAGAGTGTACGCACCCCTGGGATGGACACAACACTTGAAGAGAGGTTGACGCATGCCCTCAACCTCCCCAAGTCCCCAACTAAGTGCA
GACACCCTTGGGATGGAAAGCAAGCAAGCAAACAGCACAGAGTAGCTAGTCGGCCTCATGCTAAGGCTGAGGCCGACCATCCAAAGGCCAAGGCCAAACTCCACCCTCGA
AGGCCTAGCCCTCAGGCACGCATCCAAAGTAGCATGACTAGTTCGAGTTCTCAGTCGCTTCACTTTTTAGCACAGCCGAAGCGCTCAAAGCCGTAG
Protein sequenceShow/hide protein sequence
MEHENQPVIDEAHPLVRLQAQETEIAAIKGRMNEMGQNLAEILNLLKKPESVEHGEEHLRRDPKKGKGVADEEVGDSESVTSRMHHPVDGRTRKEVGPSHKRIRRNSPPE
PAPGMYTGNNDRKKLEARARSETEQGQKGRERELSKWLKEEDGHRDSQRRTENEDIEGLIGDMEPPFTDEIMGGEVPHKFKDERLLNSIGESQPRTYVEFMTRAQRGKGH
RVEEKGRSRQEHFSANGRGRPENNEPRGRAEPKARFYRYTPLTAPLEQVLVAIHDTNLLRRPEKLRSDPDRRNRNKYCMFHGHATRECRQLRDEIEALIREGYLKEFVGN
NIGKRPLPANQGKGGANPPLEIRTLLGGPSGGESGRKRKSAIREAQQESGEQGMYSLLLDENSPKLEFTEKVAAGIRHPHNDALVVTLTIANAKVHWILVDGGSSADVLS
TTAFDAMKLGSDRLRPSLTPLVGFGGEKLEERPRDSLGLPKLPQKSVRTPGMDTTLEERLTHALNLPKSPTKCRHPWDGKQASKQHRVASRPHAKAEADHPKAKAKLHPR
RPSPQARIQSSMTSSSSQSLHFLAQPKRSKP