CuGenDBv2

Lag0005733 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005733
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ribonuclease H
Genome location: chr6:27393739..27395626
RNA-Seq Expression: Lag0005733
Synteny: Lag0005733
Gene Ontology terms: NA
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
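
The genome location field follows a chromosome:start..end pattern. A minimal sketch of reading it, assuming the coordinates are 1-based and inclusive (the page does not state its convention) and using a hypothetical helper name `parse_location`:

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr6:27393739..27395626")
# Assumption: both endpoints are included in the gene span.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 1888 bp on chr6.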


Homology
GenBank top hits (e value, % identity, alignment)
XP_022155186.1 uncharacterized protein LOC111022321 [Momordica charantia] (e value: 7.6e-49; identity: 60.95%)
Query:  LEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFL
        +EF+E EA  + HPHNDALV+TL IAN +VH+ILVDGGSSAD++S  A +AM LG   L+S+   LVGF GE+V LEG IELPVTFG G +SVTKM++FL
Subjt:  LEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFL

Query:  VVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTELKDTKRKARTAPC
        VV+ TS+YNAILGR T+H LKAI STY+Q  KFPT  GVG ++ EQ++SRECY T ++D  R +    C
Subjt:  VVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTELKDTKRKARTAPC

XP_023895777.1 uncharacterized protein LOC112007641 [Quercus suber] (e value: 8.2e-51; identity: 35.32%)
Query:  EQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDGI--PRKSIGSFKELVWAFVTQFLEGWNRCKPQINLLKIKQ
        E+E+ +  K  D   +S R+ +   +E+L+ + D PF+  I    +P KFK+PT   YDG   P   I +FK       T  L+G ++ +   +LL I+Q
Subjt:  EQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDGI--PRKSIGSFKELVWAFVTQFLEGWNRCKPQINLLKIKQ

Query:  GPRESWRDYITRFSNEVLQVEGYDDGVALTA---------------------------------------MISERTEAEDV------------RQMRVAE
        G  ES   +ITRF+ E L V+  D+ + L A                                       ++ +R  AE +            R  +   
Subjt:  GPRESWRDYITRFSNEVLQVEGYDDGVALTA---------------------------------------MISERTEAEDV------------RQMRVAE

Query:  ADRSIPQVEAGQRETLKNV-------------SHCPSLSNRFWQRLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLG
         D+   +  AGQ    K               S C  +++   Q + FT+ +A  +HHPH+DA+VVTL IA+ +  ++L+D GSSAD+L   A + M+LG
Subjt:  ADRSIPQVEAGQRETLKNV-------------SHCPSLSNRFWQRLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLG

Query:  RDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFT
        RD LR   + LVGF G KV   GS+ LPVT G   Q VTK +NFLVVDC+S+YNAI+GR TL+  KAI STY+  +KFPTE G G V  +Q  +RECY  
Subjt:  RDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFT

Query:  EL
         L
Subjt:  EL

XP_023900876.1 uncharacterized protein LOC112012715 [Quercus suber] (e value: 7.6e-49; identity: 34.72%)
Query:  EQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDGI--PRKSIGSFK----------ELVWAFVTQFLEGWNR--
        E+E+ +  K  D   ++ R+A+   IE+L+ + D PF   I G  +P KFK+P+   YDG   P   I +FK          E++       L+G  R  
Subjt:  EQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDGI--PRKSIGSFK----------ELVWAFVTQFLEGWNR--

Query:  CKPQINLLKIKQGPRESWRDYITRFSNEVLQVEGYDDGVALTAMIS---------------ERTEAEDVRQ-----------------------------
         +   +LL I+QG  ES R +ITRF+ E L V+  DD + L A  +               +  E  D++Q                             
Subjt:  CKPQINLLKIKQGPRESWRDYITRFSNEVLQVEGYDDGVALTAMIS---------------ERTEAEDVRQ-----------------------------

Query:  ---------MRVAEADRSIPQVEAGQRETLKNVSHC------PSLSNRFWQRLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVA
                 +RV     S  Q    ++  LK V +       P         + FT+ EA  IHHPH+DA+V+TL IA+ +  ++LVD GSSAD+L   A
Subjt:  ---------MRVAEADRSIPQVEAGQRETLKNVSHC------PSLSNRFWQRLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVA

Query:  LEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKI
         + M++G+D+LR   + L+GF G KV   G+I LPV  G   Q +TK +NFLVVDC+S+YNAI+GR TL+  KA+ STY+  +KFPTE GVG V+ +Q  
Subjt:  LEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKI

Query:  SRECYFTEL
        +RECY   L
Subjt:  SRECYFTEL

XP_023902033.1 uncharacterized protein LOC112013883 [Quercus suber] (e value: 7.6e-49; identity: 34.72%)
Query:  EQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDGI--PRKSIGSFK----------ELVWAFVTQFLEGWNR--
        E+E+ +  K  D   ++ R+A+   IE+L+ + D PF   I G  +P KFK+P+   YDG   P   I +FK          E++       L+G  R  
Subjt:  EQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDGI--PRKSIGSFK----------ELVWAFVTQFLEGWNR--

Query:  CKPQINLLKIKQGPRESWRDYITRFSNEVLQVEGYDDGVALTAMIS---------------ERTEAEDVRQ-----------------------------
         +   +LL I+QG  ES R +ITRF+ E L V+  DD + L A  +               +  E  D++Q                             
Subjt:  CKPQINLLKIKQGPRESWRDYITRFSNEVLQVEGYDDGVALTAMIS---------------ERTEAEDVRQ-----------------------------

Query:  ---------MRVAEADRSIPQVEAGQRETLKNVSHC------PSLSNRFWQRLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVA
                 +RV     S  Q    ++  LK V +       P         + FT+ EA  IHHPH+DA+V+TL IA+ +  ++LVD GSSAD+L   A
Subjt:  ---------MRVAEADRSIPQVEAGQRETLKNVSHC------PSLSNRFWQRLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVA

Query:  LEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKI
         + M++G+D+LR   + L+GF G KV   G+I LPV  G   Q +TK +NFLVVDC+S+YNAI+GR TL+  KA+ STY+  +KFPTE GVG V+ +Q  
Subjt:  LEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKI

Query:  SRECYFTEL
        +RECY   L
Subjt:  SRECYFTEL

XP_030963816.1 uncharacterized protein LOC115984981 [Quercus lobata] (e value: 2.5e-52; identity: 34.78%)
Query:  EQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDG----------------------------------------
        ++E+ +  K  +   ++ R+ +   IE+L+ + D PF   I G  +P KFK+P+   YDG                                        
Subjt:  EQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDG----------------------------------------

Query:  ---IPRKSIGSFKELVWAFVTQFLEGWNRCKPQINLLKIKQGPRESWRDYITRFSNEVLQVEGYDDGVALTAM-------------------------IS
           IP  S+ SF+EL   FV  F+ G    +   +LL I+QG  ES R +ITRF+ E L V+  DD + L A                           S
Subjt:  ---IPRKSIGSFKELVWAFVTQFLEGWNRCKPQINLLKIKQGPRESWRDYITRFSNEVLQVEGYDDGVALTAM-------------------------IS

Query:  ERTEAEDV------------RQMRVAEADRSIPQVEAGQRETLKNVSHCPSL--SNRFWQRLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGS
         R EAE              +  R  E D +I QVE    +  ++ + CP+L  + +    + FT+ +A  IHH H+DA+V+TL IA+    ++LVD GS
Subjt:  ERTEAEDV------------RQMRVAEADRSIPQVEAGQRETLKNVSHCPSL--SNRFWQRLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGS

Query:  SADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGV
        SAD+L   A + M+LGRD+LR   + L+GF G KV   G++ LPV  G   Q +TK +NFLVVDCTS+YNAI+GR TL+  KAI STY+  +KFPTE G+
Subjt:  SADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGV

Query:  GVVRVEQKISRECY
        G  + +Q  +RECY
Subjt:  GVVRVEQKISRECY

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EQY5 Ribonuclease H (e value: 6.5e-46; identity: 29.82%)
Query:  EQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDG----------------------------------------
        E+EL    K+     +S R    + ++ L+ + D PFI  I    +P +FK+P  + +DG                                        
Subjt:  EQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDG----------------------------------------

Query:  ---IPRKSIGSFKELVWAFVTQFLEGWNRCKPQINLLKIKQGPRESWRDYITRFSNEVLQVEGYDDGVALTAMISER-------TEAEDVRQMRVAEAD-
           +  +SIGSF +L  AF+  F+    R +P  +LL +KQ   ES R ++ RF+ E ++++   + V +TA ++ R       T  +    +R+A  + 
Subjt:  ---IPRKSIGSFKELVWAFVTQFLEGWNRCKPQINLLKIKQGPRESWRDYITRFSNEVLQVEGYDDGVALTAMISER-------TEAEDVRQMRVAEAD-

Query:  -----------RSIPQ-----------VEAGQRET-----LKNVSHCPSLSNRF-------------------W-------------QRLEFTEREAAGI
                   R++PQ           +E   R+T            PSL ++                    W             Q + F+E +A G 
Subjt:  -----------RSIPQ-----------VEAGQRET-----LKNVSHCPSLSNRF-------------------W-------------QRLEFTEREAAGI

Query:  HHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAI
        H PH+DALV+T+ IA     +++VD GSSAD+L   A + M+L +D+LR     LVGF G+K+   G + LP+  G   ++V+K ++FLVV+C SAYNAI
Subjt:  HHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAI

Query:  LGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTEL
        +GR TL+ L+A+ STY+ +LKFPTE G+G VR +Q  +RECY   L
Subjt:  LGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTEL

A0A2N9ESE9 Integrase catalytic domain-containing protein (e value: 5.0e-46; identity: 31.27%)
Query:  VEQGQGGREQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDG--------------------------------
        +E+     EQE+     +     D ++    K +++L+   D PF   +    +P KF++PT + +DG                                
Subjt:  VEQGQGGREQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDG--------------------------------

Query:  -----------IPRKSIGSFKELVWAFVTQFLEGWNRCKPQINLLKIKQGPRESWRDYITRFSNEVLQVEGYDDGVALTAMISERTEAEDVRQMRVAEAD
                   +   S+GSF +L   F   F+ G    +P  +LL +KQ   E  R Y+TRF+ E L VEG DD V       ++ + ED+         
Subjt:  -----------IPRKSIGSFKELVWAFVTQFLEGWNRCKPQINLLKIKQGPRESWRDYITRFSNEVLQVEGYDDGVALTAMISERTEAEDVRQMRVAEAD

Query:  RSIPQVEAGQRETLKNVSHCPSLSNRFWQRLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFE
                                      + FTE +A  + HPH+DALVVTL IA     ++L+D GS AD++   A + MK+G+D+LR   T LVGF 
Subjt:  RSIPQVEAGQRETLKNVSHCPSLSNRFWQRLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFE

Query:  GEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTEL
        G  V   G I L +  G   +  TK ++FLVVDC SAYN I+GR TL+ L+A+ STY+ +++FPTE G+G +R +Q ++RECY T +
Subjt:  GEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTEL

A0A6J1CZ14 uncharacterized protein LOC111015901 (e value: 9.4e-45; identity: 54.49%)
Query:  RLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINF
        ++EF+E EA  + HPHNDALV+TL IAN +VH+ILVDGGSS D+ S  A +AM LG + L+S+LT L+GF GE++  +G IELP+TF    +S+T+M++F
Subjt:  RLEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINF

Query:  LVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTELKDTKRKART
        LVVD TS+YN IL R T+H L+AI STY+Q +KFPT  GVG ++ EQ++SRECY+T ++   + + T
Subjt:  LVVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTELKDTKRKART

A0A6J1DQY2 uncharacterized protein LOC111022321 (e value: 3.7e-49; identity: 60.95%)
Query:  LEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFL
        +EF+E EA  + HPHNDALV+TL IAN +VH+ILVDGGSSAD++S  A +AM LG   L+S+   LVGF GE+V LEG IELPVTFG G +SVTKM++FL
Subjt:  LEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFL

Query:  VVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTELKDTKRKARTAPC
        VV+ TS+YNAILGR T+H LKAI STY+Q  KFPT  GVG ++ EQ++SRECY T ++D  R +    C
Subjt:  VVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTELKDTKRKARTAPC

A0A6J1DXV4 uncharacterized protein LOC111025147 (e value: 8.5e-46; identity: 57.4%)
Query:  LEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFL
        +EF+E EA  + HPHNDALV+TL IAN +VH+ILVDGGSSAD++S    + M LG   L+S+   L+GF  E+V  EG IEL VTFG G +SVTKM++FL
Subjt:  LEFTEREAAGIHHPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFL

Query:  VVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTELKDTKRKARTAPC
        VVD TS+YNAILGR+T+H LKAI STY+Q +KFPT  GVG ++ EQ++SRECY+T ++   R +    C
Subjt:  VVDCTSAYNAILGRATLHELKAIASTYYQVLKFPTEKGVGVVRVEQKISRECYFTELKDTKRKARTAPC

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTCAAGTATGCATGTTAAAGTCAGAGGCAGGAAAAAGTTAAGGGCTCGAGCAGGATCCAAGGTCGAGCAGGGCCAAGGAGGACGTGAGCAGGAGTTGTCAAGGTGGCT
GAAAGAGGAGGATAGCCCTTATGACTCGTATAGAAAAGCGGATAATAAATACATAGAAGAGTTGATAGGACAGATGGATCCACCCTTCATCCATGATATTATGGGAGCAG
AGGTGCCACAGAAATTCAAGATACCAACCTTCCAGCAGTACGACGGGATACCCCGGAAATCTATAGGATCATTTAAGGAGCTGGTATGGGCTTTTGTTACGCAGTTCTTA
GAGGGTTGGAATCGATGCAAGCCACAAATTAATCTACTGAAAATCAAGCAAGGACCAAGGGAGAGCTGGAGGGATTACATCACAAGATTCAGTAACGAGGTTTTGCAGGT
GGAAGGCTATGACGATGGAGTTGCACTAACCGCTATGATTTCAGAGAGGACAGAAGCGGAAGACGTCAGGCAGATGAGGGTGGCCGAAGCCGACAGGAGCATTCCTCAGG
TCGAGGCTGGACAAAGGGAAACTTTGAAAAATGTATCCCACTGTCCGTCCCTCTCAAATAGGTTTTGGCAGCGCTTGGAGTTCACAGAAAGGGAGGCTGCAGGAATCCAT
CATCCACACAATGATGCATTGGTGGTAACCCTAACTATTGCCAATCCAAGGGTTCACCAAATCTTAGTTGATGGAGGGAGCTCCGCTGATGTACTCTCTACAGTAGCGCT
CGAGGCCATGAAGTTAGGGAGGGATCGTTTGAGATCGAACCTCACACTACTGGTTGGGTTCGAGGGAGAAAAGGTGAGTCTAGAGGGAAGCATTGAGCTACCAGTAACTT
TTGGTGAAGGGCAGCAGTCGGTTACGAAGATGATCAATTTCCTGGTGGTGGATTGCACCTCGGCGTATAACGCCATATTGGGAAGAGCAACCTTGCATGAACTCAAAGCA
ATTGCCTCAACCTATTACCAAGTCTTGAAATTTCCAACAGAAAAAGGCGTAGGAGTGGTGCGCGTCGAGCAGAAAATATCCAGGGAATGTTACTTCACAGAACTCAAGGA
CACCAAAAGGAAGGCCCGAACAGCTCCCTGCTCGGACAATGGTCGAGGTCGAGGCCGAGCACTCCCTAGCACGAAGAATCCAGAGACTCCCCTCCCAATGGAGCACTTGG
CTACCTTCTAA
mRNA sequence
ATGTCAAGTATGCATGTTAAAGTCAGAGGCAGGAAAAAGTTAAGGGCTCGAGCAGGATCCAAGGTCGAGCAGGGCCAAGGAGGACGTGAGCAGGAGTTGTCAAGGTGGCT
GAAAGAGGAGGATAGCCCTTATGACTCGTATAGAAAAGCGGATAATAAATACATAGAAGAGTTGATAGGACAGATGGATCCACCCTTCATCCATGATATTATGGGAGCAG
AGGTGCCACAGAAATTCAAGATACCAACCTTCCAGCAGTACGACGGGATACCCCGGAAATCTATAGGATCATTTAAGGAGCTGGTATGGGCTTTTGTTACGCAGTTCTTA
GAGGGTTGGAATCGATGCAAGCCACAAATTAATCTACTGAAAATCAAGCAAGGACCAAGGGAGAGCTGGAGGGATTACATCACAAGATTCAGTAACGAGGTTTTGCAGGT
GGAAGGCTATGACGATGGAGTTGCACTAACCGCTATGATTTCAGAGAGGACAGAAGCGGAAGACGTCAGGCAGATGAGGGTGGCCGAAGCCGACAGGAGCATTCCTCAGG
TCGAGGCTGGACAAAGGGAAACTTTGAAAAATGTATCCCACTGTCCGTCCCTCTCAAATAGGTTTTGGCAGCGCTTGGAGTTCACAGAAAGGGAGGCTGCAGGAATCCAT
CATCCACACAATGATGCATTGGTGGTAACCCTAACTATTGCCAATCCAAGGGTTCACCAAATCTTAGTTGATGGAGGGAGCTCCGCTGATGTACTCTCTACAGTAGCGCT
CGAGGCCATGAAGTTAGGGAGGGATCGTTTGAGATCGAACCTCACACTACTGGTTGGGTTCGAGGGAGAAAAGGTGAGTCTAGAGGGAAGCATTGAGCTACCAGTAACTT
TTGGTGAAGGGCAGCAGTCGGTTACGAAGATGATCAATTTCCTGGTGGTGGATTGCACCTCGGCGTATAACGCCATATTGGGAAGAGCAACCTTGCATGAACTCAAAGCA
ATTGCCTCAACCTATTACCAAGTCTTGAAATTTCCAACAGAAAAAGGCGTAGGAGTGGTGCGCGTCGAGCAGAAAATATCCAGGGAATGTTACTTCACAGAACTCAAGGA
CACCAAAAGGAAGGCCCGAACAGCTCCCTGCTCGGACAATGGTCGAGGTCGAGGCCGAGCACTCCCTAGCACGAAGAATCCAGAGACTCCCCTCCCAATGGAGCACTTGG
CTACCTTCTAA
Protein sequence
MSSMHVKVRGRKKLRARAGSKVEQGQGGREQELSRWLKEEDSPYDSYRKADNKYIEELIGQMDPPFIHDIMGAEVPQKFKIPTFQQYDGIPRKSIGSFKELVWAFVTQFL
EGWNRCKPQINLLKIKQGPRESWRDYITRFSNEVLQVEGYDDGVALTAMISERTEAEDVRQMRVAEADRSIPQVEAGQRETLKNVSHCPSLSNRFWQRLEFTEREAAGIH
HPHNDALVVTLTIANPRVHQILVDGGSSADVLSTVALEAMKLGRDRLRSNLTLLVGFEGEKVSLEGSIELPVTFGEGQQSVTKMINFLVVDCTSAYNAILGRATLHELKA
IASTYYQVLKFPTEKGVGVVRVEQKISRECYFTELKDTKRKARTAPCSDNGRGRGRALPSTKNPETPLPMEHLATF
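
The CDS and protein records above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the helper name `translate` is hypothetical, and only the first 24 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt of the CDS listed above.
cds_prefix = "ATGTCAAGTATGCATGTTAAAGTC"
print(translate(cds_prefix))  # MSSMHVKV, the first eight residues of the protein
```

Applying the same function to the full CDS (with line breaks removed) should reproduce the protein sequence, since the CDS ends in the stop codon TAA.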