; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035626 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035626
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr3:25804881..25806410
RNA-Seq ExpressionLag0035626
SyntenyLag0035626
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150858.1 uncharacterized protein LOC111018902 [Momordica charantia]7.1e-6549.81Show/hide
Query:  PLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFV-----GQDRGKRPI-----------PADQESTE
        PLEQV   I+D  LLK PE++++ P +R K +YC FH DH+H T++C  L++E+  LI+ GY KE+V      Q++ +R             P   E  E
Subjt:  PLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFV-----GQDRGKRPI-----------PADQESTE

Query:  KEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMINFLVVDCV
         E   I HPHND LV+ LK+AN KVH IL+DGGSS D++S   +  M LG   L+ S  PLVGFGGE V P G IEL VTF  GP ++T+M++FLVVD  
Subjt:  KEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMINFLVVDCV

Query:  STYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR
        S+ NAILGRPT+H LKA+ S YHQ +KFPT  G+GE++GEQ++SRECY+  +K  D+
Subjt:  STYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR

XP_022155873.1 uncharacterized protein LOC111022886 [Momordica charantia]9.9e-6749.24Show/hide
Query:  TPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFV--------GQDRGKRPI----------P
        TP T  LEQVL  I+D  LLK PE++++ P +R+K +Y +FH DH H T++C  L++E+  LIR GY KE+V        G++    P+          P
Subjt:  TPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFV--------GQDRGKRPI----------P

Query:  ADQESTEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMIN
        A  E +E EA  + HPHND LV+ LK+AN KVHRIL+DGGSST ++S   + AM LG   L+ S  PLVGFGGE+V P G IEL V F  GP ++ +M++
Subjt:  ADQESTEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMIN

Query:  FLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR
        FLVV   S+YN ILGRP +H LK + STYHQ +KFPT  G+GE++GEQ++ RECY   +K  DR
Subjt:  FLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR

XP_022156748.1 uncharacterized protein LOC111023587 [Momordica charantia]6.0e-6448.31Show/hide
Query:  YTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFVGQDRGKRPI------------------
        YT  T PLEQVL  I+D  LLK PE++++   +R+K +YC+FH DH+H T++C  L++E+  LIR GY KE    ++ K  +                  
Subjt:  YTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFVGQDRGKRPI------------------

Query:  --PADQESTEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTR
          P   E +E EA  + H HND LV+ LK+ANVKVHRIL+DGGSS D++S   + AM L    L+ S  PLVGFGGE+V   G IEL VTF  GP  VT+
Subjt:  --PADQESTEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTR

Query:  MINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR
        M++FLVV+  S+YN ILGR T+H LK + STYHQ +KFPT  GV E++GEQ++SRECY+  ++  DR
Subjt:  MINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR

XP_030936700.1 uncharacterized protein LOC115961955 [Quercus lobata]2.3e-6338.08Show/hide
Query:  LTAVISGLQDERLLNSIGESQLQTYVEFMTQAQRYISTEELLKSKQEERERESRGVFVSNRHREDRGKGHRVEDRGRSRHEHSSAN-GRGRLEAKETRDR
        L A  +G+  +  ++ + E + QT  E +  AQ +++ E+ + +K+ ++  +                     D   +RH     +  +GR E K+ RDR
Subjt:  LTAVISGLQDERLLNSIGESQLQTYVEFMTQAQRYISTEELLKSKQEERERESRGVFVSNRHREDRGKGHRVEDRGRSRHEHSSAN-GRGRLEAKETRDR

Query:  -AELKAKFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFVGQD------RGK-----
         A    +  +YTPL  PL+QVL  I+D   LK PEK++ DP +RN+NKYC FH DH H T EC  L+ +I  LIR+G  + F+G+D      +GK     
Subjt:  -AELKAKFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFVGQD------RGK-----

Query:  -------------------------------------RPIPADQES-------TEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFD
                                             RP P D+++       TE+EA  IHHPH+D +V+ L +A+    R+L+D GSS D+L    F 
Subjt:  -------------------------------------RPIPADQES-------TEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFD

Query:  AMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISR
         MKLG DRLRP  +PLVGFGG KV P G + L V     P  VT+ ++FLVVDC S+YNAI+GRPTL+  KAVTSTYH  +KFPT  GVG+V+G+Q  +R
Subjt:  AMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISR

Query:  ECYFMVL
        ECY  +L
Subjt:  ECYFMVL

XP_030970463.1 uncharacterized protein LOC115990823 [Quercus lobata]5.5e-6536.95Show/hide
Query:  LTAVISGLQDERLLNSIGESQLQTYVEFMTQAQRYISTEELLKSKQEERERESRGVFVSNRHREDRGKGHRVEDRGRSRHEHSSANGRGRLEAKETRD--
        L A  +G+  +  ++ + E + Q+  E +  AQ +++ E+ + +K+ +R                     R++      HE  +   +GR + +  R+  
Subjt:  LTAVISGLQDERLLNSIGESQLQTYVEFMTQAQRYISTEELLKSKQEERERESRGVFVSNRHREDRGKGHRVEDRGRSRHEHSSANGRGRLEAKETRD--

Query:  RAELKAKFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFVGQD--------------
        +  L  +  +YTPL APLEQVL  I+D   LK PEKL+ DP +RN+NKYC FH DH H T EC  L+ +I  LIR+G  + F+G+D              
Subjt:  RAELKAKFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFVGQD--------------

Query:  -------------------------------------RGKRPIPADQES---TEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDA
                                             R  R +  D+++   T+++A  IHHPH+D LV++L +AN    R+L+D GSSTD+L    F  
Subjt:  -------------------------------------RGKRPIPADQES---TEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDA

Query:  MKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRE
        M+LG D+LRP  +PLVGFGG KV P G+I L V     P  +T+ +NFLVVDC S+YNAI+GRPTL+  KAVTSTYH  +KFPT+ GVG+V+G+Q  ++E
Subjt:  MKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRE

Query:  CYFMVL
        CY  +L
Subjt:  CYFMVL

TrEMBL top hitse value%identityAlignment
A0A2N9I9Q7 Ribonuclease H3.0e-6137.9Show/hide
Query:  VALTAVISGLQDERLLNSIGESQLQTYVEFMTQAQRYISTEELLKSKQEERERESRGVFVSNRHREDRGKGHRVEDRGRSRHEHSSANGRGRLEAKETRD
        V LTA ISGLQ    L S+ +    T  E M +AQR+++ EE L ++     ++ +G                  DR    HE      R R   +E R 
Subjt:  VALTAVISGLQDERLLNSIGESQLQTYVEFMTQAQRYISTEELLKSKQEERERESRGVFVSNRHREDRGKGHRVEDRGRSRHEHSSANGRGRLEAKETRD

Query:  RAELKAKFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFV--GQDRGKRPIPADQES
              +F+ +TPL AP++ +   I++   LK P KL +DP +R ++KYC FH DH H T +C  L+ +I  LI++G  + FV  GQ  G+  + + Q  
Subjt:  RAELKAKFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFV--GQDRGKRPIPADQES

Query:  ------------------TEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVT
                          TE++A  + HPH+D LVV L++A      +LID GSS D +    F  MK+G+D+L+P  TPLVGF G  V P G I L +T
Subjt:  ------------------TEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVT

Query:  FREGPHTVTRMINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVL
            P   T+ +NFLVVDC S YN I+GRPTL+ L+AVTSTYH +++FPT+  +GE++G+Q ++RECY  ++
Subjt:  FREGPHTVTRMINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVL

A0A6J1DAK4 uncharacterized protein LOC1110189023.5e-6549.81Show/hide
Query:  PLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFV-----GQDRGKRPI-----------PADQESTE
        PLEQV   I+D  LLK PE++++ P +R K +YC FH DH+H T++C  L++E+  LI+ GY KE+V      Q++ +R             P   E  E
Subjt:  PLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFV-----GQDRGKRPI-----------PADQESTE

Query:  KEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMINFLVVDCV
         E   I HPHND LV+ LK+AN KVH IL+DGGSS D++S   +  M LG   L+ S  PLVGFGGE V P G IEL VTF  GP ++T+M++FLVVD  
Subjt:  KEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMINFLVVDCV

Query:  STYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR
        S+ NAILGRPT+H LKA+ S YHQ +KFPT  G+GE++GEQ++SRECY+  +K  D+
Subjt:  STYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR

A0A6J1DP33 uncharacterized protein LOC1110228864.8e-6749.24Show/hide
Query:  TPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFV--------GQDRGKRPI----------P
        TP T  LEQVL  I+D  LLK PE++++ P +R+K +Y +FH DH H T++C  L++E+  LIR GY KE+V        G++    P+          P
Subjt:  TPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFV--------GQDRGKRPI----------P

Query:  ADQESTEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMIN
        A  E +E EA  + HPHND LV+ LK+AN KVHRIL+DGGSST ++S   + AM LG   L+ S  PLVGFGGE+V P G IEL V F  GP ++ +M++
Subjt:  ADQESTEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMIN

Query:  FLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR
        FLVV   S+YN ILGRP +H LK + STYHQ +KFPT  G+GE++GEQ++ RECY   +K  DR
Subjt:  FLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR

A0A6J1DRG9 uncharacterized protein LOC1110235872.9e-6448.31Show/hide
Query:  YTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFVGQDRGKRPI------------------
        YT  T PLEQVL  I+D  LLK PE++++   +R+K +YC+FH DH+H T++C  L++E+  LIR GY KE    ++ K  +                  
Subjt:  YTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFVGQDRGKRPI------------------

Query:  --PADQESTEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTR
          P   E +E EA  + H HND LV+ LK+ANVKVHRIL+DGGSS D++S   + AM L    L+ S  PLVGFGGE+V   G IEL VTF  GP  VT+
Subjt:  --PADQESTEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTR

Query:  MINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR
        M++FLVV+  S+YN ILGR T+H LK + STYHQ +KFPT  GV E++GEQ++SRECY+  ++  DR
Subjt:  MINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDR

A0A7J0FQE9 Uncharacterized protein1.1e-6038.07Show/hide
Query:  KNPRSTVALTAVISGLQDERLLNSIGESQLQTYVEFMTQAQRYISTEELLKSKQEERERESRGVFVSNRHREDRGKGHRVEDRGRSRHEHSSANGRGRLE
        ++P   V + A++ GL+   L +S+ ++  +T     ++A +YI+ EEL ++K+  R ++         H+       R + R  +R++      R    
Subjt:  KNPRSTVALTAVISGLQDERLLNSIGESQLQTYVEFMTQAQRYISTEELLKSKQEERERESRGVFVSNRHREDRGKGHRVEDRGRSRHEHSSANGRGRLE

Query:  AKETRDRAELKAKFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGY--HKEFVGQDRG----
        +   R R   +       PL AP+ QVL  I+    +K P K+++DP +RNKNKYC FH DH H T +C QL+++I  LI+ GY   K       G    
Subjt:  AKETRDRAELKAKFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGY--HKEFVGQDRG----

Query:  -----KRPIPADQES---TEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVT
               P  ADQ     +  +   +H PH+D LVV+  +AN  V RILID GSS D+L  + F+ MK+G D+L P  TPLV FGG K  P G I L +T
Subjt:  -----KRPIPADQES---TEKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVT

Query:  FREGPHTVTRMINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLK
            PH  T   +F+VVDC S YNAILGRPTL  +KA+TST+H  +KFPT  G+GEV+G+QK++R+C+   +K
Subjt:  FREGPHTVTRMINFLVVDCVSTYNAILGRPTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGGGAGGTGCCGCATAAATTTAAGGAGCTCGGAGCCGACAAAAACCCCAGATCAACAGTTGCCTTGACCGCTGTGATTTCAGGTCTGCAGGATGAAAGATTGCT
CAACTCGATTGGTGAGAGTCAGCTGCAAACATATGTGGAATTCATGACCCAAGCACAAAGATACATAAGCACCGAGGAATTGCTGAAATCCAAGCAGGAAGAGAGAGAGA
GAGAGAGCCGAGGAGTTTTTGTATCTAACCGGCATCGAGAAGACAGAGGGAAGGGGCACCGGGTCGAGGATAGAGGTCGAAGCCGACATGAGCACTCATCGGCCAATGGC
CGAGGTCGACTAGAGGCCAAGGAGACGCGGGACCGTGCAGAACTGAAAGCCAAGTTTGACAGGTATACACCACTAACAGCTCCACTTGAACAGGTCTTGGCCGCAATACA
AGATACAAATCTACTGAAACGTCCAGAAAAGCTGAGGTCAGACCCAGGCAGGAGAAACAAGAATAAATACTGCATGTTCCACGAAGACCACAACCACACTACCCGAGAAT
GCATACAGTTAAGGGATGAAATAGGAACTCTAATCCGAGAGGGCTACCACAAGGAATTCGTGGGACAGGACAGAGGGAAAAGACCAATACCAGCAGATCAAGAGTCTACA
GAGAAAGAGGCTGCATGGATACACCACCCACACAATGACACGTTGGTGGTAGCCCTAAAAGTTGCCAATGTCAAAGTGCATCGAATTTTAATTGATGGAGGAAGCTCAAC
CGATGTCCTCTCCACTGCCGAGTTCGATGCCATGAAGCTGGGAAGCGATCGCCTGAGGCCGAGCCTTACACCGTTGGTGGGATTTGGCGGAGAAAAAGTAAGCCCAAGGG
GAAGCATCGAACTACTAGTGACGTTTAGGGAAGGACCGCATACAGTTACAAGAATGATCAATTTTCTAGTGGTGGACTGCGTCTCAACATATAATGCCATTTTGGGGCGA
CCAACCTTGCATGGGCTTAAAGCTGTAACCTCAACTTACCACCAAGTCCTGAAGTTCCCAACCAAAGAAGGTGTAGGAGAGGTGCGTGGTGAACAGAAGATATCAAGAGA
GTGCTACTTTATGGTGCTCAAGAATATGGATAGGAAGGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGGGAGGTGCCGCATAAATTTAAGGAGCTCGGAGCCGACAAAAACCCCAGATCAACAGTTGCCTTGACCGCTGTGATTTCAGGTCTGCAGGATGAAAGATTGCT
CAACTCGATTGGTGAGAGTCAGCTGCAAACATATGTGGAATTCATGACCCAAGCACAAAGATACATAAGCACCGAGGAATTGCTGAAATCCAAGCAGGAAGAGAGAGAGA
GAGAGAGCCGAGGAGTTTTTGTATCTAACCGGCATCGAGAAGACAGAGGGAAGGGGCACCGGGTCGAGGATAGAGGTCGAAGCCGACATGAGCACTCATCGGCCAATGGC
CGAGGTCGACTAGAGGCCAAGGAGACGCGGGACCGTGCAGAACTGAAAGCCAAGTTTGACAGGTATACACCACTAACAGCTCCACTTGAACAGGTCTTGGCCGCAATACA
AGATACAAATCTACTGAAACGTCCAGAAAAGCTGAGGTCAGACCCAGGCAGGAGAAACAAGAATAAATACTGCATGTTCCACGAAGACCACAACCACACTACCCGAGAAT
GCATACAGTTAAGGGATGAAATAGGAACTCTAATCCGAGAGGGCTACCACAAGGAATTCGTGGGACAGGACAGAGGGAAAAGACCAATACCAGCAGATCAAGAGTCTACA
GAGAAAGAGGCTGCATGGATACACCACCCACACAATGACACGTTGGTGGTAGCCCTAAAAGTTGCCAATGTCAAAGTGCATCGAATTTTAATTGATGGAGGAAGCTCAAC
CGATGTCCTCTCCACTGCCGAGTTCGATGCCATGAAGCTGGGAAGCGATCGCCTGAGGCCGAGCCTTACACCGTTGGTGGGATTTGGCGGAGAAAAAGTAAGCCCAAGGG
GAAGCATCGAACTACTAGTGACGTTTAGGGAAGGACCGCATACAGTTACAAGAATGATCAATTTTCTAGTGGTGGACTGCGTCTCAACATATAATGCCATTTTGGGGCGA
CCAACCTTGCATGGGCTTAAAGCTGTAACCTCAACTTACCACCAAGTCCTGAAGTTCCCAACCAAAGAAGGTGTAGGAGAGGTGCGTGGTGAACAGAAGATATCAAGAGA
GTGCTACTTTATGGTGCTCAAGAATATGGATAGGAAGGCCTAA
Protein sequenceShow/hide protein sequence
MGGEVPHKFKELGADKNPRSTVALTAVISGLQDERLLNSIGESQLQTYVEFMTQAQRYISTEELLKSKQEERERESRGVFVSNRHREDRGKGHRVEDRGRSRHEHSSANG
RGRLEAKETRDRAELKAKFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPGRRNKNKYCMFHEDHNHTTRECIQLRDEIGTLIREGYHKEFVGQDRGKRPIPADQEST
EKEAAWIHHPHNDTLVVALKVANVKVHRILIDGGSSTDVLSTAEFDAMKLGSDRLRPSLTPLVGFGGEKVSPRGSIELLVTFREGPHTVTRMINFLVVDCVSTYNAILGR
PTLHGLKAVTSTYHQVLKFPTKEGVGEVRGEQKISRECYFMVLKNMDRKA