; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005413 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005413
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationscaffold6:34736474..34742736
RNA-Seq ExpressionSpg005413
SyntenySpg005413
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]9.3e-1530.27Show/hide
Query:  SWSWWMLNSKPEDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQ--------IHNQIDNQ--LGQLKRISGKYQFGPKSSLTESLPSPSRWSPPPTNCWKLN
        SW+W +     E++ +  +I WQIW  RN+ +         Q        I++ ID    + Q +R      + P+     ++    RWS PPTNCWKLN
Subjt:  SWSWWMLNSKPEDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQ--------IHNQIDNQ--LGQLKRISGKYQFGPKSSLTESLPSPSRWSPPPTNCWKLN

Query:  ADASWNDRLMKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLS
         DASW++    G +GW++ D  G  + AG   + +   I  LE + I+ GL+ I  N   +  + LES++++VI  +  +  DL+
Subjt:  ADASWNDRLMKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLS

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]3.7e-1926.98Show/hide
Query:  IRIRKARESTVHVIWECKEVRNIGLLYLPNSEILFSLNKNGWASKDYWSVTEHACFRSKIECLKCFRSSIASACVICTPSWSWWMLNSKPEDLNSVAIIM
        +  RK  E+T H++WECK +++I +   P     F +++  W +K+Y                                 W W M  +  E+     II 
Subjt:  IRIRKARESTVHVIWECKEVRNIGLLYLPNSEILFSLNKNGWASKDYWSVTEHACFRSKIECLKCFRSSIASACVICTPSWSWWMLNSKPEDLNSVAIIM

Query:  WQIWNGRNQRLIDPRKFHSNQIHNQID----NQLGQLKRISGKYQ-FGPKSSLTESLPSPSRWSPPPTNCWKLNADASWNDRLMKGDLGWVVRDFGGSPI
         QIW  RN+ +       +  I   ID    N  GQ   +  K + F P   + ++  + +RW PP +N WKLN DA+W        +GW++RD  G  I
Subjt:  WQIWNGRNQRLIDPRKFHSNQIHNQID----NQLGQLKRISGKYQ-FGPKSSLTESLPSPSRWSPPPTNCWKLNADASWNDRLMKGDLGWVVRDFGGSPI

Query:  YAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCIN
          G +++     I  LE +AI EGL+ I   +   ++  LES++L+ I  ++
Subjt:  YAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCIN

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]5.5e-2334.8Show/hide
Query:  EDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLT---ESLPSPSRWSPPPTNCWKLNADASWNDRLMKGDLGWVV
        EDL+ + I  W IWN RN  +           H+     + QL +   +  +  ++SL+   ++L +  +W PPP + W LNADASW+D   +G +GW++
Subjt:  EDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLT---ESLPSPSRWSPPPTNCWKLNADASWNDRLMKGDLGWVV

Query:  RDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHCPREQNHIAH
        R + G  + AG + VE    +K LEA AILEGL+ +  N G    L +E+++ +V   +N K EDL+++  ++E I+ L        F    RE N  AH
Subjt:  RDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHCPREQNHIAH

Query:  SLAQ
        SLAQ
Subjt:  SLAQ

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]2.9e-1630.05Show/hide
Query:  EDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQID----NQLGQLKRISGKYQFGPKSSLTESL--PSPSRWSPPPTNCWKLNADASWNDRLMKGDLG
        E+     II WQIW  RN+ +       +  I   ID    N  G+   + GK        L   +   + +RW PP +N WKLN DA+W      G +G
Subjt:  EDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQID----NQLGQLKRISGKYQFGPKSSLTESL--PSPSRWSPPPTNCWKLNADASWNDRLMKGDLG

Query:  WVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLN------LLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHC
        W++RD  G  I A  +++     I  LE +AI EGL+ I   +   +       + LES++L+ I  ++ + +D +E   +LE I  +   +     +H 
Subjt:  WVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLN------LLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHC

Query:  PREQNHIAHSLAQ
         RE N +AH LA+
Subjt:  PREQNHIAHSLAQ

XP_024041966.1 uncharacterized protein LOC112099096 [Citrus clementina]6.1e-1430.18Show/hide
Query:  WMLNSKPE-----DLNSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLTESLPSPSRWSPPPTNCWKLNADASWNDRL
        W+L   P+     +   VA ++W IWN RN+ L + ++ +  ++    +  +   K+I    Q         +     +WSPPP    K+N DA+ +   
Subjt:  WMLNSKPE-----DLNSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLTESLPSPSRWSPPPTNCWKLNADASWNDRL

Query:  MKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNL---LLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVF
            LG VVRD  G+   A +K +   G++   EA A+  GL+       EK N+   + ES++L+VID IN K+  L+E   ++  I        +   
Subjt:  MKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNL---LLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVF

Query:  KHCPREQNHIAHSLAQVTHLQK
        +HCPR+ N+ AHSLA++  LQK
Subjt:  KHCPREQNHIAHSLAQVTHLQK

TrEMBL top hitse value%identityAlignment
A0A6J1CQG0 uncharacterized protein LOC1110132164.5e-1530.27Show/hide
Query:  SWSWWMLNSKPEDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQ--------IHNQIDNQ--LGQLKRISGKYQFGPKSSLTESLPSPSRWSPPPTNCWKLN
        SW+W +     E++ +  +I WQIW  RN+ +         Q        I++ ID    + Q +R      + P+     ++    RWS PPTNCWKLN
Subjt:  SWSWWMLNSKPEDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQ--------IHNQIDNQ--LGQLKRISGKYQFGPKSSLTESLPSPSRWSPPPTNCWKLN

Query:  ADASWNDRLMKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLS
         DASW++    G +GW++ D  G  + AG   + +   I  LE + I+ GL+ I  N   +  + LES++++VI  +  +  DL+
Subjt:  ADASWNDRLMKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLS

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X11.8e-1926.98Show/hide
Query:  IRIRKARESTVHVIWECKEVRNIGLLYLPNSEILFSLNKNGWASKDYWSVTEHACFRSKIECLKCFRSSIASACVICTPSWSWWMLNSKPEDLNSVAIIM
        +  RK  E+T H++WECK +++I +   P     F +++  W +K+Y                                 W W M  +  E+     II 
Subjt:  IRIRKARESTVHVIWECKEVRNIGLLYLPNSEILFSLNKNGWASKDYWSVTEHACFRSKIECLKCFRSSIASACVICTPSWSWWMLNSKPEDLNSVAIIM

Query:  WQIWNGRNQRLIDPRKFHSNQIHNQID----NQLGQLKRISGKYQ-FGPKSSLTESLPSPSRWSPPPTNCWKLNADASWNDRLMKGDLGWVVRDFGGSPI
         QIW  RN+ +       +  I   ID    N  GQ   +  K + F P   + ++  + +RW PP +N WKLN DA+W        +GW++RD  G  I
Subjt:  WQIWNGRNQRLIDPRKFHSNQIHNQID----NQLGQLKRISGKYQ-FGPKSSLTESLPSPSRWSPPPTNCWKLNADASWNDRLMKGDLGWVVRDFGGSPI

Query:  YAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCIN
          G +++     I  LE +AI EGL+ I   +   ++  LES++L+ I  ++
Subjt:  YAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCIN

A0A6J1DNV9 uncharacterized protein LOC1110224032.6e-2334.8Show/hide
Query:  EDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLT---ESLPSPSRWSPPPTNCWKLNADASWNDRLMKGDLGWVV
        EDL+ + I  W IWN RN  +           H+     + QL +   +  +  ++SL+   ++L +  +W PPP + W LNADASW+D   +G +GW++
Subjt:  EDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLT---ESLPSPSRWSPPPTNCWKLNADASWNDRLMKGDLGWVV

Query:  RDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHCPREQNHIAH
        R + G  + AG + VE    +K LEA AILEGL+ +  N G    L +E+++ +V   +N K EDL+++  ++E I+ L        F    RE N  AH
Subjt:  RDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHCPREQNHIAH

Query:  SLAQ
        SLAQ
Subjt:  SLAQ

A0A6J1DSV1 uncharacterized protein LOC1110236081.4e-1630.05Show/hide
Query:  EDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQID----NQLGQLKRISGKYQFGPKSSLTESL--PSPSRWSPPPTNCWKLNADASWNDRLMKGDLG
        E+     II WQIW  RN+ +       +  I   ID    N  G+   + GK        L   +   + +RW PP +N WKLN DA+W      G +G
Subjt:  EDLNSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQID----NQLGQLKRISGKYQFGPKSSLTESL--PSPSRWSPPPTNCWKLNADASWNDRLMKGDLG

Query:  WVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLN------LLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHC
        W++RD  G  I A  +++     I  LE +AI EGL+ I   +   +       + LES++L+ I  ++ + +D +E   +LE I  +   +     +H 
Subjt:  WVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLN------LLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHC

Query:  PREQNHIAHSLAQ
         RE N +AH LA+
Subjt:  PREQNHIAHSLAQ

A0A803L9N8 Uncharacterized protein2.7e-1231.68Show/hide
Query:  IIMWQIWNGRNQRL-----IDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLTESLPSPSRWSPPPTNCWKLNADASWN-DRLMKGDLGWVVRDFG
        + +W+IW  RN+ L     +DP    S  + N  D  +    R        P +S+  S      W PP ++ +KLN DAS + DR  +G LG VVRD  
Subjt:  IIMWQIWNGRNQRL-----IDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLTESLPSPSRWSPPPTNCWKLNADASWN-DRLMKGDLGWVVRDFG

Query:  GSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHCPREQNHIAHSLAQ
        G  +    K +   G I ++EA+A+L G++ +      KL   + S+ LQVI+ +NG   + S ++ I+  I++ A       F  CPR  N +AHS+A 
Subjt:  GSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHCPREQNHIAHSLAQ

Query:  VT
        ++
Subjt:  VT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27870.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.6e-0430.11Show/hide
Query:  RWSPPPTNCWKLNADASWNDRLMKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLK-CIHGNYGEKLNLLLESNALQVIDCING
        RW  P     K N D S+ +  +K   GWVVRD  GS + AG  +  K+    + E  A++  ++ C    Y     +  E +   + D ING
Subjt:  RWSPPPTNCWKLNADASWNDRLMKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLK-CIHGNYGEKLNLLLESNALQVIDCING

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.3e-0821.82Show/hide
Query:  WWMLNSKPEDL------NSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQ---LKRISGKYQFGPKSSLTESLPSPSRWSPPPTNCWKLNADAS
        +W+LN + E        N V  ++W++W  RN+ +   +++ + ++  +      +    + + GK   GP+     S+    +W  PP    K N DA+
Subjt:  WWMLNSKPEDL------NSVAIIMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQ---LKRISGKYQFGPKSSLTESLPSPSRWSPPPTNCWKLNADAS

Query:  WNDRLMKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEG-LKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLSES-KNILEAIVALAPKLG
        W     +  +GW++R+  G  ++ G + + +   + + E  A+    L     NY     ++ ES+A  +++ +N  ++D   + +  LE I  L     
Subjt:  WNDRLMKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEG-LKCIHGNYGEKLNLLLESNALQVIDCINGKAEDLSES-KNILEAIVALAPKLG

Query:  SSVFKHCPREQNHIAHSLAQ
           F+  PR  N +A  +A+
Subjt:  SSVFKHCPREQNHIAHSLAQ

AT3G09510.1 Ribonuclease H-like superfamily protein4.5e-0726.7Show/hide
Query:  IMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLTES---LPSPSR--------WSPPPTNCWKLNADASWNDRLMKGDLGWVV
        ++W+IW  RN           N + N+      +   +S K +     + T+S    PSP+R        W  PP    K N DA ++ + ++   GW++
Subjt:  IMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLTES---LPSPSR--------WSPPPTNCWKLNADASWNDRLMKGDLGWVV

Query:  RDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLK--CIHGNYGEKLNLLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHCPREQNHI
        R+  G+PI  G   +       + E  A+L  L+   I G       + +E +   +I+ ING +   S   N LE I   A K  S  F    R+ N +
Subjt:  RDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLK--CIHGNYGEKLNLLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHCPREQNHI

Query:  AHSLAQ
        AH LA+
Subjt:  AHSLAQ

AT4G29090.1 Ribonuclease H-like superfamily protein1.9e-1322Show/hide
Query:  LQATPFLIALQQRRTPPFDLPRYNSEIR-NVVVGLPNLEWEKNSREKILPRVL-----EL----REYESALWWRYLNPRDTPARMSPTWTPWINTSVSNT
        L + P   AL+ +R PP +    +S ++ + ++     EW K+  E + P V      EL    R    +  W Y +  D   +       W+ T + N 
Subjt:  LQATPFLIALQQRRTPPFDLPRYNSEIR-NVVVGLPNLEWEKNSREKILPRVL-----EL----REYESALWWRYLNPRDTPARMSPTWTPWINTSVSNT

Query:  K---RAVSHSVI-----RIRKARES--TVHVIWECKEVRNIGLLYLPNSEILFSLNKNGWASKDYWSVTEHACFRSKIECLKCFRSSIASACVICTPSWS
        +   + VS   +     +I K++ S    H +W+C          L NS  +         SK      E AC R    C  C + ++      CT +  
Subjt:  K---RAVSHSVI-----RIRKARES--TVHVIWECKEVRNIGLLYLPNSEILFSLNKNGWASKDYWSVTEHACFRSKIECLKCFRSSIASACVICTPSWS

Query:  WWMLNSKPEDLNS-----------------------------VAIIMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLTESLP
         W ++S P  L                               V  ++W++W  RN+ +   R+F++ ++  + ++ L + +  +     G K  +  S  
Subjt:  WWMLNSKPEDLNS-----------------------------VAIIMWQIWNGRNQRLIDPRKFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLTESLP

Query:  SPSRWSPPPTNCWKLNADASWNDRLMKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAI---LEGLKCIHGNYGEKLNLLLESNALQVIDCINGKA
        S  RW PPP    K N DA+WN    +  +GWV+R+  G   + G + + K+ ++ + E  A+   +  L     NY     ++ ES++  +I+ +N   
Subjt:  SPSRWSPPPTNCWKLNADASWNDRLMKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAI---LEGLKCIHGNYGEKLNLLLESNALQVIDCINGKA

Query:  EDLSESKNILEAIVALAPKLGSSVFKHCPREQNHIAHSLAQ
        E     K  ++ +  L  +     F   PRE N +A  +A+
Subjt:  EDLSESKNILEAIVALAPKLGSSVFKHCPREQNHIAHSLAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTCCTGCGTCCTTCTCTTCAGGCAACGCCTTTCTTGATTGCTCTGCAACAACGAAGAACACCACCATTTGATCTTCCCCGCTATAACTCTGAGATTAGGAACGT
AGTTGTGGGACTACCTAATCTTGAATGGGAAAAAAACTCTAGAGAGAAAATTCTCCCTAGAGTTCTTGAGTTGAGAGAATATGAGAGTGCCTTATGGTGGCGGTACCTAA
ACCCTAGGGATACCCCCGCTCGCATGTCTCCTACATGGACGCCTTGGATTAATACGTCTGTATCGAATACAAAGCGGGCCGTATCACATAGTGTTATCAGGATAAGGAAA
GCAAGGGAGTCTACAGTGCACGTGATCTGGGAGTGTAAAGAGGTGAGGAACATTGGGCTGCTCTATCTCCCTAATTCTGAAATCCTTTTCTCTCTTAACAAGAATGGTTG
GGCGAGCAAAGACTATTGGAGTGTAACCGAACATGCATGCTTTAGGAGTAAAATTGAATGTCTTAAGTGCTTTCGATCTTCAATTGCTTCCGCATGTGTAATATGTACAC
CTTCATGGAGCTGGTGGATGCTCAATTCTAAGCCGGAGGACCTGAATTCGGTGGCGATTATAATGTGGCAGATTTGGAACGGAAGGAATCAGAGGCTTATCGACCCAAGG
AAATTTCACAGCAACCAGATTCACAATCAAATTGACAATCAATTAGGGCAGCTGAAAAGGATCTCAGGAAAGTACCAGTTCGGTCCCAAGTCGTCGCTGACAGAGAGCCT
CCCAAGTCCTTCCAGATGGTCTCCCCCGCCGACCAATTGCTGGAAGCTGAACGCTGATGCTTCCTGGAACGACAGATTGATGAAGGGCGACCTCGGTTGGGTGGTTCGCG
ACTTTGGAGGATCTCCGATCTATGCGGGAATGAAAGTGGTGGAGAAAATTGGGACGATTAAAGATCTTGAAGCATTGGCAATCCTCGAGGGGTTGAAATGCATCCATGGA
AATTACGGTGAAAAGCTTAACCTGTTACTTGAGTCCAACGCGTTGCAGGTGATCGACTGCATCAATGGCAAGGCGGAAGACCTCTCGGAAAGCAAAAACATTTTGGAAGC
CATAGTCGCGTTGGCTCCGAAGCTTGGTAGCTCTGTCTTCAAGCATTGCCCCCGTGAGCAAAATCACATCGCCCACTCTTTAGCTCAGGTTACTCATTTACAGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTCCTGCGTCCTTCTCTTCAGGCAACGCCTTTCTTGATTGCTCTGCAACAACGAAGAACACCACCATTTGATCTTCCCCGCTATAACTCTGAGATTAGGAACGT
AGTTGTGGGACTACCTAATCTTGAATGGGAAAAAAACTCTAGAGAGAAAATTCTCCCTAGAGTTCTTGAGTTGAGAGAATATGAGAGTGCCTTATGGTGGCGGTACCTAA
ACCCTAGGGATACCCCCGCTCGCATGTCTCCTACATGGACGCCTTGGATTAATACGTCTGTATCGAATACAAAGCGGGCCGTATCACATAGTGTTATCAGGATAAGGAAA
GCAAGGGAGTCTACAGTGCACGTGATCTGGGAGTGTAAAGAGGTGAGGAACATTGGGCTGCTCTATCTCCCTAATTCTGAAATCCTTTTCTCTCTTAACAAGAATGGTTG
GGCGAGCAAAGACTATTGGAGTGTAACCGAACATGCATGCTTTAGGAGTAAAATTGAATGTCTTAAGTGCTTTCGATCTTCAATTGCTTCCGCATGTGTAATATGTACAC
CTTCATGGAGCTGGTGGATGCTCAATTCTAAGCCGGAGGACCTGAATTCGGTGGCGATTATAATGTGGCAGATTTGGAACGGAAGGAATCAGAGGCTTATCGACCCAAGG
AAATTTCACAGCAACCAGATTCACAATCAAATTGACAATCAATTAGGGCAGCTGAAAAGGATCTCAGGAAAGTACCAGTTCGGTCCCAAGTCGTCGCTGACAGAGAGCCT
CCCAAGTCCTTCCAGATGGTCTCCCCCGCCGACCAATTGCTGGAAGCTGAACGCTGATGCTTCCTGGAACGACAGATTGATGAAGGGCGACCTCGGTTGGGTGGTTCGCG
ACTTTGGAGGATCTCCGATCTATGCGGGAATGAAAGTGGTGGAGAAAATTGGGACGATTAAAGATCTTGAAGCATTGGCAATCCTCGAGGGGTTGAAATGCATCCATGGA
AATTACGGTGAAAAGCTTAACCTGTTACTTGAGTCCAACGCGTTGCAGGTGATCGACTGCATCAATGGCAAGGCGGAAGACCTCTCGGAAAGCAAAAACATTTTGGAAGC
CATAGTCGCGTTGGCTCCGAAGCTTGGTAGCTCTGTCTTCAAGCATTGCCCCCGTGAGCAAAATCACATCGCCCACTCTTTAGCTCAGGTTACTCATTTACAGAAATGA
Protein sequenceShow/hide protein sequence
MDFLRPSLQATPFLIALQQRRTPPFDLPRYNSEIRNVVVGLPNLEWEKNSREKILPRVLELREYESALWWRYLNPRDTPARMSPTWTPWINTSVSNTKRAVSHSVIRIRK
ARESTVHVIWECKEVRNIGLLYLPNSEILFSLNKNGWASKDYWSVTEHACFRSKIECLKCFRSSIASACVICTPSWSWWMLNSKPEDLNSVAIIMWQIWNGRNQRLIDPR
KFHSNQIHNQIDNQLGQLKRISGKYQFGPKSSLTESLPSPSRWSPPPTNCWKLNADASWNDRLMKGDLGWVVRDFGGSPIYAGMKVVEKIGTIKDLEALAILEGLKCIHG
NYGEKLNLLLESNALQVIDCINGKAEDLSESKNILEAIVALAPKLGSSVFKHCPREQNHIAHSLAQVTHLQK