; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000212 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000212
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:1578374..1581088
RNA-Seq ExpressionLag0000212
SyntenyLag0000212
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4364303.1 hypothetical protein G4B88_028423 [Cannabis sativa]2.1e-2426.23Show/hide
Query:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE
        ++GY +A+        SSS+  +L  WWK  W++ +P KI+ F++RL               C++  I  +       E V+    L  CQ++K+A   E
Subjt:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE

Query:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM
        F   + + W  W+  N   F+    RP  + + A+ Y+  ++          + T  R+ +VW  P  G+ K+N DA+  SH +R   G +VRD  G+++
Subjt:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM

Query:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW
         +  F +        AEG+A +++L+   D G+    +E D   +   L    E+++  G                T H   R+GN AAH LA+++I   
Subjt:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW

Query:  SDEVW
         D  W
Subjt:  SDEVW

KAF4386115.1 hypothetical protein F8388_016367 [Cannabis sativa]1.2e-2425.71Show/hide
Query:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE
        ++GY +A+        SSS+  +L  WWK  W++ +P KI+ F++RL               C++  I          E V+    L  CQ++K+A   E
Subjt:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE

Query:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM
        F   + + W  W+  N   F+    RP  + + A+ Y+  ++          + T  R+ +VW  P  G+ K+N DA+  SH +R   G +VRD  G+++
Subjt:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM

Query:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW
         +  F +        AEG+A +++L+   D G+    +E D   +   L    E+++  G                T H   R+GN AAH LA+++I   
Subjt:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW

Query:  SDEVWMETCPRCLED
         D  W +   +   D
Subjt:  SDEVWMETCPRCLED

KAF4395712.1 hypothetical protein G4B88_013486 [Cannabis sativa]1.2e-2425.71Show/hide
Query:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE
        ++GY +A+        SSS+  +L  WWK  W++ +P KI+ F++RL               C++  I          E V+    L  CQ++K+A   E
Subjt:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE

Query:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM
        F   + + W  W+  N   F+    RP  + + A+ Y+  ++          + T  R+ +VW  P  G+ K+N DA+  SH +R   G +VRD  G+++
Subjt:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM

Query:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW
         +  F +        AEG+A +++L+   D G+    +E D   +   L    E+++  G                T H   R+GN AAH LA+++I   
Subjt:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW

Query:  SDEVWMETCPRCLED
         D  W +   +   D
Subjt:  SDEVWMETCPRCLED

XP_022140628.1 uncharacterized protein LOC111011237 [Momordica charantia]7.4e-3044.97Show/hide
Query:  LVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVMLSATFTKDNVREVDMAEGYAAVKSLELVT
        LVEWA  YV+ FRE   S+   GR T   E V+W  P    YK+N DASFL+      LGII+R+  GQVM SAT   +N++ VDMAE   AV+ L+L +
Subjt:  LVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVMLSATFTKDNVREVDMAEGYAAVKSLELVT

Query:  DMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQWSDEVWMETCP----RCLE
         +G+ P ILETDSSR+F + ++  ED+SE G                +F+FV REGN+AAH LAR A+      +WME  P     CLE
Subjt:  DMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQWSDEVWMETCP----RCLE

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]6.7e-3935.38Show/hide
Query:  QSGYKLA-QSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRLCLNR----SILREAG-------------------------FGEIL----ERV
        +SGYK+A  +    +  SSSS   ++ WW G WKM +P+KIKVFLWRLCL+R      L + G                         F E L    +  
Subjt:  QSGYKLA-QSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRLCLNR----SILREAG-------------------------FGEIL----ERV

Query:  RAGCCLLLCQDIKEAIGGEFEELVVLWWSMWSTWNKVRFQGADRP-----SGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDA
        +    L+L +  +     +FEEL V+ W +W+  N   F  + +        LVEWA  Y + FRE  +S+   GR T   E ++W+ P  G YK+N DA
Subjt:  RAGCCLLLCQDIKEAIGGEFEELVVLWWSMWSTWNKVRFQGADRP-----SGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDA

Query:  SFLSHLSRVDLGIIVRDPLGQVMLSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELGTFHFVHREGNQAAHRLA
        SFL+      LGII+ +  GQVM +AT   +N++ VDMAE  AAV+ L+L +++G+ PA+ +   +    +  +     S   +F+FV REGN+AAH LA
Subjt:  SFLSHLSRVDLGIIVRDPLGQVMLSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELGTFHFVHREGNQAAHRLA

Query:  RLAIAQWSDEVWMETCP----RCLE
        R A+      +WME  P     CLE
Subjt:  RLAIAQWSDEVWMETCP----RCLE

TrEMBL top hitse value%identityAlignment
A0A6J1CIF1 uncharacterized protein LOC1110112373.6e-3044.97Show/hide
Query:  LVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVMLSATFTKDNVREVDMAEGYAAVKSLELVT
        LVEWA  YV+ FRE   S+   GR T   E V+W  P    YK+N DASFL+      LGII+R+  GQVM SAT   +N++ VDMAE   AV+ L+L +
Subjt:  LVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVMLSATFTKDNVREVDMAEGYAAVKSLELVT

Query:  DMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQWSDEVWMETCP----RCLE
         +G+ P ILETDSSR+F + ++  ED+SE G                +F+FV REGN+AAH LAR A+      +WME  P     CLE
Subjt:  DMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQWSDEVWMETCP----RCLE

A0A6J1DAR4 uncharacterized protein LOC1110189543.3e-3935.38Show/hide
Query:  QSGYKLA-QSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRLCLNR----SILREAG-------------------------FGEIL----ERV
        +SGYK+A  +    +  SSSS   ++ WW G WKM +P+KIKVFLWRLCL+R      L + G                         F E L    +  
Subjt:  QSGYKLA-QSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRLCLNR----SILREAG-------------------------FGEIL----ERV

Query:  RAGCCLLLCQDIKEAIGGEFEELVVLWWSMWSTWNKVRFQGADRP-----SGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDA
        +    L+L +  +     +FEEL V+ W +W+  N   F  + +        LVEWA  Y + FRE  +S+   GR T   E ++W+ P  G YK+N DA
Subjt:  RAGCCLLLCQDIKEAIGGEFEELVVLWWSMWSTWNKVRFQGADRP-----SGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDA

Query:  SFLSHLSRVDLGIIVRDPLGQVMLSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELGTFHFVHREGNQAAHRLA
        SFL+      LGII+ +  GQVM +AT   +N++ VDMAE  AAV+ L+L +++G+ PA+ +   +    +  +     S   +F+FV REGN+AAH LA
Subjt:  SFLSHLSRVDLGIIVRDPLGQVMLSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELGTFHFVHREGNQAAHRLA

Query:  RLAIAQWSDEVWMETCP----RCLE
        R A+      +WME  P     CLE
Subjt:  RLAIAQWSDEVWMETCP----RCLE

A0A7J6F0S9 Uncharacterized protein1.0e-2426.23Show/hide
Query:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE
        ++GY +A+        SSS+  +L  WWK  W++ +P KI+ F++RL               C++  I  +       E V+    L  CQ++K+A   E
Subjt:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE

Query:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM
        F   + + W  W+  N   F+    RP  + + A+ Y+  ++          + T  R+ +VW  P  G+ K+N DA+  SH +R   G +VRD  G+++
Subjt:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM

Query:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW
         +  F +        AEG+A +++L+   D G+    +E D   +   L    E+++  G                T H   R+GN AAH LA+++I   
Subjt:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW

Query:  SDEVW
         D  W
Subjt:  SDEVW

A0A7J6GT92 Uncharacterized protein6.0e-2525.71Show/hide
Query:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE
        ++GY +A+        SSS+  +L  WWK  W++ +P KI+ F++RL               C++  I          E V+    L  CQ++K+A   E
Subjt:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE

Query:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM
        F   + + W  W+  N   F+    RP  + + A+ Y+  ++          + T  R+ +VW  P  G+ K+N DA+  SH +R   G +VRD  G+++
Subjt:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM

Query:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW
         +  F +        AEG+A +++L+   D G+    +E D   +   L    E+++  G                T H   R+GN AAH LA+++I   
Subjt:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW

Query:  SDEVWMETCPRCLED
         D  W +   +   D
Subjt:  SDEVWMETCPRCLED

A0A7J6HKE1 Uncharacterized protein6.0e-2525.71Show/hide
Query:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE
        ++GY +A+        SSS+  +L  WWK  W++ +P KI+ F++RL               C++  I          E V+    L  CQ++K+A   E
Subjt:  QSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRL---------------CLNRSILREAGFGEILERVRAGCCLLLCQDIKEAIGGE

Query:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM
        F   + + W  W+  N   F+    RP  + + A+ Y+  ++          + T  R+ +VW  P  G+ K+N DA+  SH +R   G +VRD  G+++
Subjt:  FEELVVLWWSMWSTWNKVRFQG-ADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVM

Query:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW
         +  F +        AEG+A +++L+   D G+    +E D   +   L    E+++  G                T H   R+GN AAH LA+++I   
Subjt:  LSATFTKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELG----------------TFHFVHREGNQAAHRLARLAIAQW

Query:  SDEVWMETCPRCLED
         D  W +   +   D
Subjt:  SDEVWMETCPRCLED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.5e-0725Show/hide
Query:  LWWSMWSTWNKVRFQGA--DRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVMLSATF
        L W +W + N++ F+G   D P  L    + +           +  G +      V W+AP   W K N DA++     R  +G I+R+  G V+     
Subjt:  LWWSMWSTWNKVRFQGA--DRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVMLSATF

Query:  TKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNRE---------REDISEL------GTFHFVHREGNQAAHRLARLAIA
             + V  AE  A   ++  ++       I E+D+  +  +LN +          EDI +L        F F  R GN+ A R+AR +I+
Subjt:  TKDNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNRE---------REDISEL------GTFHFVHREGNQAAHRLARLAIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGCTATCAATGCCACGGTGGCAGGACTGATGACGGCCTCGAGGGGTGGGACCAAACCCTGTTGGAACAACACTTTAATGCCGCAGAGTGGATACAAGTTGGCTCA
GTCAGTCACCCTGGCACGAGGGGCTTCATCTTCATCCCCTAACTCTCTCCAGGATTGGTGGAAGGGATGTTGGAAGATGACGCTCCCAAGTAAGATAAAGGTGTTCCTTT
GGAGATTGTGTCTGAACAGATCGATCCTACGAGAAGCGGGTTTTGGGGAGATCTTGGAGAGAGTCCGAGCAGGATGTTGTCTACTACTCTGTCAGGACATCAAAGAGGCG
ATAGGGGGAGAATTTGAGGAACTGGTGGTGTTATGGTGGTCGATGTGGTCGACCTGGAATAAAGTTCGTTTTCAAGGGGCTGATAGGCCGAGTGGGCTAGTGGAATGGGC
TAAGGGGTATGTGGTGGCTTTTCGTGAGGTTGGGAGGAGTAGTAGGGAAGTGGGTCGGGAGACGGTTGGTCGGGAGAGGGTAGTGTGGCGGGCCCCGAGGAATGGCTGGT
ATAAGGTGAATTTTGATGCCTCTTTCCTCTCTCATTTGTCCAGGGTCGACCTAGGGATAATTGTGCGGGATCCTTTGGGCCAAGTAATGTTGTCGGCGACTTTTACTAAG
GATAATGTGAGAGAGGTTGACATGGCTGAAGGATATGCAGCCGTGAAAAGTTTGGAGTTGGTGACAGATATGGGTTTGGGTCCAGCAATTCTTGAGACGGACTCGAGTAG
AGTATTCCAGATCCTTAACCGAGAACGTGAGGATATCTCTGAGCTAGGTACTTTTCATTTCGTTCACCGTGAAGGGAATCAGGCGGCGCACCGATTGGCGAGGTTAGCAA
TAGCCCAGTGGAGTGATGAGGTTTGGATGGAGACTTGTCCTAGGTGTCTGGAGGACATTGAAGGCCTAGACCTGCTTTATGGGGTTCATAAGTCTGTTGTCACTCCGATA
GGGGTAAGGGGGAAGATAGGTGGAGGATCCCATAAATCTGTATCTGGTCCATTAGAGGCTTCTTTAGGTGGTGAGCCTAAGGCCACAAACCTACATGGAACAATTCATAT
TTATCTAGGACGCAAGAGGATGGAGGTTTGGATAGCTAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCGCTATCAATGCCACGGTGGCAGGACTGATGACGGCCTCGAGGGGTGGGACCAAACCCTGTTGGAACAACACTTTAATGCCGCAGAGTGGATACAAGTTGGCTCA
GTCAGTCACCCTGGCACGAGGGGCTTCATCTTCATCCCCTAACTCTCTCCAGGATTGGTGGAAGGGATGTTGGAAGATGACGCTCCCAAGTAAGATAAAGGTGTTCCTTT
GGAGATTGTGTCTGAACAGATCGATCCTACGAGAAGCGGGTTTTGGGGAGATCTTGGAGAGAGTCCGAGCAGGATGTTGTCTACTACTCTGTCAGGACATCAAAGAGGCG
ATAGGGGGAGAATTTGAGGAACTGGTGGTGTTATGGTGGTCGATGTGGTCGACCTGGAATAAAGTTCGTTTTCAAGGGGCTGATAGGCCGAGTGGGCTAGTGGAATGGGC
TAAGGGGTATGTGGTGGCTTTTCGTGAGGTTGGGAGGAGTAGTAGGGAAGTGGGTCGGGAGACGGTTGGTCGGGAGAGGGTAGTGTGGCGGGCCCCGAGGAATGGCTGGT
ATAAGGTGAATTTTGATGCCTCTTTCCTCTCTCATTTGTCCAGGGTCGACCTAGGGATAATTGTGCGGGATCCTTTGGGCCAAGTAATGTTGTCGGCGACTTTTACTAAG
GATAATGTGAGAGAGGTTGACATGGCTGAAGGATATGCAGCCGTGAAAAGTTTGGAGTTGGTGACAGATATGGGTTTGGGTCCAGCAATTCTTGAGACGGACTCGAGTAG
AGTATTCCAGATCCTTAACCGAGAACGTGAGGATATCTCTGAGCTAGGTACTTTTCATTTCGTTCACCGTGAAGGGAATCAGGCGGCGCACCGATTGGCGAGGTTAGCAA
TAGCCCAGTGGAGTGATGAGGTTTGGATGGAGACTTGTCCTAGGTGTCTGGAGGACATTGAAGGCCTAGACCTGCTTTATGGGGTTCATAAGTCTGTTGTCACTCCGATA
GGGGTAAGGGGGAAGATAGGTGGAGGATCCCATAAATCTGTATCTGGTCCATTAGAGGCTTCTTTAGGTGGTGAGCCTAAGGCCACAAACCTACATGGAACAATTCATAT
TTATCTAGGACGCAAGAGGATGGAGGTTTGGATAGCTAGCTGA
Protein sequenceShow/hide protein sequence
MLAINATVAGLMTASRGGTKPCWNNTLMPQSGYKLAQSVTLARGASSSSPNSLQDWWKGCWKMTLPSKIKVFLWRLCLNRSILREAGFGEILERVRAGCCLLLCQDIKEA
IGGEFEELVVLWWSMWSTWNKVRFQGADRPSGLVEWAKGYVVAFREVGRSSREVGRETVGRERVVWRAPRNGWYKVNFDASFLSHLSRVDLGIIVRDPLGQVMLSATFTK
DNVREVDMAEGYAAVKSLELVTDMGLGPAILETDSSRVFQILNREREDISELGTFHFVHREGNQAAHRLARLAIAQWSDEVWMETCPRCLEDIEGLDLLYGVHKSVVTPI
GVRGKIGGGSHKSVSGPLEASLGGEPKATNLHGTIHIYLGRKRMEVWIAS