; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021640 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021640
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H-like superfamily protein
Genome locationscaffold2:16224703..16230714
RNA-Seq ExpressionSpg021640
SyntenySpg021640
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF8665205.1 hypothetical protein HU200_054101 [Digitaria exilis]1.7e-1927.14Show/hide
Query:  AAVIC--WALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNK
        A ++C  W LWT RNK  HGE   P+     W ++      H     +        +Q  API    W PPP G+ K NTDAA  +   +  +G + R++
Subjt:  AAVIC--WALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNK

Query:  DGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAKN
         G  + A++ ++ Y L  L AE  A   G+++A++ G   +I+E+DC++ ++   ++    +++   +  + ++  S +D +  F+PR  N  A   A+ 
Subjt:  DGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAKN

Query:  ARFVKCSETW
        A   +  + W
Subjt:  ARFVKCSETW

XP_022131661.1 uncharacterized protein LOC111004786 [Momordica charantia]1.6e-2242.11Show/hide
Query:  AAVICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNKDG
        AA   WA+W DRN   HG  +     RC WI  Y  ++  A  + R S      QQ+  P    RW+PP D   K+N+DAAC     STGLG+I R+  G
Subjt:  AAVICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNKDG

Query:  GTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFI
          LVA S+FL   L  L AE+  ILE +++A S     L+VESDCQ AI  +
Subjt:  GTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFI

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.8e-2233.63Show/hide
Query:  ENLGKAAVICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVIC
        ++L  AA+  W +W DRN   HG+++ P+  +C+W    L  FL + + ++ S+ S   Q N  P+ +  W P      K+NTDAAC     ST  G I 
Subjt:  ENLGKAAVICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVIC

Query:  RNKDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLL
        R+     + A SI + + L  LLAE+  ILEG++ A +    +L VESD  LAI  I  +     D +  +  I  L   F  ISF+   R+ N+ A  L
Subjt:  RNKDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLL

Query:  AK-NARFVKCSETWTSNFPSLAL
        AK        +  W  NFP+  L
Subjt:  AK-NARFVKCSETWTSNFPSLAL

XP_024033146.1 uncharacterized protein LOC112095446 [Citrus clementina]7.5e-2032.58Show/hide
Query:  AVICWALWTDRNKAT-HGEKIHPIPNRCK---WIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRN
        AV+ W +W  RNK    G++++P+    K    +E Y         S RS   S           +D+W PPP G++K+N DAA       TGLGV+ RN
Subjt:  AVICWALWTDRNKAT-HGEKIHPIPNRCK---WIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRN

Query:  KDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAK
          G  +VAA     ++  V  AE  A+  G+ IAL      +I+E+DC    N    K+S+  ++  T+S I      F  +S   +PR  N  A  LAK
Subjt:  KDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAK

Query:  NARFVKCSETWTSNFPSLALY
         A     + TW  NFP   L+
Subjt:  NARFVKCSETWTSNFPSLALY

XP_024046732.1 uncharacterized protein LOC112101057 [Citrus clementina]1.7e-1931.98Show/hide
Query:  AVICWALWTDRNKAT-HGEKIHPIPNRCK---WIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRN
        AV+ W +W  RNK    G++++P+    K    +E Y         S RS   S           +D+W PPP G++K+N DAA       TGLGV+ RN
Subjt:  AVICWALWTDRNKAT-HGEKIHPIPNRCK---WIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRN

Query:  KDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAK
          G  +V A     ++  V  AE  A+  G+ IAL      +I+E+DC    N    K+S+  ++  T+S I      F  +S   +PR  N  A  LAK
Subjt:  KDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAK

Query:  NARFVKCSETWTSNFPSLALYW
         A     + TW  NFP   L++
Subjt:  NARFVKCSETWTSNFPSLALYW

TrEMBL top hitse value%identityAlignment
A0A6J1BQ49 uncharacterized protein LOC1110047867.8e-2342.11Show/hide
Query:  AAVICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNKDG
        AA   WA+W DRN   HG  +     RC WI  Y  ++  A  + R S      QQ+  P    RW+PP D   K+N+DAAC     STGLG+I R+  G
Subjt:  AAVICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNKDG

Query:  GTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFI
          LVA S+FL   L  L AE+  ILE +++A S     L+VESDCQ AI  +
Subjt:  GTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFI

A0A6J1DX30 uncharacterized protein LOC1110248741.3e-2233.63Show/hide
Query:  ENLGKAAVICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVIC
        ++L  AA+  W +W DRN   HG+++ P+  +C+W    L  FL + + ++ S+ S   Q N  P+ +  W P      K+NTDAAC     ST  G I 
Subjt:  ENLGKAAVICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVIC

Query:  RNKDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLL
        R+     + A SI + + L  LLAE+  ILEG++ A +    +L VESD  LAI  I  +     D +  +  I  L   F  ISF+   R+ N+ A  L
Subjt:  RNKDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLL

Query:  AK-NARFVKCSETWTSNFPSLAL
        AK        +  W  NFP+  L
Subjt:  AK-NARFVKCSETWTSNFPSLAL

A0A6P5RZ31 uncharacterized protein LOC1107515791.8e-1930.94Show/hide
Query:  NLGKAAVICWALWTDRNKATHGEK------IHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTG
        NL    V+ W +W D+N   HG+K      ++ I +R   +E   A+ +H      SS+                W PPP G  K+N D ACS      G
Subjt:  NLGKAAVICWALWTDRNKATHGEK------IHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTG

Query:  LGVICRNKDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNK
        LG + RN  G  + A S+ +       + EL AI EG++     G ++L+VE+D + AIN I+  + A+      ++ I  L  +F  +SF F PR  N 
Subjt:  LGVICRNKDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNK

Query:  EADLLAKNARFVKCSETWTSNFP
         AD LAK A      + W    P
Subjt:  EADLLAKNARFVKCSETWTSNFP

A0A7N2M749 Uncharacterized protein6.9e-1930.36Show/hide
Query:  KENLGKAAVICWALWTDRNKATHGEKIHP----IPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTG
        +E L +   ICW +W +RN+  HG    P    + N  + ++++ A+ +   + SRS +  T            RW PPP GY+K+N D A     + +G
Subjt:  KENLGKAAVICWALWTDRNKATHGEKIHP----IPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTG

Query:  LGVICRNKDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNK
        +GVI R+ DG  + A    LD  L VL  E  A+  G+  A   G  +++ E D Q+ IN I     A + V   +  +     SF    F    R+ N 
Subjt:  LGVICRNKDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNK

Query:  EADLLAKNARFVKCSETWTSNFPS
         A LLA+ A+ V+    W    PS
Subjt:  EADLLAKNARFVKCSETWTSNFPS

A0A7N2MHN1 RNase H domain-containing protein2.6e-1830.3Show/hide
Query:  CHVTARILDSKENLGKA--AVICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAAC
        C    R L   E+  K   AVI W+LW  RN    G + HP+   C      L  FL             +      PI + +W PP  G  KIN DAA 
Subjt:  CHVTARILDSKENLGKA--AVICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAAC

Query:  SEIPHSTGLGVICRNKDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFT
            +S G+GVI R+  G  + A S+ +     V   E+ A    +      G   +  E D  + IN + + +      E  M  I  L  SF+   FT
Subjt:  SEIPHSTGLGVICRNKDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFT

Query:  FIPRRLNKEADLLAKNARFVKCSETWTSNFP
        ++ R  N  AD LAK AR++   + WTS  P
Subjt:  FIPRRLNKEADLLAKNARFVKCSETWTSNFP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.3e-1427.55Show/hide
Query:  ICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNKDGGTL
        + W LW  RN+     K +  P   +   E    +         +SG  + +         +W  PP  + K NTDA         G+G I RN+ GG L
Subjt:  ICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNKDGGTL

Query:  VAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAKNA
           +  L     VL AEL A+   +     F  + +I ESD Q  +N +L     W  ++  +  I  L+  F ++ F F PR  NK AD +A+ +
Subjt:  VAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAKNA

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.3e-0633.6Show/hide
Query:  INTDAACSEIPHSTGLGVICRN-KDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIP
        I TDAA        G G + RN  +   L   S   + RL  L+AE  A+   ++ A S G   L + SD Q  I  I  +S +  +    +  I +L  
Subjt:  INTDAACSEIPHSTGLGVICRN-KDGGTLVAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIP

Query:  SFVDISFTFIPRRLNKEADLLAKNA
         F D+SF+F+PR  N+ AD LAK++
Subjt:  SFVDISFTFIPRRLNKEADLLAKNA

AT4G29090.1 Ribonuclease H-like superfamily protein1.9e-1327.55Show/hide
Query:  ICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNKDGGTL
        + W LW +RN+     +        +  E+ L  +      + + S  T  Q N++     RW PPP  + K NTDA  +      G+G + RN+ G   
Subjt:  ICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNKDGGTL

Query:  VAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAKNA
           +  L     VL AEL A+   +     F    +I ESD Q+ I  IL     W  ++ T+  +  L+  F ++ F FIPR  N  A+ +A+ +
Subjt:  VAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAKNA

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.0e-1126.21Show/hide
Query:  ICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNKDGGTL
        + W +W   N               +        +L  + ++   +G+    +N  P    +W PP     K N DA+  E    +GLG I RN  G  +
Subjt:  ICWALWTDRNKATHGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNKDGGTL

Query:  VAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAKNARFVK
                 R+    AE S ++  ++ +  FG + +I E D Q     I  KSS    ++  + +I   IPSF  I F+F  R  N  AD LAK A  +K
Subjt:  VAASIFLDYRLEVLLAELSAILEGMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAKNARFVK

Query:  CSETWT
         +  W+
Subjt:  CSETWT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTATACTGCACCATAAAGTGGGTATACTGCACCATAAATTGGTGATGAGTTTGAGGCATGGAGTTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAATTGGTGAT
TATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGCAGAGCTCGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAA
ACTGGGAGCAGAACTGCCACGTAACAGCTCGAATTCTAGACTCAAAGGAGAATTTGGGAAAAGCAGCAGTGATTTGTTGGGCGCTATGGACTGATCGAAATAAAGCTACT
CATGGGGAGAAAATTCATCCTATTCCTAATCGCTGCAAATGGATCGAAGAGTACCTGGCTTCGTTTTTGCATGCATCTGCATCTTCCCGTTCATCTAGTGGATCCACACT
TAACCAGCAAAATCAAGCCCCGATCTTCGAGGACCGCTGGATCCCTCCTCCTGACGGTTACTGGAAGATCAATACAGATGCTGCGTGTTCTGAGATTCCTCATTCGACTG
GCCTTGGCGTCATTTGCAGAAATAAAGATGGTGGTACCTTGGTGGCTGCATCGATTTTCTTGGATTATCGCTTGGAGGTGTTGTTGGCGGAGCTGAGTGCTATTCTGGAG
GGCATGAGAATCGCCCTTAGTTTTGGTTGTGAGAATCTTATTGTGGAGTCAGACTGCCAGCTTGCTATTAACTTCATTCTTCGAAAATCATCTGCTTGGAATGATGTGGA
AGCTACTATGTCGTCGATTTGGGATCTGATTCCTTCTTTTGTTGATATTTCATTTACGTTTATCCCTAGGAGGCTTAATAAGGAAGCTGACCTTTTAGCTAAGAATGCCA
GATTTGTGAAGTGTTCTGAGACTTGGACTTCCAATTTTCCAAGTTTGGCTTTGTATTGGGCCTTGTGGGCCTTTTCTGTTGCCCTAGGTGCACCTTTACTGTTGCTTAGA
GATAAATGGTCGAATGTTCAGAAATTTACTGTATATGGAAAGTATGAGGTGAATAACTGGACGGCAGACGGAGATCCAAAATGGAAGGAACTTGCTTTGAAGGTATTTGA
GAATGAGGAGAGGTATGCATTATATTCAAATGTCCATACGAACAATGCATATGGAAAGTCAGCAATTACTACAACTGTGGCTTATGGATTCTTTATCGATCTCTTGATTG
AATCGATTGTTGTAGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTATACTGCACCATAAAGTGGGTATACTGCACCATAAATTGGTGATGAGTTTGAGGCATGGAGTTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAATTGGTGAT
TATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGCAGAGCTCGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAA
ACTGGGAGCAGAACTGCCACGTAACAGCTCGAATTCTAGACTCAAAGGAGAATTTGGGAAAAGCAGCAGTGATTTGTTGGGCGCTATGGACTGATCGAAATAAAGCTACT
CATGGGGAGAAAATTCATCCTATTCCTAATCGCTGCAAATGGATCGAAGAGTACCTGGCTTCGTTTTTGCATGCATCTGCATCTTCCCGTTCATCTAGTGGATCCACACT
TAACCAGCAAAATCAAGCCCCGATCTTCGAGGACCGCTGGATCCCTCCTCCTGACGGTTACTGGAAGATCAATACAGATGCTGCGTGTTCTGAGATTCCTCATTCGACTG
GCCTTGGCGTCATTTGCAGAAATAAAGATGGTGGTACCTTGGTGGCTGCATCGATTTTCTTGGATTATCGCTTGGAGGTGTTGTTGGCGGAGCTGAGTGCTATTCTGGAG
GGCATGAGAATCGCCCTTAGTTTTGGTTGTGAGAATCTTATTGTGGAGTCAGACTGCCAGCTTGCTATTAACTTCATTCTTCGAAAATCATCTGCTTGGAATGATGTGGA
AGCTACTATGTCGTCGATTTGGGATCTGATTCCTTCTTTTGTTGATATTTCATTTACGTTTATCCCTAGGAGGCTTAATAAGGAAGCTGACCTTTTAGCTAAGAATGCCA
GATTTGTGAAGTGTTCTGAGACTTGGACTTCCAATTTTCCAAGTTTGGCTTTGTATTGGGCCTTGTGGGCCTTTTCTGTTGCCCTAGGTGCACCTTTACTGTTGCTTAGA
GATAAATGGTCGAATGTTCAGAAATTTACTGTATATGGAAAGTATGAGGTGAATAACTGGACGGCAGACGGAGATCCAAAATGGAAGGAACTTGCTTTGAAGGTATTTGA
GAATGAGGAGAGGTATGCATTATATTCAAATGTCCATACGAACAATGCATATGGAAAGTCAGCAATTACTACAACTGTGGCTTATGGATTCTTTATCGATCTCTTGATTG
AATCGATTGTTGTAGTATAG
Protein sequenceShow/hide protein sequence
MGILHHKVGILHHKLVMSLRHGVKLWQVLRIELKLVIICPCRKNYFAAAELGFAECSESVAGRLEGANSVLQQNWEQNCHVTARILDSKENLGKAAVICWALWTDRNKAT
HGEKIHPIPNRCKWIEEYLASFLHASASSRSSSGSTLNQQNQAPIFEDRWIPPPDGYWKINTDAACSEIPHSTGLGVICRNKDGGTLVAASIFLDYRLEVLLAELSAILE
GMRIALSFGCENLIVESDCQLAINFILRKSSAWNDVEATMSSIWDLIPSFVDISFTFIPRRLNKEADLLAKNARFVKCSETWTSNFPSLALYWALWAFSVALGAPLLLLR
DKWSNVQKFTVYGKYEVNNWTADGDPKWKELALKVFENEERYALYSNVHTNNAYGKSAITTTVAYGFFIDLLIESIVVV