; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015862 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015862
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr12:27378620..27385806
RNA-Seq ExpressionLag0015862
SyntenyLag0015862
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]3.2e-1027.03Show/hide
Query:  MIENLKEKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSA----------MDRLRLKTQLSQVPQTHLPPDSWRLNTDASR
        ++  L ++E+  +++I W+IW  RN  +    + D+  + RS+   I+  +     +S           + R R    +  V  +  P + W+LNTDAS 
Subjt:  MIENLKEKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSA----------MDRLRLKTQLSQVPQTHLPPDSWRLNTDASR

Query:  NNQISVGGVGWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLS
        + +  VGG+GW   D  G  +  G  +I+ K  I  LE+  I+ GL+ +     S       PI + SD++ VI  + K + DL+
Subjt:  NNQISVGGVGWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLS

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]2.0e-1229.03Show/hide
Query:  EKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKI---------SEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISVG
        E+E   +++I W+IW  RN  +     P+   I  ++D  I          +G +    +  + R+   T     P T    +SW+LNT+A+     + G
Subjt:  EKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKI---------SEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISVG

Query:  GVGWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFV
        G+GW   D  G  I      I+ +  I  LE+ AI EGLR++   +  C      PI + SD++  I+ L++   D ++I +L+ EI +++ ++E +S  
Subjt:  GVGWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFV

Query:  HCPRSLNVEAHNLARHA
        H  R  N  AH LAR A
Subjt:  HCPRSLNVEAHNLARHA

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]6.1e-1428.37Show/hide
Query:  MIENLKEKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISVGGVG
        M++   +++L+  ++  W IWNHRN V+        + +I+ +   ++E         +M    L  +L   P    P   W LN DAS ++    GG+G
Subjt:  MIENLKEKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISVGGVG

Query:  WTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFVHCP
        W      G  +  G   ++    +K+LE  AILEGLR+L  +        + P+ + +D+  V + LN+   DL+K  ++V EI  L    E ++F    
Subjt:  WTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFVHCP

Query:  RSLNVEAHNLARHAA
        R  N  AH+LA+ A+
Subjt:  RSLNVEAHNLARHAA

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]9.8e-1230.14Show/hide
Query:  EKELEDAILIMWEIWNHRNNVL----HNATSPDQNLIIRSV------DTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISV
        E+E   +++I W+IW  RN  +    H+ T   Q +I R +      DT + +G +    +  + R+   T     P T    +SW+LNTDA+     + 
Subjt:  EKELEDAILIMWEIWNHRNNVL----HNATSPDQNLIIRSV------DTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISV

Query:  GGVGWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLE-IPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERIS
        GG+GW   D  G  I      I+ +  I  LE+ AI EGLR++          E   PI + SD++  I+ L++   D ++I +L+ EI +++ +++ +S
Subjt:  GGVGWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLE-IPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERIS

Query:  FVHCPRSLNVEAHNLARHA
          H  R  N  AH+LAR A
Subjt:  FVHCPRSLNVEAHNLARHA

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]7.0e-1030.2Show/hide
Query:  ILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPP-DSWRLNTDASRNNQISVGGVGWTCHDSSGSSIC
        I I W +W  RN  ++   S   ++   SV+  +S    Y   V   D    K  +++V + H PP D  +LN D +   + SV G+G    D  G  I 
Subjt:  ILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPP-DSWRLNTDASRNNQISVGGVGWTCHDSSGSSIC

Query:  EGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFVHCPRSLNVEAHNLAR
              K     + +E  A+L GL+        CA   +P I++ +D + ++N LN++   L+ I+F++ +I RL+V  + +  VH  R  N+ AH LAR
Subjt:  EGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFVHCPRSLNVEAHNLAR

Query:  HA
        HA
Subjt:  HA

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134129.6e-1329.03Show/hide
Query:  EKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKI---------SEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISVG
        E+E   +++I W+IW  RN  +     P+   I  ++D  I          +G +    +  + R+   T     P T    +SW+LNT+A+     + G
Subjt:  EKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKI---------SEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISVG

Query:  GVGWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFV
        G+GW   D  G  I      I+ +  I  LE+ AI EGLR++   +  C      PI + SD++  I+ L++   D ++I +L+ EI +++ ++E +S  
Subjt:  GVGWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFV

Query:  HCPRSLNVEAHNLARHA
        H  R  N  AH LAR A
Subjt:  HCPRSLNVEAHNLARHA

A0A6J1DNV9 uncharacterized protein LOC1110224033.0e-1428.37Show/hide
Query:  MIENLKEKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISVGGVG
        M++   +++L+  ++  W IWNHRN V+        + +I+ +   ++E         +M    L  +L   P    P   W LN DAS ++    GG+G
Subjt:  MIENLKEKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISVGGVG

Query:  WTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFVHCP
        W      G  +  G   ++    +K+LE  AILEGLR+L  +        + P+ + +D+  V + LN+   DL+K  ++V EI  L    E ++F    
Subjt:  WTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFVHCP

Query:  RSLNVEAHNLARHAA
        R  N  AH+LA+ A+
Subjt:  RSLNVEAHNLARHAA

A0A6J1DSV1 uncharacterized protein LOC1110236084.8e-1230.14Show/hide
Query:  EKELEDAILIMWEIWNHRNNVL----HNATSPDQNLIIRSV------DTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISV
        E+E   +++I W+IW  RN  +    H+ T   Q +I R +      DT + +G +    +  + R+   T     P T    +SW+LNTDA+     + 
Subjt:  EKELEDAILIMWEIWNHRNNVL----HNATSPDQNLIIRSV------DTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISV

Query:  GGVGWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLE-IPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERIS
        GG+GW   D  G  I      I+ +  I  LE+ AI EGLR++          E   PI + SD++  I+ L++   D ++I +L+ EI +++ +++ +S
Subjt:  GGVGWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLE-IPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERIS

Query:  FVHCPRSLNVEAHNLARHA
          H  R  N  AH+LAR A
Subjt:  FVHCPRSLNVEAHNLARHA

A0A6J5UE59 Reverse transcriptase domain-containing protein7.6e-1028.36Show/hide
Query:  MWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPD--SWRLNTDASRNNQISVGGVGWTCHDSSGSSICEG
        +W IW  RN  + +    D   ++  +  ++SE     + +  +    L+   S  P +   P     ++N DA+ + Q   GGVGW   DS G  +C G
Subjt:  MWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPD--SWRLNTDASRNNQISVGGVGWTCHDSSGSSICEG

Query:  FTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFVHCPRSLNVEAHNLARHA
                   M+E+ AI   L       + CAN  +  I+V SD+   I  LN      S +  +V +I +L+  + R+SFV  PRS N  AH++A  A
Subjt:  FTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFVHCPRSLNVEAHNLARHA

Query:  A
        +
Subjt:  A

A0A803PE40 Uncharacterized protein3.4e-1028.37Show/hide
Query:  LKEKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHL---PPDSW--RLNTDASRNNQISVGGV
        L ++ELE   +IMW IW  RN + H  +  D   +       +       N  +A+       Q+S +P   +   PPD +  +LN DA+ N    V GV
Subjt:  LKEKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHL---PPDSW--RLNTDASRNNQISVGGV

Query:  GWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFVHC
        G    +  G  +      ++G +    +E KA+   L  +       + L +   +V +DA+ V N LN    DLS  S ++ ++  L+    RI   H 
Subjt:  GWTCHDSSGSSICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFVHC

Query:  PRSLNVEAHNLARHA
         RS N  AH LA+HA
Subjt:  PRSLNVEAHNLARHA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGAGAATCTTAAAGAGAAGGAGCTAGAGGATGCTATCCTAATCATGTGGGAGATATGGAATCACAGAAATAATGTTTTACACAACGCAACGAGCCCAGACCAGAA
TCTCATCATCAGATCAGTGGACACAAAAATTTCGGAGGGGGTAGCTTACCTCAATTTCGTCTCTGCCATGGATCGCCTAAGATTGAAGACCCAGCTGAGTCAAGTCCCTC
AGACTCATCTGCCTCCTGATTCATGGAGGCTCAACACTGACGCCTCCCGTAACAATCAAATTAGCGTTGGGGGAGTGGGATGGACCTGCCATGACTCCTCAGGTTCTTCC
ATCTGCGAGGGATTCACAAGAATCAAGGGCAAATGGCCTATCAAGATGTTGGAGATGAAAGCGATTCTCGAAGGTTTGCGTAGCTTACCAACCATTCGGGCTTCGTGCGC
TAATCTAGAAATCCCGCCAATAGTGGTGTTGTCTGATGCCATTGGCGTCATCAATCCGCTGAACAAATCTGAAGCAGACTTATCGAAAATCTCATTTCTTGTCGTCGAGA
TTGATCGCCTGATAGTTGAGGTCGAGAGAATCTCTTTCGTTCATTGCCCGCGTTCGCTGAATGTGGAAGCTCATAATCTCGCGCGCCATGCTGCCTTCAGTCTTTTCGAG
GGCTTTAGTTGTTTTTTGGATACTTCTTCCAATTCGGAAGAAGGGGAAAGACGTGCCCTGCAAAACAAGAGAAAACTATGCACCGGTGTGGTGCTTGCCACACCGACTCC
GATGCTTAAGTCAGTTCGGAGAAGCAACTACATGAGTTCTCTTGAGTTCAAATATTCCAAGGCTAGCCTTAGATGCTTTATCATGAGGGTCTCCTTTTCATTTGGTTATG
GTTGGCTTCAAACTTCTGATGATCAGCTTAAGCCCTTGGAGGGGTGCTTGCACTCAGTTGGGGACTTGGGGAGGTTGAGGGCATGTGTCAACCTCTCTTCAAGTGTCCAT
GTCCATTCTAGGGGTGCGTACACTCTTCTGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCCGGGCTTCTCTTCAAGCATCGTGGTCATCCCAGGGGTGCGTACACTCTTC
TGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCCGGAAGTTCCCGGTCCTCTCTTCAAGCATCGTGGTCATCCTAGGGGTGCGTACACTCTTCTGCGGAAAAGCTTGGGGA
GGCCTAGGAAGTTCCCGGTCCTCTCTTCAAGCATCGTGGTCATCCCAAGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCCGAGCCTCTCTT
CAAGCATCGTGGTCATCCCAGGGGGGTGCGTACACTCTTCTAGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCCGGGCCTCTCTTCAAGCATCGTGGTCATCCCAGGGGT
GCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCTGGTCCTCTCTTCAAGCATCGTGGTCATCCCAGGGGGGTGCGTACACTCTTCTGGGGAAAAGCT
TGGAATCTCATCCCGAGGGTGCGTACACTCAGTCGAGGTCTTGGAATCTCATCTCGAGGGTGCGTACACTCAGTCGAGGTCTTGGAATTTCATCTCGATGCGTTTTCATG
TCGCATAACTCTCCCTTATCGGCATTTGTGGGTGTCGACCTCCTATGGGTCTGTCTCAGCTCCTTGCATGTCGACTCTGTCATTGTCACCGAGACCTTCAACATCATCAT
TCTCGAGCGTTTGTCGACTCTGCTAAGAAAGCTCATCCATTCTTTTACCTTCTGCTTGATGATTCCGTTCTCCAGAGGACTTGGAGCCAAATTAAAGTTCTGGAAACAGG
CCCAATTCCAAAGATTTGAGTTGATATGCTGTTTTGGGCGGTTGGGGAAAGACGTGCCCTGCAAAACAAGAGAAAACTATGCACCGGTGTGGTGCTTGCCACACCGACTC
CGATGCTTAAGTCAGTTCGGAGAAGCAAGAGAGAGAATAGCCTTAGAGATTGATTCAGGCCTAATGGGCCTTCGTGAGCTTGTTTCTGCTCTGGGCTCTGGTCATTCGTA
G
mRNA sequenceShow/hide mRNA sequence
ATGATCGAGAATCTTAAAGAGAAGGAGCTAGAGGATGCTATCCTAATCATGTGGGAGATATGGAATCACAGAAATAATGTTTTACACAACGCAACGAGCCCAGACCAGAA
TCTCATCATCAGATCAGTGGACACAAAAATTTCGGAGGGGGTAGCTTACCTCAATTTCGTCTCTGCCATGGATCGCCTAAGATTGAAGACCCAGCTGAGTCAAGTCCCTC
AGACTCATCTGCCTCCTGATTCATGGAGGCTCAACACTGACGCCTCCCGTAACAATCAAATTAGCGTTGGGGGAGTGGGATGGACCTGCCATGACTCCTCAGGTTCTTCC
ATCTGCGAGGGATTCACAAGAATCAAGGGCAAATGGCCTATCAAGATGTTGGAGATGAAAGCGATTCTCGAAGGTTTGCGTAGCTTACCAACCATTCGGGCTTCGTGCGC
TAATCTAGAAATCCCGCCAATAGTGGTGTTGTCTGATGCCATTGGCGTCATCAATCCGCTGAACAAATCTGAAGCAGACTTATCGAAAATCTCATTTCTTGTCGTCGAGA
TTGATCGCCTGATAGTTGAGGTCGAGAGAATCTCTTTCGTTCATTGCCCGCGTTCGCTGAATGTGGAAGCTCATAATCTCGCGCGCCATGCTGCCTTCAGTCTTTTCGAG
GGCTTTAGTTGTTTTTTGGATACTTCTTCCAATTCGGAAGAAGGGGAAAGACGTGCCCTGCAAAACAAGAGAAAACTATGCACCGGTGTGGTGCTTGCCACACCGACTCC
GATGCTTAAGTCAGTTCGGAGAAGCAACTACATGAGTTCTCTTGAGTTCAAATATTCCAAGGCTAGCCTTAGATGCTTTATCATGAGGGTCTCCTTTTCATTTGGTTATG
GTTGGCTTCAAACTTCTGATGATCAGCTTAAGCCCTTGGAGGGGTGCTTGCACTCAGTTGGGGACTTGGGGAGGTTGAGGGCATGTGTCAACCTCTCTTCAAGTGTCCAT
GTCCATTCTAGGGGTGCGTACACTCTTCTGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCCGGGCTTCTCTTCAAGCATCGTGGTCATCCCAGGGGTGCGTACACTCTTC
TGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCCGGAAGTTCCCGGTCCTCTCTTCAAGCATCGTGGTCATCCTAGGGGTGCGTACACTCTTCTGCGGAAAAGCTTGGGGA
GGCCTAGGAAGTTCCCGGTCCTCTCTTCAAGCATCGTGGTCATCCCAAGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCCGAGCCTCTCTT
CAAGCATCGTGGTCATCCCAGGGGGGTGCGTACACTCTTCTAGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCCGGGCCTCTCTTCAAGCATCGTGGTCATCCCAGGGGT
GCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCTAGGAAGTTCCTGGTCCTCTCTTCAAGCATCGTGGTCATCCCAGGGGGGTGCGTACACTCTTCTGGGGAAAAGCT
TGGAATCTCATCCCGAGGGTGCGTACACTCAGTCGAGGTCTTGGAATCTCATCTCGAGGGTGCGTACACTCAGTCGAGGTCTTGGAATTTCATCTCGATGCGTTTTCATG
TCGCATAACTCTCCCTTATCGGCATTTGTGGGTGTCGACCTCCTATGGGTCTGTCTCAGCTCCTTGCATGTCGACTCTGTCATTGTCACCGAGACCTTCAACATCATCAT
TCTCGAGCGTTTGTCGACTCTGCTAAGAAAGCTCATCCATTCTTTTACCTTCTGCTTGATGATTCCGTTCTCCAGAGGACTTGGAGCCAAATTAAAGTTCTGGAAACAGG
CCCAATTCCAAAGATTTGAGTTGATATGCTGTTTTGGGCGGTTGGGGAAAGACGTGCCCTGCAAAACAAGAGAAAACTATGCACCGGTGTGGTGCTTGCCACACCGACTC
CGATGCTTAAGTCAGTTCGGAGAAGCAAGAGAGAGAATAGCCTTAGAGATTGATTCAGGCCTAATGGGCCTTCGTGAGCTTGTTTCTGCTCTGGGCTCTGGTCATTCGTA
G
Protein sequenceShow/hide protein sequence
MIENLKEKELEDAILIMWEIWNHRNNVLHNATSPDQNLIIRSVDTKISEGVAYLNFVSAMDRLRLKTQLSQVPQTHLPPDSWRLNTDASRNNQISVGGVGWTCHDSSGSS
ICEGFTRIKGKWPIKMLEMKAILEGLRSLPTIRASCANLEIPPIVVLSDAIGVINPLNKSEADLSKISFLVVEIDRLIVEVERISFVHCPRSLNVEAHNLARHAAFSLFE
GFSCFLDTSSNSEEGERRALQNKRKLCTGVVLATPTPMLKSVRRSNYMSSLEFKYSKASLRCFIMRVSFSFGYGWLQTSDDQLKPLEGCLHSVGDLGRLRACVNLSSSVH
VHSRGAYTLLGKAWGGLGSSRASLQASWSSQGCVHSSGKSLGRPRKFPEVPGPLFKHRGHPRGAYTLLRKSLGRPRKFPVLSSSIVVIPRVRTLFWGKAWGGLGSSRASL
QASWSSQGGAYTLLGKSLGRPRKFPGLSSSIVVIPGVRTLFWGKAWGGLGSSWSSLQASWSSQGGAYTLLGKSLESHPEGAYTQSRSWNLISRVRTLSRGLGISSRCVFM
SHNSPLSAFVGVDLLWVCLSSLHVDSVIVTETFNIIILERLSTLLRKLIHSFTFCLMIPFSRGLGAKLKFWKQAQFQRFELICCFGRLGKDVPCKTRENYAPVWCLPHRL
RCLSQFGEARERIALEIDSGLMGLRELVSALGSGHS