; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003178 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003178
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H
Genome locationscaffold4:29976211..29980020
RNA-Seq ExpressionSpg003178
SyntenySpg003178
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143495.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia]4.7e-3747.66Show/hide
Query:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKE---SLNSSF
        +I TP E+LFEILL +GYVS+EY   +L  +G+D++LTC FHAGAKGH+LEQC  F   VQEL+DSK L V  +H ++  I+ VE++   E   +  SS 
Subjt:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKE---SLNSSF

Query:  KLKP--LTIYYREKTTTHD--PKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNK
         LKP  LTI+Y EK    +   K ITI VP PF+YKSSKAVPW YE KVT+  +  +PPLPVDNI+G GG+T        D  SL K            K
Subjt:  KLKP--LTIYYREKTTTHD--PKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNK

Query:  ADMIRVQKQEKERR
        A   + +K E++++
Subjt:  ADMIRVQKQEKERR

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]2.5e-3846.89Show/hide
Query:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK
        EI TP + LFEIL  +GY+S+E+   D+  E +D+NLTC +HAGA+GH LEQC  F ++VQEL+D K L VTQ+ H+E  ID VE +   ES ++++K K
Subjt:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK

Query:  PLTIYYREK--TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTI--NSETPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADMIR
        PLT+ YREK    ++  + ITIQVP PF+Y SSKAVPW YE KVT+   +++  LPVDNI+  GG+TR       +  SL K+T +     K  KA   +
Subjt:  PLTIYYREK--TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTI--NSETPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADMIR

Query:  VQKQEKERR
         +K E++++
Subjt:  VQKQEKERR

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]1.6e-1348Show/hide
Query:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMVADTTAPDTPIGNPQAGLPFPPSF---ASHVRTTAETSMPQHTTYNPLYDIPVGQYPFPSFKEGQIPQ
        + K EQEKT++DIEE+REK+DAI+ ALEKGK +A+T+ P      PQ    F PSF       R   E  M Q+TTYNPLYDIP GQ+P P    G  PQ
Subjt:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMVADTTAPDTPIGNPQAGLPFPPSF---ASHVRTTAETSMPQHTTYNPLYDIPVGQYPFPSFKEGQIPQ

Query:  IPMASQAGASYFKPEFSKIPFAVNN
         P    AG  + +PE    P  V N
Subjt:  IPMASQAGASYFKPEFSKIPFAVNN

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]1.2e-3742.34Show/hide
Query:  NASAQYSPFYGQNTRPQMNQNFQ------SRRQQQTITPEIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQ
        N   +  P   +N  P  +QN Q         + ++   +I TP  +LFEILL +GYVS+EY   +L  +G+D++LTC FHAGAKGHSLEQC  F  +VQ
Subjt:  NASAQYSPFYGQNTRPQMNQNFQ------SRRQQQTITPEIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQ

Query:  ELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLKPLTIYYREKTTTHD--PKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNIS
        EL+DSK L V  +H ++  I+ VE++   E  + + K K LTI+Y EK    +   K ITI VP PF+YKSSKAVPW Y+ KVT+  +  +PPLP+DNI+
Subjt:  ELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLKPLTIYYREKTTTHD--PKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNIS

Query:  GTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADMIRVQKQEKERR
        G GG+TR       D  SL K            KA   + +K E++++
Subjt:  GTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADMIRVQKQEKERR

XP_022157796.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia]6.1e-3750.87Show/hide
Query:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK
        +I TPKE+LFEILL +GYVS+EY   +L  + +D++LTC FHAGAKGHSLEQC  F  +VQEL+DSK L V  +H ++  I+ VE++   E  + + K K
Subjt:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK

Query:  PLTIYYREK--TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSR
         LTI+Y EK    +   K ITI VP PF+YKSSKAVPW Y+ KVT+  +  +PPLPVDNI+  G +  ++ ++
Subjt:  PLTIYYREK--TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSR

XP_022157796.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia]6.3e-1853.54Show/hide
Query:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMVADTTAPDTPIGNPQAGLPFPPSFASHVRTTAETSMPQHTTYNPLYDIPVGQYPFPSFKEGQIPQIP
        ++K EQEKT++DIEE+REK+D I   LEKGK  AD      PI  PQ   P+PP +   +R  AE  MPQ+TTYNPLYD+P+GQYP    K  Q  QIP
Subjt:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMVADTTAPDTPIGNPQAGLPFPPSFASHVRTTAETSMPQHTTYNPLYDIPVGQYPFPSFKEGQIPQIP

XP_022157796.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia]1.8e-3645.97Show/hide
Query:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK
        +I TP  +LFEILL +GY+S+EY       +G+D++LTC FH GAKGHSLEQC  F  +VQEL+DSK L    +H ++   + VE++L  E  + S K K
Subjt:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK

Query:  PLTIYYREK----TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADM
        PLTI+YREK    + +  P  IT  VP PF+YKSSKAVPW YE KVT+  +  +P LPVDNI+G GG+TR       D  SL K            KA  
Subjt:  PLTIYYREK----TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADM

Query:  IRVQKQEKERR
         + +K E++++
Subjt:  IRVQKQEKERR

TrEMBL top hitse value%identityAlignment
A0A6J1CNY7 Ribonuclease H2.3e-3747.66Show/hide
Query:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKE---SLNSSF
        +I TP E+LFEILL +GYVS+EY   +L  +G+D++LTC FHAGAKGH+LEQC  F   VQEL+DSK L V  +H ++  I+ VE++   E   +  SS 
Subjt:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKE---SLNSSF

Query:  KLKP--LTIYYREKTTTHD--PKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNK
         LKP  LTI+Y EK    +   K ITI VP PF+YKSSKAVPW YE KVT+  +  +PPLPVDNI+G GG+T        D  SL K            K
Subjt:  KLKP--LTIYYREKTTTHD--PKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNK

Query:  ADMIRVQKQEKERR
        A   + +K E++++
Subjt:  ADMIRVQKQEKERR

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222311.2e-3846.89Show/hide
Query:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK
        EI TP + LFEIL  +GY+S+E+   D+  E +D+NLTC +HAGA+GH LEQC  F ++VQEL+D K L VTQ+ H+E  ID VE +   ES ++++K K
Subjt:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK

Query:  PLTIYYREK--TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTI--NSETPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADMIR
        PLT+ YREK    ++  + ITIQVP PF+Y SSKAVPW YE KVT+   +++  LPVDNI+  GG+TR       +  SL K+T +     K  KA   +
Subjt:  PLTIYYREK--TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTI--NSETPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADMIR

Query:  VQKQEKERR
         +K E++++
Subjt:  VQKQEKERR

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222317.8e-1448Show/hide
Query:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMVADTTAPDTPIGNPQAGLPFPPSF---ASHVRTTAETSMPQHTTYNPLYDIPVGQYPFPSFKEGQIPQ
        + K EQEKT++DIEE+REK+DAI+ ALEKGK +A+T+ P      PQ    F PSF       R   E  M Q+TTYNPLYDIP GQ+P P    G  PQ
Subjt:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMVADTTAPDTPIGNPQAGLPFPPSF---ASHVRTTAETSMPQHTTYNPLYDIPVGQYPFPSFKEGQIPQ

Query:  IPMASQAGASYFKPEFSKIPFAVNN
         P    AG  + +PE    P  V N
Subjt:  IPMASQAGASYFKPEFSKIPFAVNN

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222315.9e-3842.34Show/hide
Query:  NASAQYSPFYGQNTRPQMNQNFQ------SRRQQQTITPEIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQ
        N   +  P   +N  P  +QN Q         + ++   +I TP  +LFEILL +GYVS+EY   +L  +G+D++LTC FHAGAKGHSLEQC  F  +VQ
Subjt:  NASAQYSPFYGQNTRPQMNQNFQ------SRRQQQTITPEIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQ

Query:  ELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLKPLTIYYREKTTTHD--PKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNIS
        EL+DSK L V  +H ++  I+ VE++   E  + + K K LTI+Y EK    +   K ITI VP PF+YKSSKAVPW Y+ KVT+  +  +PPLP+DNI+
Subjt:  ELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLKPLTIYYREKTTTHD--PKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNIS

Query:  GTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADMIRVQKQEKERR
        G GG+TR       D  SL K            KA   + +K E++++
Subjt:  GTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADMIRVQKQEKERR

A0A6J1DZ90 Ribonuclease H2.9e-3750.87Show/hide
Query:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK
        +I TPKE+LFEILL +GYVS+EY   +L  + +D++LTC FHAGAKGHSLEQC  F  +VQEL+DSK L V  +H ++  I+ VE++   E  + + K K
Subjt:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK

Query:  PLTIYYREK--TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSR
         LTI+Y EK    +   K ITI VP PF+YKSSKAVPW Y+ KVT+  +  +PPLPVDNI+  G +  ++ ++
Subjt:  PLTIYYREK--TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSR

A0A6J1DZ90 Ribonuclease H3.1e-1853.54Show/hide
Query:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMVADTTAPDTPIGNPQAGLPFPPSFASHVRTTAETSMPQHTTYNPLYDIPVGQYPFPSFKEGQIPQIP
        ++K EQEKT++DIEE+REK+D I   LEKGK  AD      PI  PQ   P+PP +   +R  AE  MPQ+TTYNPLYD+P+GQYP    K  Q  QIP
Subjt:  EEKGEQEKTKRDIEEIREKVDAIIAALEKGKMVADTTAPDTPIGNPQAGLPFPPSFASHVRTTAETSMPQHTTYNPLYDIPVGQYPFPSFKEGQIPQIP

A0A6J1DZ90 Ribonuclease H6.6e-3745.97Show/hide
Query:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK
        +I TP  +LFEILL +GY+S+EY       +G+D++LTC FH GAKGHSLEQC  F  +VQEL+DSK L    +H ++   + VE++L  E  + S K K
Subjt:  EIMTPKEKLFEILLVNGYVSIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLK

Query:  PLTIYYREK----TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADM
        PLTI+YREK    + +  P  IT  VP PF+YKSSKAVPW YE KVT+  +  +P LPVDNI+G GG+TR       D  SL K            KA  
Subjt:  PLTIYYREK----TTTHDPKLITIQVPTPFKYKSSKAVPWSYEYKVTINSE--TPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADM

Query:  IRVQKQEKERR
         + +K E++++
Subjt:  IRVQKQEKERR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAGGAGAGCCCGAGGATTTGCAGAATGGGCAAGGGATCTGCGAGAGAATACATCACCTATGGCCTCTAATGCGGAGGAGTTGTTTGAGTTTTTAGGGATGACTCG
TAGAGACCTAGGGCGTAGAACAAGGATTATGGAAGAAAAGGGTGAACAAGAGAAGACTAAGCGGGACATCGAGGAAATCAGGGAAAAGGTTGATGCAATCATTGCCGCTT
TAGAGAAGGGCAAAATGGTGGCAGATACGACTGCACCAGATACTCCGATTGGAAACCCTCAAGCTGGCCTACCATTTCCACCCAGTTTCGCTTCACATGTTCGTACGACA
GCAGAAACGTCCATGCCACAACATACTACCTATAACCCCTTATATGACATACCTGTTGGGCAATACCCTTTTCCATCATTTAAAGAAGGCCAAATCCCCCAAATACCCAT
GGCTAGCCAGGCTGGTGCTTCTTATTTCAAGCCAGAGTTTTCAAAAATACCTTTTGCGGTGAATAATGCTTCAGCGCAGTACTCCCCATTTTATGGCCAAAATACTCGAC
CCCAAATGAATCAGAATTTTCAGTCTCGTAGACAACAACAAACAATCACTCCAGAGATCATGACTCCTAAGGAGAAGCTTTTTGAGATTCTCCTCGTTAACGGATATGTA
TCAATAGAGTATGCACACAAGGACCTTGTTCAAGAAGGATTCGATGATAATTTGACTTGCCTATTTCATGCTGGGGCAAAGGGGCATTCTTTGGAACAATGTCGTCGTTT
TCATAAGAGGGTCCAAGAACTGGTGGACTCAAAATTTCTTGTGGTCACCCAAGCCCATCACCGGGAGGATGAAATAGACGCTGTGGAAGAATTGCTGCCTAAAGAAAGTT
TGAATTCATCTTTCAAACTGAAGCCACTCACGATCTATTACCGTGAGAAGACTACTACTCATGATCCAAAGTTGATCACCATTCAGGTGCCGACTCCTTTCAAGTACAAG
AGCTCTAAGGCAGTACCATGGAGTTATGAGTACAAAGTAACTATTAACTCAGAAACGCCACCACTTCCAGTTGACAACATTTCCGGAACGGGAGGCGTAACACGAAAAAG
CAACAGTCGATTAGATGACGTTTTAAGCCTATCGAAAAATACAAAAAGATTTGGGCTTGGGTATAAGCCGAATAAAGCAGATATGATCAGGGTACAGAAGCAAGAAAAAG
AGAGGCGTTTGGCCAGATTTAAAAATCGCGAACCAGAATATGAAGGAAAAGTCATCCCTCATCTCTACCACTCGTTTGAAAGTGCTGGTATAATTCGTCCAAGTGATTTC
GCAGTTGCAGTAGTGACTAAAGAGGAAGAATTGGGTCCATGGATCTGCCCGTGCCCAGAAAACTTCGAGCTCAACAATTGGAGTACTATTGAATTACCGTCATTTGCTGT
TAAGATGTCAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAGGAGAGCCCGAGGATTTGCAGAATGGGCAAGGGATCTGCGAGAGAATACATCACCTATGGCCTCTAATGCGGAGGAGTTGTTTGAGTTTTTAGGGATGACTCG
TAGAGACCTAGGGCGTAGAACAAGGATTATGGAAGAAAAGGGTGAACAAGAGAAGACTAAGCGGGACATCGAGGAAATCAGGGAAAAGGTTGATGCAATCATTGCCGCTT
TAGAGAAGGGCAAAATGGTGGCAGATACGACTGCACCAGATACTCCGATTGGAAACCCTCAAGCTGGCCTACCATTTCCACCCAGTTTCGCTTCACATGTTCGTACGACA
GCAGAAACGTCCATGCCACAACATACTACCTATAACCCCTTATATGACATACCTGTTGGGCAATACCCTTTTCCATCATTTAAAGAAGGCCAAATCCCCCAAATACCCAT
GGCTAGCCAGGCTGGTGCTTCTTATTTCAAGCCAGAGTTTTCAAAAATACCTTTTGCGGTGAATAATGCTTCAGCGCAGTACTCCCCATTTTATGGCCAAAATACTCGAC
CCCAAATGAATCAGAATTTTCAGTCTCGTAGACAACAACAAACAATCACTCCAGAGATCATGACTCCTAAGGAGAAGCTTTTTGAGATTCTCCTCGTTAACGGATATGTA
TCAATAGAGTATGCACACAAGGACCTTGTTCAAGAAGGATTCGATGATAATTTGACTTGCCTATTTCATGCTGGGGCAAAGGGGCATTCTTTGGAACAATGTCGTCGTTT
TCATAAGAGGGTCCAAGAACTGGTGGACTCAAAATTTCTTGTGGTCACCCAAGCCCATCACCGGGAGGATGAAATAGACGCTGTGGAAGAATTGCTGCCTAAAGAAAGTT
TGAATTCATCTTTCAAACTGAAGCCACTCACGATCTATTACCGTGAGAAGACTACTACTCATGATCCAAAGTTGATCACCATTCAGGTGCCGACTCCTTTCAAGTACAAG
AGCTCTAAGGCAGTACCATGGAGTTATGAGTACAAAGTAACTATTAACTCAGAAACGCCACCACTTCCAGTTGACAACATTTCCGGAACGGGAGGCGTAACACGAAAAAG
CAACAGTCGATTAGATGACGTTTTAAGCCTATCGAAAAATACAAAAAGATTTGGGCTTGGGTATAAGCCGAATAAAGCAGATATGATCAGGGTACAGAAGCAAGAAAAAG
AGAGGCGTTTGGCCAGATTTAAAAATCGCGAACCAGAATATGAAGGAAAAGTCATCCCTCATCTCTACCACTCGTTTGAAAGTGCTGGTATAATTCGTCCAAGTGATTTC
GCAGTTGCAGTAGTGACTAAAGAGGAAGAATTGGGTCCATGGATCTGCCCGTGCCCAGAAAACTTCGAGCTCAACAATTGGAGTACTATTGAATTACCGTCATTTGCTGT
TAAGATGTCAAAGTAA
Protein sequenceShow/hide protein sequence
MARRARGFAEWARDLRENTSPMASNAEELFEFLGMTRRDLGRRTRIMEEKGEQEKTKRDIEEIREKVDAIIAALEKGKMVADTTAPDTPIGNPQAGLPFPPSFASHVRTT
AETSMPQHTTYNPLYDIPVGQYPFPSFKEGQIPQIPMASQAGASYFKPEFSKIPFAVNNASAQYSPFYGQNTRPQMNQNFQSRRQQQTITPEIMTPKEKLFEILLVNGYV
SIEYAHKDLVQEGFDDNLTCLFHAGAKGHSLEQCRRFHKRVQELVDSKFLVVTQAHHREDEIDAVEELLPKESLNSSFKLKPLTIYYREKTTTHDPKLITIQVPTPFKYK
SSKAVPWSYEYKVTINSETPPLPVDNISGTGGVTRKSNSRLDDVLSLSKNTKRFGLGYKPNKADMIRVQKQEKERRLARFKNREPEYEGKVIPHLYHSFESAGIIRPSDF
AVAVVTKEEELGPWICPCPENFELNNWSTIELPSFAVKMSK