; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg023024 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg023024
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold12:30843337..30847918
RNA-Seq ExpressionSpg023024
SyntenySpg023024
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]1.9e-1834.36Show/hide
Query:  YWRTEDFWSWILDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSK----FIESNMFEERETEQSHRSRQR-GKPSKGAKS-HPSQFRWKSPPYP
        +W  +D W+W+++  S+EE+  S++I W IW+ RN  + +  +    +L +    FI SN+ +     Q+ RS+Q  G   +G ++ +    RW +PP  
Subjt:  YWRTEDFWSWILDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSK----FIESNMFEERETEQSHRSRQR-GKPSKGAKS-HPSQFRWKSPPYP

Query:  CWKLNVDATWSENLGKGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFS
        CWKLN DA+WSE    GGIGWIL D  G  +  G   I +   I  LE+  I  GL+ + ++  RS      PI +ES+ + V+RL+  E+ D +
Subjt:  CWKLNVDATWSENLGKGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFS

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]5.5e-1833.64Show/hide
Query:  LDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSKFIESNMFEE--RETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSENLG
        +DK+ EEE  +S+II W IW+ RN+ + K   P    +   I+  +     R T    +S  +           +  +WK P    WKLN +A W  +  
Subjt:  LDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSKFIESNMFEE--RETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSENLG

Query:  KGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLGEVS
         GGIGWILRD  G  +    + I    +I  LE+ AI EGLR +     R       PI +ES+ +  + LL+ + +D +EI  L+EEI ++ + +  VS
Subjt:  KGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLGEVS

Query:  FVFCPRESNEVAHCLAR
             RE+N+VAH LAR
Subjt:  FVFCPRESNEVAHCLAR

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]1.3e-2234.53Show/hide
Query:  ILDKSSEEEMEKSIIIIWSIWQHRNEIL----QKSSSPVVFKLSKFIESNMFEERETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSE
        +LDK+S+E+++  +I  W IW HRN ++      S S ++ +L+KF+         TE S++S      S   K+  ++ +W+ PP   W LN DA+WS+
Subjt:  ILDKSSEEEMEKSIIIIWSIWQHRNEIL----QKSSSPVVFKLSKFIESNMFEERETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSE

Query:  NLGKGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLG
        +  +GGIGWI+R   G  +  G + +    ++KLLE  AI EGLR + +L        + P+ +E++   V  LLN + ED ++   +VEEIL L++S  
Subjt:  NLGKGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLG

Query:  EVSFVFCPRESNEVAHCLARLSS
         ++F    RE+N  AH LA+ +S
Subjt:  EVSFVFCPRESNEVAHCLARLSS

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]1.4e-2142.29Show/hide
Query:  GFEGDKFTWRRRKDSRVGPLERLDRFFVNPDMDKIYNFITVHHLRYHDSDHRPIMA--ICAQGCNFKRKKAVRVKRFEEAWISFPESKAIVEETWKSVQG
        GF GD FTW      R    ERLDRF +N  + +I   + + HL +  SDHRPI+A  +        R+K  R  RFEE W SF E K IV   W +VQG
Subjt:  GFEGDKFTWRRRKDSRVGPLERLDRFFVNPDMDKIYNFITVHHLRYHDSDHRPIMA--ICAQGCNFKRKKAVRVKRFEEAWISFPESKAIVEETWKSVQG

Query:  G-DAEAYKTKINLCLEKLTNWNKIRLEGSISKAIDRKTNEIKMLEREESGCPSFNLIKAEKKLENLLLEEEQYWK
              ++ KIN CLE+L  WN  RL GS+  AI RK  EI+ + ++ +     NL +A++ LE LL EEE YW+
Subjt:  G-DAEAYKTKINLCLEKLTNWNKIRLEGSISKAIDRKTNEIKMLEREESGCPSFNLIKAEKKLENLLLEEEQYWK

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]6.6e-1934.86Show/hide
Query:  LDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSKFIESNMFEE--RETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSENLG
        +DK+ EEE  +S+II W IW+ RN+ + K        +   I+  +     R+T    +S  +           +  RWK P    WKLN DA W  +  
Subjt:  LDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSKFIESNMFEE--RETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSENLG

Query:  KGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMT-IPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLGEV
         GGIGWILRD  G  +    + I    +I  LE+ AI EGLR +     R I      PI +ES+ +  + LL+ + +D +EI  L+EEI ++ E +  V
Subjt:  KGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMT-IPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLGEV

Query:  SFVFCPRESNEVAHCLAR
        S     RE+N+VAH LAR
Subjt:  SFVFCPRESNEVAHCLAR

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134122.7e-1833.64Show/hide
Query:  LDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSKFIESNMFEE--RETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSENLG
        +DK+ EEE  +S+II W IW+ RN+ + K   P    +   I+  +     R T    +S  +           +  +WK P    WKLN +A W  +  
Subjt:  LDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSKFIESNMFEE--RETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSENLG

Query:  KGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLGEVS
         GGIGWILRD  G  +    + I    +I  LE+ AI EGLR +     R       PI +ES+ +  + LL+ + +D +EI  L+EEI ++ + +  VS
Subjt:  KGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLGEVS

Query:  FVFCPRESNEVAHCLAR
             RE+N+VAH LAR
Subjt:  FVFCPRESNEVAHCLAR

A0A6J1CQG0 uncharacterized protein LOC1110132169.2e-1934.36Show/hide
Query:  YWRTEDFWSWILDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSK----FIESNMFEERETEQSHRSRQR-GKPSKGAKS-HPSQFRWKSPPYP
        +W  +D W+W+++  S+EE+  S++I W IW+ RN  + +  +    +L +    FI SN+ +     Q+ RS+Q  G   +G ++ +    RW +PP  
Subjt:  YWRTEDFWSWILDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSK----FIESNMFEERETEQSHRSRQR-GKPSKGAKS-HPSQFRWKSPPYP

Query:  CWKLNVDATWSENLGKGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFS
        CWKLN DA+WSE    GGIGWIL D  G  +  G   I +   I  LE+  I  GL+ + ++  RS      PI +ES+ + V+RL+  E+ D +
Subjt:  CWKLNVDATWSENLGKGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFS

A0A6J1DNV9 uncharacterized protein LOC1110224036.2e-2334.53Show/hide
Query:  ILDKSSEEEMEKSIIIIWSIWQHRNEIL----QKSSSPVVFKLSKFIESNMFEERETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSE
        +LDK+S+E+++  +I  W IW HRN ++      S S ++ +L+KF+         TE S++S      S   K+  ++ +W+ PP   W LN DA+WS+
Subjt:  ILDKSSEEEMEKSIIIIWSIWQHRNEIL----QKSSSPVVFKLSKFIESNMFEERETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSE

Query:  NLGKGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLG
        +  +GGIGWI+R   G  +  G + +    ++KLLE  AI EGLR + +L        + P+ +E++   V  LLN + ED ++   +VEEIL L++S  
Subjt:  NLGKGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLG

Query:  EVSFVFCPRESNEVAHCLARLSS
         ++F    RE+N  AH LA+ +S
Subjt:  EVSFVFCPRESNEVAHCLARLSS

A0A6J1DRA0 uncharacterized protein LOC1110224236.8e-2242.29Show/hide
Query:  GFEGDKFTWRRRKDSRVGPLERLDRFFVNPDMDKIYNFITVHHLRYHDSDHRPIMA--ICAQGCNFKRKKAVRVKRFEEAWISFPESKAIVEETWKSVQG
        GF GD FTW      R    ERLDRF +N  + +I   + + HL +  SDHRPI+A  +        R+K  R  RFEE W SF E K IV   W +VQG
Subjt:  GFEGDKFTWRRRKDSRVGPLERLDRFFVNPDMDKIYNFITVHHLRYHDSDHRPIMA--ICAQGCNFKRKKAVRVKRFEEAWISFPESKAIVEETWKSVQG

Query:  G-DAEAYKTKINLCLEKLTNWNKIRLEGSISKAIDRKTNEIKMLEREESGCPSFNLIKAEKKLENLLLEEEQYWK
              ++ KIN CLE+L  WN  RL GS+  AI RK  EI+ + ++ +     NL +A++ LE LL EEE YW+
Subjt:  G-DAEAYKTKINLCLEKLTNWNKIRLEGSISKAIDRKTNEIKMLEREESGCPSFNLIKAEKKLENLLLEEEQYWK

A0A6J1DSV1 uncharacterized protein LOC1110236083.2e-1934.86Show/hide
Query:  LDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSKFIESNMFEE--RETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSENLG
        +DK+ EEE  +S+II W IW+ RN+ + K        +   I+  +     R+T    +S  +           +  RWK P    WKLN DA W  +  
Subjt:  LDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSKFIESNMFEE--RETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSENLG

Query:  KGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMT-IPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLGEV
         GGIGWILRD  G  +    + I    +I  LE+ AI EGLR +     R I      PI +ES+ +  + LL+ + +D +EI  L+EEI ++ E +  V
Subjt:  KGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMT-IPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLGEV

Query:  SFVFCPRESNEVAHCLAR
        S     RE+N+VAH LAR
Subjt:  SFVFCPRESNEVAHCLAR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.2e-1631.43Show/hide
Query:  IIWSIWQHRNEILQKS---SSPVVFKLSKFIESNMFEERETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSENLGKGGIGWILRDSAG
        ++W +W+ RNE++ K     +P V  L + +E   FEE  T    R    GK S          +WK+PPY   K N DATW     + GIGWILR+ +G
Subjt:  IIWSIWQHRNEILQKS---SSPVVFKLSKFIESNMFEERETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSENLGKGGIGWILRDSAG

Query:  SSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLGEVSFVFCPRESNEVAH
          L MG +++ ++ ++   E++A+   +  +     + I+        ES+   +V LLN  ++ +  +   +E+I +L     EV F F PR  N+VA 
Subjt:  SSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLGEVSFVFCPRESNEVAH

Query:  CLARLSSSVS
         +AR S S S
Subjt:  CLARLSSSVS

AT4G29090.1 Ribonuclease H-like superfamily protein2.6e-1326.43Show/hide
Query:  FWSWILDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSKFIESNMFEERETEQSH---RSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDAT
        +W + L   + +  + S ++ W +W+     L K+ + +VF+  +F    +    E +      R+      +K   +  S  RW+ PP+   K N DAT
Subjt:  FWSWILDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSKFIESNMFEERETEQSH---RSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDAT

Query:  WSENLGKGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKE
        W+ +  + GIGW+LR+  G    MG +++ K  S+   E++A+   + +  S  + + V      + ES+   ++ +LN  +E +  +   ++++ RL  
Subjt:  WSENLGKGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPPIVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKE

Query:  SLGEVSFVFCPRESNEVAHCLARLSSS
           EV FVF PRE N +A  +AR S S
Subjt:  SLGEVSFVFCPRESNEVAHCLARLSSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCCAGAGCTAAATGGGCCTTTGGATCTGAAAGGATCACAGGGCCTGAAAAAGAAATGGAGAAGGCCATGGAGGTGGACAGTGAGCAAAAGGAACTTTCCAGCCC
AAACGAATTATTGAGGTCTGAAGTAACAGGAGAAGAATCACATCAAATAAGGAAATGGAAAAGGAGAGCCAGGCAAGGAAAAACAGAAGGAAACAGAGAAGAGCTTTTCA
GAATGGAAAAAAGAAAATTTCAAAATTCTGGTTTCGAGGGAGACAAATTCACATGGAGGAGAAGAAAAGACTCTAGAGTTGGGCCTTTGGAAAGATTGGACAGATTTTTT
GTGAACCCGGATATGGATAAGATTTACAACTTCATCACAGTGCATCATCTCAGATACCACGACTCGGACCACAGGCCTATAATGGCAATTTGTGCGCAAGGCTGCAATTT
CAAAAGGAAAAAAGCTGTCAGAGTTAAAAGATTCGAAGAAGCTTGGATTTCTTTTCCGGAAAGCAAGGCCATAGTCGAGGAAACCTGGAAGTCTGTTCAGGGGGGTGATG
CAGAAGCCTACAAAACAAAAATAAATTTATGTTTGGAGAAGCTTACAAACTGGAACAAAATCAGGTTAGAGGGCTCCATTTCAAAGGCTATTGACAGAAAGACCAATGAG
ATTAAAATGTTGGAAAGGGAGGAGTCGGGATGTCCATCTTTTAATCTAATCAAAGCTGAAAAGAAGCTGGAAAATTTGCTTCTTGAAGAGGAACAATATTGGAAGATGAG
ATCGCGAGAGGACTGGCTCAAATGGGAAGACCGGAATACAAAATGGGATTATTGGAGAACGGAAGACTTTTGGTCGTGGATTTTGGACAAGTCGAGCGAAGAGGAGATGG
AAAAGTCGATTATTATTATATGGAGCATATGGCAGCACAGGAACGAAATTCTCCAAAAATCATCCAGTCCAGTTGTGTTCAAATTATCAAAATTTATTGAAAGCAATATG
TTTGAGGAAAGAGAAACAGAACAATCTCACCGGTCGCGGCAAAGAGGGAAGCCTTCAAAGGGAGCGAAGAGCCATCCGAGTCAGTTCAGGTGGAAGTCCCCGCCATATCC
TTGTTGGAAACTCAATGTAGACGCTACTTGGAGTGAAAATCTGGGTAAAGGCGGCATAGGGTGGATCCTTCGTGACTCTGCGGGTTCTTCGTTATGCATGGGATTCAAAT
CGATCAGTAAAAGCTGGTCCATAAAGCTGCTTGAAATGAAAGCAATAGAAGAAGGGCTCAGAATCGTACCTTCTTTGATCGAGCGCTCCATTGTGATGACTATTCCCCCT
ATCGTGGTTGAGTCTAATGTTATTGGAGTCGTCCGCCTCCTTAATGGGGAAGAAGAAGATTTTTCTGAAATCTCCCATCTGGTGGAGGAGATTTTACGCCTCAAGGAGTC
TTTGGGGGAGGTTTCTTTCGTTTTTTGCCCAAGAGAGAGTAACGAGGTTGCCCACTGTTTGGCGCGCCTTTCTTCCTCTGTTTCCCCTGTATCTCGTTATTTGTCTGGTT
TTGAGATCTCTACTATTTCAGAAGAAGATCATGGTTGTTGGTTTGGCCCTCCCCCCTCTTGGTTAGTTGGGGTGTTAAATGGGTCTTGTTCTGATTCTTTTATTTCCCTT
TAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCCAGAGCTAAATGGGCCTTTGGATCTGAAAGGATCACAGGGCCTGAAAAAGAAATGGAGAAGGCCATGGAGGTGGACAGTGAGCAAAAGGAACTTTCCAGCCC
AAACGAATTATTGAGGTCTGAAGTAACAGGAGAAGAATCACATCAAATAAGGAAATGGAAAAGGAGAGCCAGGCAAGGAAAAACAGAAGGAAACAGAGAAGAGCTTTTCA
GAATGGAAAAAAGAAAATTTCAAAATTCTGGTTTCGAGGGAGACAAATTCACATGGAGGAGAAGAAAAGACTCTAGAGTTGGGCCTTTGGAAAGATTGGACAGATTTTTT
GTGAACCCGGATATGGATAAGATTTACAACTTCATCACAGTGCATCATCTCAGATACCACGACTCGGACCACAGGCCTATAATGGCAATTTGTGCGCAAGGCTGCAATTT
CAAAAGGAAAAAAGCTGTCAGAGTTAAAAGATTCGAAGAAGCTTGGATTTCTTTTCCGGAAAGCAAGGCCATAGTCGAGGAAACCTGGAAGTCTGTTCAGGGGGGTGATG
CAGAAGCCTACAAAACAAAAATAAATTTATGTTTGGAGAAGCTTACAAACTGGAACAAAATCAGGTTAGAGGGCTCCATTTCAAAGGCTATTGACAGAAAGACCAATGAG
ATTAAAATGTTGGAAAGGGAGGAGTCGGGATGTCCATCTTTTAATCTAATCAAAGCTGAAAAGAAGCTGGAAAATTTGCTTCTTGAAGAGGAACAATATTGGAAGATGAG
ATCGCGAGAGGACTGGCTCAAATGGGAAGACCGGAATACAAAATGGGATTATTGGAGAACGGAAGACTTTTGGTCGTGGATTTTGGACAAGTCGAGCGAAGAGGAGATGG
AAAAGTCGATTATTATTATATGGAGCATATGGCAGCACAGGAACGAAATTCTCCAAAAATCATCCAGTCCAGTTGTGTTCAAATTATCAAAATTTATTGAAAGCAATATG
TTTGAGGAAAGAGAAACAGAACAATCTCACCGGTCGCGGCAAAGAGGGAAGCCTTCAAAGGGAGCGAAGAGCCATCCGAGTCAGTTCAGGTGGAAGTCCCCGCCATATCC
TTGTTGGAAACTCAATGTAGACGCTACTTGGAGTGAAAATCTGGGTAAAGGCGGCATAGGGTGGATCCTTCGTGACTCTGCGGGTTCTTCGTTATGCATGGGATTCAAAT
CGATCAGTAAAAGCTGGTCCATAAAGCTGCTTGAAATGAAAGCAATAGAAGAAGGGCTCAGAATCGTACCTTCTTTGATCGAGCGCTCCATTGTGATGACTATTCCCCCT
ATCGTGGTTGAGTCTAATGTTATTGGAGTCGTCCGCCTCCTTAATGGGGAAGAAGAAGATTTTTCTGAAATCTCCCATCTGGTGGAGGAGATTTTACGCCTCAAGGAGTC
TTTGGGGGAGGTTTCTTTCGTTTTTTGCCCAAGAGAGAGTAACGAGGTTGCCCACTGTTTGGCGCGCCTTTCTTCCTCTGTTTCCCCTGTATCTCGTTATTTGTCTGGTT
TTGAGATCTCTACTATTTCAGAAGAAGATCATGGTTGTTGGTTTGGCCCTCCCCCCTCTTGGTTAGTTGGGGTGTTAAATGGGTCTTGTTCTGATTCTTTTATTTCCCTT
TAA
Protein sequenceShow/hide protein sequence
MAPRAKWAFGSERITGPEKEMEKAMEVDSEQKELSSPNELLRSEVTGEESHQIRKWKRRARQGKTEGNREELFRMEKRKFQNSGFEGDKFTWRRRKDSRVGPLERLDRFF
VNPDMDKIYNFITVHHLRYHDSDHRPIMAICAQGCNFKRKKAVRVKRFEEAWISFPESKAIVEETWKSVQGGDAEAYKTKINLCLEKLTNWNKIRLEGSISKAIDRKTNE
IKMLEREESGCPSFNLIKAEKKLENLLLEEEQYWKMRSREDWLKWEDRNTKWDYWRTEDFWSWILDKSSEEEMEKSIIIIWSIWQHRNEILQKSSSPVVFKLSKFIESNM
FEERETEQSHRSRQRGKPSKGAKSHPSQFRWKSPPYPCWKLNVDATWSENLGKGGIGWILRDSAGSSLCMGFKSISKSWSIKLLEMKAIEEGLRIVPSLIERSIVMTIPP
IVVESNVIGVVRLLNGEEEDFSEISHLVEEILRLKESLGEVSFVFCPRESNEVAHCLARLSSSVSPVSRYLSGFEISTISEEDHGCWFGPPPSWLVGVLNGSCSDSFISL