; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg017970 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg017970
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold9:28861260..28866469
RNA-Seq ExpressionSpg017970
SyntenySpg017970
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH07150.1 TatD related DNase [Prunus dulcis]5.0e-4740.51Show/hide
Query:  KHALVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDL
        K  +VIL ETK   ++R +V  VW SR   W    ++G SGGI +LWN  +  V++   G FS+SI +    G  +W++G+YGP   RER SFW+EL+DL
Subjt:  KHALVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDL

Query:  QAICLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPI
           C   W +GGD+N+ R++ EKS     T+ M+ FN FI++  L+D +L N  + WS+ R N     +DRF +S +    F       L R+TSDH PI
Subjt:  QAICLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPI

Query:  CLMLGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDN
         L   + KWGPSPFR  N WL +  F   +  WW ++
Subjt:  CLMLGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDN

CAN83725.1 hypothetical protein VITISV_037053 [Vitis vinifera]1.2e-4538.55Show/hide
Query:  LVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDLQAI
        +V+LQETK    +R  V SVW  +++ W ++ A GASGGIVILW+   F   E   G FS+++ L+  +   FW+T VYGPN +  RK FW EL DL  +
Subjt:  LVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDLQAI

Query:  CLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPICLM
          P W +GGD+N+ +   E+      T  M+ F+ FI+++ L D  L N  + WS+ + +     +DRF  S    S F       L R TSDHS ICL 
Subjt:  CLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPICLM

Query:  LGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDNTSHDSKEGNESIYEDEFKRRSEIKAELLILSTNEEIMWRQR
            KWGP+PFR  N WL +  F      WW++ T    +EGN ++  D    R+  + EL  L   EE+ WRQ+
Subjt:  LGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDNTSHDSKEGNESIYEDEFKRRSEIKAELLILSTNEEIMWRQR

VVA20479.1 Hypothetical predicted protein, partial [Prunus dulcis]5.0e-4740.51Show/hide
Query:  KHALVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDL
        K  +VIL ETK   ++R +V  VW SR   W    ++G SGGI +LWN  +  V++   G FS+SI +    G  +W++G+YGP   RER SFW+EL+DL
Subjt:  KHALVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDL

Query:  QAICLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPI
           C   W +GGD+N+ R++ EKS     T+ M+ FN FI++  L+D +L N  + WS+ R N     +DRF +S +    F       L R+TSDH PI
Subjt:  QAICLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPI

Query:  CLMLGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDN
         L   + KWGPSPFR  N WL +  F   +  WW ++
Subjt:  CLMLGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDN

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]1.6e-5645.34Show/hide
Query:  LVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDLQAI
        +VIL ETK S+IN   +KS+WSS +I WAS+DA GASGGI++LW++ +   +E+  G FS+S+H  LAD F++W+TGVY P   ++RK FW+EL DL  +
Subjt:  LVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDLQAI

Query:  CLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPICLM
        C P W++G D+NI RW+ E S+   P  GM KFN FI+ A L D  ++N +Y WS+ RP++ ++ I+RF  S+  + KF   +  RL R  SDH PI L 
Subjt:  CLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPICLM

Query:  LGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDNTS
           ++WG  PFRL N WL +  F   +++ W   +S
Subjt:  LGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDNTS

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]9.8e-5938.24Show/hide
Query:  LVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDLQAI
        +VILQETKLS ++ +IVKS+WS+  I W+++DA G + GI+ILWN+      E+ EG+FSL+I+  L+DGF FW++G+YGP+ +     FW+EL DL  +
Subjt:  LVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDLQAI

Query:  CLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPICLM
        C  +WI+ GD+N+TRW+ EKS  R  T+ M  FN FIED++L D+ L+N ++ WS    N S +LID F ++     K     A R+ R TSDH PI L 
Subjt:  CLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPICLM

Query:  LGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDNTSH-------------------------------------------DSKEGNESIYEDEFKRRSEI
         G+  WG +PFR  N WLS+ +F   +++WW +   H                                           D  EG++ +  D+ + R + 
Subjt:  LGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDNTSH-------------------------------------------DSKEGNESIYEDEFKRRSEI

Query:  KAELLILSTNEEIMWRQRC
        K +LL +   EE  WRQRC
Subjt:  KAELLILSTNEEIMWRQRC

TrEMBL top hitse value%identityAlignment
A0A4Y1RS61 TatD related DNase2.4e-4740.51Show/hide
Query:  KHALVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDL
        K  +VIL ETK   ++R +V  VW SR   W    ++G SGGI +LWN  +  V++   G FS+SI +    G  +W++G+YGP   RER SFW+EL+DL
Subjt:  KHALVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDL

Query:  QAICLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPI
           C   W +GGD+N+ R++ EKS     T+ M+ FN FI++  L+D +L N  + WS+ R N     +DRF +S +    F       L R+TSDH PI
Subjt:  QAICLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPI

Query:  CLMLGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDN
         L   + KWGPSPFR  N WL +  F   +  WW ++
Subjt:  CLMLGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDN

A0A5E4F090 Reverse transcriptase domain-containing protein (Fragment)2.4e-4740.51Show/hide
Query:  KHALVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDL
        K  +VIL ETK   ++R +V  VW SR   W    ++G SGGI +LWN  +  V++   G FS+SI +    G  +W++G+YGP   RER SFW+EL+DL
Subjt:  KHALVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDL

Query:  QAICLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPI
           C   W +GGD+N+ R++ EKS     T+ M+ FN FI++  L+D +L N  + WS+ R N     +DRF +S +    F       L R+TSDH PI
Subjt:  QAICLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPI

Query:  CLMLGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDN
         L   + KWGPSPFR  N WL +  F   +  WW ++
Subjt:  CLMLGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDN

A0A6J1CVN2 uncharacterized protein LOC1110146577.6e-5745.34Show/hide
Query:  LVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDLQAI
        +VIL ETK S+IN   +KS+WSS +I WAS+DA GASGGI++LW++ +   +E+  G FS+S+H  LAD F++W+TGVY P   ++RK FW+EL DL  +
Subjt:  LVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDLQAI

Query:  CLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPICLM
        C P W++G D+NI RW+ E S+   P  GM KFN FI+ A L D  ++N +Y WS+ RP++ ++ I+RF  S+  + KF   +  RL R  SDH PI L 
Subjt:  CLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPICLM

Query:  LGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDNTS
           ++WG  PFRL N WL +  F   +++ W   +S
Subjt:  LGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDNTS

A0A6J1E2G6 uncharacterized protein LOC1110254054.7e-5938.24Show/hide
Query:  LVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDLQAI
        +VILQETKLS ++ +IVKS+WS+  I W+++DA G + GI+ILWN+      E+ EG+FSL+I+  L+DGF FW++G+YGP+ +     FW+EL DL  +
Subjt:  LVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDLQAI

Query:  CLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPICLM
        C  +WI+ GD+N+TRW+ EKS  R  T+ M  FN FIED++L D+ L+N ++ WS    N S +LID F ++     K     A R+ R TSDH PI L 
Subjt:  CLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPICLM

Query:  LGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDNTSH-------------------------------------------DSKEGNESIYEDEFKRRSEI
         G+  WG +PFR  N WLS+ +F   +++WW +   H                                           D  EG++ +  D+ + R + 
Subjt:  LGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDNTSH-------------------------------------------DSKEGNESIYEDEFKRRSEI

Query:  KAELLILSTNEEIMWRQRC
        K +LL +   EE  WRQRC
Subjt:  KAELLILSTNEEIMWRQRC

M5VS59 Reverse transcriptase domain-containing protein (Fragment)7.6e-4940.93Show/hide
Query:  KHALVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDL
        K  +VIL ETK  T++R +V  VW SR   W    ++G SGGI +LWN  +  V++   G FS+SI +    G  +W++G+YGP   RER SFW+EL+DL
Subjt:  KHALVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSLSIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDL

Query:  QAICLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPI
           C   W +GGD+N+ R++ EKS     T+ M+ FN FI++  L+D +L N  + WS+ R N     +DRF +S +    F       L R+TSDH PI
Subjt:  QAICLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFISENIASKFDSVNAIRLDRVTSDHSPI

Query:  CLMLGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDN
         L   + KWGPSPFR  N WL++  F   +  WW ++
Subjt:  CLMLGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAAATTCCCAAACCAAATAGTACAAAACAAAATGCCTCCAACACCCAATTTGTAGCCCTGTATTTAGACTCAGTCGTCATTGTGCAGAGGAAACATTTCCATGA
CAGTTGGAATGATATCATGCAAGCCTTTCAACAATCCCTCTCGGCTTATTCCTCGATCAGCCCCCTACAGCCAGACAAAGCCCTTCTCTCTTGCGAAGACAAGGAACAAG
CCAGCGTACTAGCAAATATTAAAGGGTGGTGCAAGGTTGGAAGATACCAAGTTTGTTTTCTCCCATGGAGTTCGGAAGCCATGAATGGTGACCAAAAGGTGTCGTCATTC
GGGGGCTGGATTAAAGTTCGAAATCTTCCTCTCGATAAATGGAATCTTCAAAACTTCAAAAAGATAGGAGATGAGTGTGGTGGGTATATGGAAACAGCAAGCAAAACCCT
ATCTAGAATGGACATGATGGAGATAGGGAAAGATAAAGGAAAATCACACGGGGTTTATTCCGGTCGAAATCACTCTCCTATATCACACCAACCCAGGGACTATCAGCAGG
CCATCTCCCCGCCGACCATTACCAACCTCTTTGACACACTTGATGAACAACCTCACCCTGATAGCCCCATCCCGCTGCGGATTGAAGACCCGAATGACACAGAAAAGGAG
CAGAGATTGTGTGTTGAAACATCAGCCCTAGTTGATATAAGTGTGGGAGAAATACAAGAGGAGGATTCAGAATCAGAGTTTCCATATCAGGCAAGCGATCCTGCGGTTTT
CTTGCCAGTTCTTTTCCCTTGGCTGGCTAAACATGCTTTAGTGATTCTTCAAGAGACGAAGCTCTCAACCATCAATAGAATAATTGTGAAATCGGTTTGGAGCTCGAGGA
ATATTACCTGGGCTTCGGTTGATGCTATTGGTGCCTCGGGAGGGATTGTAATCCTGTGGAATGAATCCACCTTTGATGTTCTCGAGATTTTCGAAGGTATTTTCTCTTTA
TCTATCCATCTCTCGCTTGCTGATGGTTTTTCCTTTTGGATCACAGGAGTATATGGCCCCAATTTTTCACGTGAGAGAAAATCATTTTGGAAGGAATTATCTGATTTGCA
AGCCATATGCCTTCCAAACTGGATTATGGGGGGAGATTACAATATTACTAGATGGACTATGGAAAAATCAACCTTCCGAGCCCCTACTCGTGGCATGAAAAAATTCAATA
GATTTATTGAAGATGCTGCTTTACAGGACATCCATTTATCCAACGACAAATACATTTGGTCTAGTTGTCGCCCAAACCTCAGCATGACTCTCATTGACCGGTTTTTTATA
TCAGAAAATATTGCTTCCAAATTCGATTCTGTAAATGCTATAAGACTTGACAGAGTTACTTCGGATCACTCCCCTATATGCCTCATGTTAGGAAAAGAAAAGTGGGGCCC
CTCTCCCTTTCGCCTTATAAATGCTTGGTTATCTAACAACTCCTTTTTTAGTACAGTCGACTCTTGGTGGAAGGACAACACATCTCATGACAGCAAGGAGGGAAATGAGT
CAATTTATGAGGATGAATTCAAAAGAAGATCTGAAATCAAGGCAGAATTGCTTATTTTATCAACCAATGAAGAGATTATGTGGCGCCAAAGATGTCCATCAGGTCCCATC
GGTAGCTCTATAAGGGCGTTAAGGATACAGAGAAGAAAAATACTCCTAACCCTTAGAAAATCACGCTCCCACAAGCCCCTAACGCACATCCTTGAAGAGAATACTGGTGC
AATCTTTGGTGGTGGTGTTCGTGATAATTTTCCAGCGAGATCAAGACTTTTTCGCTACTGGAATTTTCTGCAAATAGTAAGGGAAAAGGCGAAACGGATCAAGATCGTCT
ACAAAGGTATAGCGTTCTTGATCTTGGATCAATTGGATCCAATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAAATTCCCAAACCAAATAGTACAAAACAAAATGCCTCCAACACCCAATTTGTAGCCCTGTATTTAGACTCAGTCGTCATTGTGCAGAGGAAACATTTCCATGA
CAGTTGGAATGATATCATGCAAGCCTTTCAACAATCCCTCTCGGCTTATTCCTCGATCAGCCCCCTACAGCCAGACAAAGCCCTTCTCTCTTGCGAAGACAAGGAACAAG
CCAGCGTACTAGCAAATATTAAAGGGTGGTGCAAGGTTGGAAGATACCAAGTTTGTTTTCTCCCATGGAGTTCGGAAGCCATGAATGGTGACCAAAAGGTGTCGTCATTC
GGGGGCTGGATTAAAGTTCGAAATCTTCCTCTCGATAAATGGAATCTTCAAAACTTCAAAAAGATAGGAGATGAGTGTGGTGGGTATATGGAAACAGCAAGCAAAACCCT
ATCTAGAATGGACATGATGGAGATAGGGAAAGATAAAGGAAAATCACACGGGGTTTATTCCGGTCGAAATCACTCTCCTATATCACACCAACCCAGGGACTATCAGCAGG
CCATCTCCCCGCCGACCATTACCAACCTCTTTGACACACTTGATGAACAACCTCACCCTGATAGCCCCATCCCGCTGCGGATTGAAGACCCGAATGACACAGAAAAGGAG
CAGAGATTGTGTGTTGAAACATCAGCCCTAGTTGATATAAGTGTGGGAGAAATACAAGAGGAGGATTCAGAATCAGAGTTTCCATATCAGGCAAGCGATCCTGCGGTTTT
CTTGCCAGTTCTTTTCCCTTGGCTGGCTAAACATGCTTTAGTGATTCTTCAAGAGACGAAGCTCTCAACCATCAATAGAATAATTGTGAAATCGGTTTGGAGCTCGAGGA
ATATTACCTGGGCTTCGGTTGATGCTATTGGTGCCTCGGGAGGGATTGTAATCCTGTGGAATGAATCCACCTTTGATGTTCTCGAGATTTTCGAAGGTATTTTCTCTTTA
TCTATCCATCTCTCGCTTGCTGATGGTTTTTCCTTTTGGATCACAGGAGTATATGGCCCCAATTTTTCACGTGAGAGAAAATCATTTTGGAAGGAATTATCTGATTTGCA
AGCCATATGCCTTCCAAACTGGATTATGGGGGGAGATTACAATATTACTAGATGGACTATGGAAAAATCAACCTTCCGAGCCCCTACTCGTGGCATGAAAAAATTCAATA
GATTTATTGAAGATGCTGCTTTACAGGACATCCATTTATCCAACGACAAATACATTTGGTCTAGTTGTCGCCCAAACCTCAGCATGACTCTCATTGACCGGTTTTTTATA
TCAGAAAATATTGCTTCCAAATTCGATTCTGTAAATGCTATAAGACTTGACAGAGTTACTTCGGATCACTCCCCTATATGCCTCATGTTAGGAAAAGAAAAGTGGGGCCC
CTCTCCCTTTCGCCTTATAAATGCTTGGTTATCTAACAACTCCTTTTTTAGTACAGTCGACTCTTGGTGGAAGGACAACACATCTCATGACAGCAAGGAGGGAAATGAGT
CAATTTATGAGGATGAATTCAAAAGAAGATCTGAAATCAAGGCAGAATTGCTTATTTTATCAACCAATGAAGAGATTATGTGGCGCCAAAGATGTCCATCAGGTCCCATC
GGTAGCTCTATAAGGGCGTTAAGGATACAGAGAAGAAAAATACTCCTAACCCTTAGAAAATCACGCTCCCACAAGCCCCTAACGCACATCCTTGAAGAGAATACTGGTGC
AATCTTTGGTGGTGGTGTTCGTGATAATTTTCCAGCGAGATCAAGACTTTTTCGCTACTGGAATTTTCTGCAAATAGTAAGGGAAAAGGCGAAACGGATCAAGATCGTCT
ACAAAGGTATAGCGTTCTTGATCTTGGATCAATTGGATCCAATTTAG
Protein sequenceShow/hide protein sequence
MEKIPKPNSTKQNASNTQFVALYLDSVVIVQRKHFHDSWNDIMQAFQQSLSAYSSISPLQPDKALLSCEDKEQASVLANIKGWCKVGRYQVCFLPWSSEAMNGDQKVSSF
GGWIKVRNLPLDKWNLQNFKKIGDECGGYMETASKTLSRMDMMEIGKDKGKSHGVYSGRNHSPISHQPRDYQQAISPPTITNLFDTLDEQPHPDSPIPLRIEDPNDTEKE
QRLCVETSALVDISVGEIQEEDSESEFPYQASDPAVFLPVLFPWLAKHALVILQETKLSTINRIIVKSVWSSRNITWASVDAIGASGGIVILWNESTFDVLEIFEGIFSL
SIHLSLADGFSFWITGVYGPNFSRERKSFWKELSDLQAICLPNWIMGGDYNITRWTMEKSTFRAPTRGMKKFNRFIEDAALQDIHLSNDKYIWSSCRPNLSMTLIDRFFI
SENIASKFDSVNAIRLDRVTSDHSPICLMLGKEKWGPSPFRLINAWLSNNSFFSTVDSWWKDNTSHDSKEGNESIYEDEFKRRSEIKAELLILSTNEEIMWRQRCPSGPI
GSSIRALRIQRRKILLTLRKSRSHKPLTHILEENTGAIFGGGVRDNFPARSRLFRYWNFLQIVREKAKRIKIVYKGIAFLILDQLDPI