CuGenDBv2

Lag0009601 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0009601
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:40784965..40785826
RNA-Seq Expression: Lag0009601
Synteny: Lag0009601
Gene Ontology terms: NA
InterPro domains: NA
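
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; `Region` and `parse_region` are hypothetical names introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the length.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'seqid:start..end' string such as 'chr9:40784965..40785826'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Region(seqid, start, end)


region = parse_region("chr9:40784965..40785826")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the gene spans 862 bp on chr9.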


Homology
GenBank top hits (e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 4.2e-34 | % identity: 36.21
Query:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSK------IPNPAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+  SE PSK++ S +S        PNPAY  W RQD LI +
Subjt:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSK------IPNPAYDHWVRQDSLIIA

Query:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNV
        WLL SMS  +L++ML C++A+E+W  LQ  FSSR +A+ M  K+KL ++KK                                         +Y S ++V
Subjt:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNV

Query:  ITEKDETPSLQTVYSLLFTQENGIARNLSINLDGSTL-SINLTTQTGSK---------QQQTSSSDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCRRFGH
        I+ + ++PS+Q V SLL TQE   ++N S  +  + L S+N+ TQT  K         Q    ++ S N++   G G +NR    N NKPQCQ+C + G+
Subjt:  ITEKDETPSLQTVYSLLFTQENGIARNLSINLDGSTL-SINLTTQTGSK---------QQQTSSSDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCRRFGH

Query:  T
        +
Subjt:  T

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] | e value: 4.2e-26 | % identity: 46.05
Query:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDS------KIPNPAYDHWVRQDSLIIA
        M S S+       E SS   +I    NKI+ +KL +DNFLLWK QILT L  + L++ L+  SE PSK++ S  S      + PNP Y  W RQD LI +
Subjt:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDS------KIPNPAYDHWVRQDSLIIA

Query:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKKE
        WLL SMS  +L++ML C++A+E+W  LQ  FSSR +A+ M  K+KL ++KKE
Subjt:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKKE

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 4.2e-34 | % identity: 36.21
Query:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSK------IPNPAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+  SE PSK++ S +S        PNPAY  W RQD LI +
Subjt:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSK------IPNPAYDHWVRQDSLIIA

Query:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNV
        WLL SMS  +L++ML C++A+E+W  LQ  FSSR +A+ M  K+KL ++KK                                         +Y S ++V
Subjt:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNV

Query:  ITEKDETPSLQTVYSLLFTQENGIARNLSINLDGSTL-SINLTTQTGSK---------QQQTSSSDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCRRFGH
        I+ + ++PS+Q V SLL TQE   ++N S  +  + L S+N+ TQT  K         Q    ++ S N++   G G +NR    N NKPQCQ+C + G+
Subjt:  ITEKDETPSLQTVYSLLFTQENGIARNLSINLDGSTL-SINLTTQTGSK---------QQQTSSSDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCRRFGH

Query:  T
        +
Subjt:  T

TYK18917.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa] | e value: 4.2e-26 | % identity: 46.05
Query:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDS------KIPNPAYDHWVRQDSLIIA
        M S S+       E SS   +I    NKI+ +KL +DNFLLWK QILT L  + L++ L+  SE PSK++ S  S      + PNP Y  W RQD LI +
Subjt:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDS------KIPNPAYDHWVRQDSLIIA

Query:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKKE
        WLL SMS  +L++ML C++A+E+W  LQ  FSSR +A+ M  K+KL ++KKE
Subjt:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKKE

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] | e value: 1.1e-42 | % identity: 40.34
Query:  NSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFI------TSGDSKIPNPAYDHWVRQDSLIIAWLLRSMSNS
        NS+     Q+ K INPG+K++ ++L++DN LLWK QI T L+G+ L+ ++D   +TP++F+      +S  S   NPAY  W++QD LI AWLL SM+  
Subjt:  NSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFI------TSGDSKIPNPAYDHWVRQDSLIIAWLLRSMSNS

Query:  LLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNVITEKDETPS
        +LS+ML+C++ARE+W +L+  F+SR +AR+M LK KLE+ KK                                         E+D+ ++VIT ++   +
Subjt:  LLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNVITEKDETPS

Query:  LQTVYSLLFTQENGIARNLSINLDGSTLSINLTTQTGSKQQQTSSSDSNNRKNSN----GKGNN----NRRSWNNNNKPQCQLCRRFGHT
        LQ V SLL  QE    RNL IN DGS  S+NLT    SK+     S   N   SN    G+G N    NRR+W  NNKPQCQ+C RFGHT
Subjt:  LQTVYSLLFTQENGIARNLSINLDGSTLSINLTTQTGSKQQQTSSSDSNNRKNSN----GKGNN----NRRSWNNNNKPQCQLCRRFGHT

TrEMBL top hits (e value, % identity, alignment)
A0A438FKN9 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 8.6e-25 | % identity: 31.88
Query:  KLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGD--SKIPNPAYDHWVRQDSLIIAWLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNV
        KLD  NFL+W+ QILTTLRGH+L+H L E S  PS+F++S D      NP +  W +QD LI++WLL S++++LL+ M+ C+T+ +VW+ L+  F+++  
Subjt:  KLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGD--SKIPNPAYDHWVRQDSLIIAWLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNV

Query:  ARIMDLKSKLESLKK-----------------------------------------EYDSTVNVITEKDETPSLQTVYSLLFTQENGIARNLSINLDGST
        A++   K++L + KK                                         +Y++ +  +  + +  +++ + +LL  QE+ I +N+ I  D ST
Subjt:  ARIMDLKSKLESLKK-----------------------------------------EYDSTVNVITEKDETPSLQTVYSLLFTQENGIARNLSINLDGST

Query:  LSIN---LTTQTGSK--QQQTSSSDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCRRFGHTV
         S+     T + GS     + S+ +SN R                   G+G + R SW  NNKPQCQLC R GH V
Subjt:  LSIN---LTTQTGSK--QQQTSSSDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCRRFGHTV

A0A438IKE7 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 8.6e-25 | % identity: 31.64
Query:  KLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGD--SKIPNPAYDHWVRQDSLIIAWLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNV
        KLD  NFL+W+ QILTTLRGH+L+H L E S  PS+F++S D      NP +  W +QD LI++WLL S++++LL+ M+ C+T+ +VW+ L+  F+++  
Subjt:  KLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGD--SKIPNPAYDHWVRQDSLIIAWLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNV

Query:  ARIMDLKSKLESLKK-----------------------------------------EYDSTVNVITEKDETPSLQTVYSLLFTQENGIARNLSI-NLDGS
        A++   K++L + KK                                         +Y++ +  +  + +  +++ + +LL  QE+ I +N+ I +L   
Subjt:  ARIMDLKSKLESLKK-----------------------------------------EYDSTVNVITEKDETPSLQTVYSLLFTQENGIARNLSI-NLDGS

Query:  TLSINLTT-QTGSK--QQQTSSSDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCRRFGHTV
        +L+  +TT + GS     + S+ +SN R                   G+G + R SW  NNKPQCQLC R GH V
Subjt:  TLSINLTT-QTGSK--QQQTSSSDSNNRK---------------NSNGKGNNNRRSWNNNNKPQCQLCRRFGHTV

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.0e-34 | % identity: 36.21
Query:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSK------IPNPAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+  SE PSK++ S +S        PNPAY  W RQD LI +
Subjt:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSK------IPNPAYDHWVRQDSLIIA

Query:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNV
        WLL SMS  +L++ML C++A+E+W  LQ  FSSR +A+ M  K+KL ++KK                                         +Y S ++V
Subjt:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNV

Query:  ITEKDETPSLQTVYSLLFTQENGIARNLSINLDGSTL-SINLTTQTGSK---------QQQTSSSDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCRRFGH
        I+ + ++PS+Q V SLL TQE   ++N S  +  + L S+N+ TQT  K         Q    ++ S N++   G G +NR    N NKPQCQ+C + G+
Subjt:  ITEKDETPSLQTVYSLLFTQENGIARNLSINLDGSTL-SINLTTQTGSK---------QQQTSSSDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCRRFGH

Query:  T
        +
Subjt:  T

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.0e-34 | % identity: 36.21
Query:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSK------IPNPAYDHWVRQDSLIIA
        M S S+       E SS   +I   GNKI+ +KL++D FLLWK QILT L  + L++ L+  SE PSK++ S +S        PNPAY  W RQD LI +
Subjt:  MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSK------IPNPAYDHWVRQDSLIIA

Query:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNV
        WLL SMS  +L++ML C++A+E+W  LQ  FSSR +A+ M  K+KL ++KK                                         +Y S ++V
Subjt:  WLLRSMSNSLLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNV

Query:  ITEKDETPSLQTVYSLLFTQENGIARNLSINLDGSTL-SINLTTQTGSK---------QQQTSSSDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCRRFGH
        I+ + ++PS+Q V SLL TQE   ++N S  +  + L S+N+ TQT  K         Q    ++ S N++   G G +NR    N NKPQCQ+C + G+
Subjt:  ITEKDETPSLQTVYSLLFTQENGIARNLSINLDGSTL-SINLTTQTGSK---------QQQTSSSDSNNRKNSNGKGNNNRRSWNNNNKPQCQLCRRFGH

Query:  T
        +
Subjt:  T

A0A6J1DLT9 uncharacterized protein LOC111021757 | e value: 5.3e-43 | % identity: 40.34
Query:  NSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFI------TSGDSKIPNPAYDHWVRQDSLIIAWLLRSMSNS
        NS+     Q+ K INPG+K++ ++L++DN LLWK QI T L+G+ L+ ++D   +TP++F+      +S  S   NPAY  W++QD LI AWLL SM+  
Subjt:  NSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFI------TSGDSKIPNPAYDHWVRQDSLIIAWLLRSMSNS

Query:  LLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNVITEKDETPS
        +LS+ML+C++ARE+W +L+  F+SR +AR+M LK KLE+ KK                                         E+D+ ++VIT ++   +
Subjt:  LLSEMLECETAREVWRILQNRFSSRNVARIMDLKSKLESLKK-----------------------------------------EYDSTVNVITEKDETPS

Query:  LQTVYSLLFTQENGIARNLSINLDGSTLSINLTTQTGSKQQQTSSSDSNNRKNSN----GKGNN----NRRSWNNNNKPQCQLCRRFGHT
        LQ V SLL  QE    RNL IN DGS  S+NLT    SK+     S   N   SN    G+G N    NRR+W  NNKPQCQ+C RFGHT
Subjt:  LQTVYSLLFTQENGIARNLSINLDGSTLSINLTTQTGSKQQQTSSSDSNNRKNSN----GKGNN----NRRSWNNNNKPQCQLCRRFGHT

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.3e-09 | % identity: 23.37
Query:  EVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSKIPNPAYDHWVRQDSLIIAWLLRSMSNSLLSEMLEC
        E+ +++ S+  +N  N     KL   N+L+W  Q+     G+ L   LD  +  P   I +  +   NP Y  W RQD LI + +L ++S S+   +   
Subjt:  EVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSKIPNPAYDHWVRQDSLIIAWLLRSMSNSLLSEMLEC

Query:  ETAREVWRILQNRFSSRNVARIMDLKSK-----------------------------------------LESLKKEYDSTVNVITEKDETPSLQTVYSLL
         TA ++W  L+  +++ +   +  L+++                                         LE+L +EY   ++ I  KD  P+L  ++  L
Subjt:  ETAREVWRILQNRFSSRNVARIMDLKSK-----------------------------------------LESLKKEYDSTVNVITEKDETPSLQTVYSLL

Query:  FTQENGIARNLSINLDGSTLSINLTTQTGSKQQQTSSSDSN----NRKNSNGKGNNNRRSW----------NNNNKP---QCQLCRRFGHT
           E+ I   L+++   S   I +T    S +  T+++++N    N +  N   NNN + W          NN +KP   +CQ+C   GH+
Subjt:  FTQENGIARNLSINLDGSTLSINLTTQTGSKQQQTSSSDSN----NRKNSNGKGNNNRRSW----------NNNNKP---QCQLCRRFGHT

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 3.0e-11 | % identity: 25.5
Query:  NKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSKIP--NPAYDHWVRQDSLIIAWLLRSMSNSLLSEMLECETAREVWRILQNR
        N     KL   N+L+W  Q+     G+ L   LD    TP    T G   +P  NP Y  W RQD LI + +L ++S S+   +    TA ++W  L+  
Subjt:  NKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSKIP--NPAYDHWVRQDSLIIAWLLRSMSNSLLSEMLECETAREVWRILQNR

Query:  FSSRNVARIMDLK----------------------SKLESLKKEYDSTVNVITEKDETPSLQTVYSLLFTQENGIARNLSINLDGSTLSINLTTQTGSKQ
        +++ +   +  L+                        LE+L  +Y   ++ I  KD  PSL  ++  L  +E+ +    S  +   T ++     T + +
Subjt:  FSSRNVARIMDLK----------------------SKLESLKKEYDSTVNVITEKDETPSLQTVYSLLFTQENGIARNLSINLDGSTLSINLTTQTGSKQ

Query:  QQTSSSDSNNRKNSNGKGN------NNRRSWNNNNKP---QCQLCRRFGHT
         Q +  D+ N  N+N + N      +  RS N   KP   +CQ+C   GH+
Subjt:  QQTSSSDSNNRKNSNGKGN------NNRRSWNNNNKP---QCQLCRRFGHT

Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e value: 8.5e-09 | % identity: 27.27
Query:  ITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSKIPNPAYDHWVRQDSLIIAWLLRSMSNSLLSEMLECETAREVWRILQNRFSSR
        I  +  DEDN++ WK++  + LR  +    +D     P  F         +P Y  W + +++++ WL+ SM++ LL  ++  ETA ++W  L+  F   
Subjt:  ITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSKIPNPAYDHWVRQDSLIIAWLLRSMSNSLLSEMLECETAREVWRILQNRFSSR

Query:  NVARIMDLKSKLESLKKEYDS
           +I  L+ +L +L++  DS
Subjt:  NVARIMDLKSKLESLKKEYDS
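
The unlabelled middle line of each alignment block appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the snippet below counts identities and positives for a single block from its query line and middle line. `block_stats` is a hypothetical helper, and the two strings are copied from the last Arabidopsis block above. The % identity the page reports is presumably computed over the whole alignment, so one block is not expected to reproduce it.

```python
def block_stats(query: str, midline: str) -> dict:
    """Summarise one BLAST-style alignment block from its query and middle lines.

    Assumes the middle line is column-aligned with the query line and uses
    letters for identities, '+' for positives, and spaces for everything else.
    """
    # Trailing spaces are often stripped from the middle line; pad it back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("middle line is longer than the query line")

    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {
        "columns": len(query),
        "identities": identities,
        "positives": identities + conservative,
        "percent_identity": round(100.0 * identities / len(query), 2),
    }


query = "NVARIMDLKSKLESLKKEYDS"
midline = "   +I  L+ +L +L++  DS"
print(block_stats(query, midline))
```

For this 21-column block the count is 6 identities and 12 positives, about 28.57% identity within the block.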


Sequences
CDS sequence
ATGGAGTCTCCTTCAACAGAGAAAAGCAATTCTGAGGTTGAAGTTTCATCTCAAAGTCTAAAAATTATAAACCCAGGCAACAAGATCACAACGATGAAGCTTGATGAAGA
CAATTTTCTTCTGTGGAAACTGCAAATTCTTACTACCCTACGAGGCCACAGATTAAAACATCATCTTGATGAAGGCTCAGAAACTCCTTCAAAGTTTATTACAAGCGGCG
ATTCGAAAATCCCTAATCCCGCTTATGATCATTGGGTTCGACAAGACAGCCTTATTATCGCCTGGTTACTTCGCTCGATGTCTAACTCGCTACTTTCGGAAATGCTCGAG
TGCGAAACTGCTCGAGAGGTGTGGAGAATTCTTCAGAATCGATTCTCTTCACGAAACGTTGCAAGGATTATGGATTTAAAATCCAAATTGGAATCGCTCAAGAAAGAGTA
TGACTCTACTGTCAATGTGATTACGGAAAAAGATGAGACTCCTTCATTGCAGACGGTGTATTCGCTTCTTTTTACTCAAGAAAACGGGATTGCTCGAAATCTTTCTATAA
ATCTTGATGGCTCAACTCTTTCTATTAATCTCACAACCCAGACGGGATCGAAACAACAACAGACTTCTTCGTCTGACTCCAACAATCGCAAGAATTCAAATGGGAAAGGC
AACAATAATCGGCGATCTTGGAACAATAATAATAAACCTCAGTGCCAGCTTTGTAGAAGATTTGGACACACTGTTTAG
mRNA sequence
ATGGAGTCTCCTTCAACAGAGAAAAGCAATTCTGAGGTTGAAGTTTCATCTCAAAGTCTAAAAATTATAAACCCAGGCAACAAGATCACAACGATGAAGCTTGATGAAGA
CAATTTTCTTCTGTGGAAACTGCAAATTCTTACTACCCTACGAGGCCACAGATTAAAACATCATCTTGATGAAGGCTCAGAAACTCCTTCAAAGTTTATTACAAGCGGCG
ATTCGAAAATCCCTAATCCCGCTTATGATCATTGGGTTCGACAAGACAGCCTTATTATCGCCTGGTTACTTCGCTCGATGTCTAACTCGCTACTTTCGGAAATGCTCGAG
TGCGAAACTGCTCGAGAGGTGTGGAGAATTCTTCAGAATCGATTCTCTTCACGAAACGTTGCAAGGATTATGGATTTAAAATCCAAATTGGAATCGCTCAAGAAAGAGTA
TGACTCTACTGTCAATGTGATTACGGAAAAAGATGAGACTCCTTCATTGCAGACGGTGTATTCGCTTCTTTTTACTCAAGAAAACGGGATTGCTCGAAATCTTTCTATAA
ATCTTGATGGCTCAACTCTTTCTATTAATCTCACAACCCAGACGGGATCGAAACAACAACAGACTTCTTCGTCTGACTCCAACAATCGCAAGAATTCAAATGGGAAAGGC
AACAATAATCGGCGATCTTGGAACAATAATAATAAACCTCAGTGCCAGCTTTGTAGAAGATTTGGACACACTGTTTAG
Protein sequence
MESPSTEKSNSEVEVSSQSLKIINPGNKITTMKLDEDNFLLWKLQILTTLRGHRLKHHLDEGSETPSKFITSGDSKIPNPAYDHWVRQDSLIIAWLLRSMSNSLLSEMLE
CETAREVWRILQNRFSSRNVARIMDLKSKLESLKKEYDSTVNVITEKDETPSLQTVYSLLFTQENGIARNLSINLDGSTLSINLTTQTGSKQQQTSSSDSNNRKNSNGKG
NNNRRSWNNNNKPQCQLCRRFGHTV
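
The protein sequence should correspond to a codon-by-codon translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code (the page does not name a translation table). `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGAGTCTCCTTCAACAGAGAAAAGCAAT"))
print(translate("CACACTGTTTAG"))
```

The two fragments translate to `MESPSTEKSN` and `HTV`, matching the start and end of the listed protein sequence. Passing the full CDS (with line breaks removed) to the same function allows the whole protein to be compared.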