; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019801 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019801
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionMitochondrial pyruvate carrier 2-like
Genome locationscaffold5:34800145..34803877
RNA-Seq ExpressionSpg019801
SyntenySpg019801
Gene Ontology termsNA
InterPro domainsIPR004332 - Transposase, MuDR, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019402.1 hypothetical protein SDJN02_18363, partial [Cucurbita argyrosperma subsp. argyrosperma]5.1e-6777.72Show/hide
Query:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY
        MKKLYR+RGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN S H GKA HQKP   AAAK GSDHPPVFSC CFRCYTSY
Subjt:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY

Query:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI
        WVRWDSSPNRQ+IHEIIDAYEE LA++K GKN+KKERKKRNTGSGSG  S  G+GKGSE     EESR TEME A    GGE EAEKGSVR I
Subjt:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI

XP_022927395.1 uncharacterized protein LOC111434229 [Cucurbita moschata]1.0e-6778.24Show/hide
Query:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY
        MKKLYR+RGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN SGH GKA HQKP   AAAK GSDHPPVFSC CFRCYTSY
Subjt:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY

Query:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI
        WVRWDSSPNRQ+IHEIIDAYEE LA++K GKN+KKERKKRNTGSGSG  S  G+GKGSE     EESR TEME A    GGE EAEKGSVR I
Subjt:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI

XP_023001610.1 uncharacterized protein LOC111495687 [Cucurbita maxima]7.9e-6877.2Show/hide
Query:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY
        M KLYR+RGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN SGH GKA  QKP   AAAK GSDHPP FSC CFRCYTSY
Subjt:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY

Query:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI
        WVRWDSSPNRQ+IHEIIDAYEE LA++K GKN+KKERKKRNTGSGSG  S  G+GKGSE     EESR TEME A  GGGGE EAEKG+VR I
Subjt:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI

XP_023520004.1 uncharacterized protein LOC111783314 [Cucurbita pepo subsp. pepo]2.2e-7079.27Show/hide
Query:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY
        MKKLYR+RGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN SGH GKA HQKP   AAA+ GSDHPPVFSC CFRCYTSY
Subjt:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY

Query:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI
        WVRWDSSPNRQ+IHEIIDAYEE LA++K GKNSKKERKKRNTGSGSG  S  G+GKGSE     EESR TEME A  GGGGE EAEKGSVR I
Subjt:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI

XP_038894832.1 uncharacterized protein LOC120083238 [Benincasa hispida]6.7e-6779.06Show/hide
Query:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY
        MKKLYRKRGTVHPSPPIISDHLSFLP AILTLAAALS EDREVLAYLISS      AVNN S H GKA HQKP   AAAKGGSDHPP FSC CFRCYTSY
Subjt:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY

Query:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGSGEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI
        WVRWDSSPNRQLIHEIIDAYEEKLA++K GKN+KKERKKRN+G  S  G GEGK SEP + EEE R TE E A    GGEEE EKGSVRRI
Subjt:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGSGEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI

TrEMBL top hitse value%identityAlignment
A0A0A0LVV3 Uncharacterized protein8.6e-6074.35Show/hide
Query:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY
        MKKLYRKRGTVHPSP IISDHLSFLP  ILTLAAALS  DREVLAYLISS      AV N S H GKATHQK    AAA GG DHPP FSC CF+CYTSY
Subjt:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY

Query:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGSGEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI
        WVRWDSSPNRQLIHEIIDAYEEKLA++K GKN+KKERKKRN   G   G GEGKGSE  + EEE R TE E A    GGEE AEKG VRRI
Subjt:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGSGEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI

A0A5D3CKA3 Uncharacterized protein8.0e-5870.16Show/hide
Query:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSA------VNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY
        MKKLYRK GTVHPSPP+ISDHLSFLP AILTL++ALS +DREVLAYLISS       V+N S H GKA H K AA  A   GSDHPP FSC CF+CYTSY
Subjt:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSA------VNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY

Query:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGSGEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI
        WVRWDSSPNRQLIHEIIDAYE+KLA+TK GKN+KKERKKRN+ SG+  G GEGKG+E  +  EE + TE        GGEEEAEKG VRRI
Subjt:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGSGEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI

A0A6J1ENT0 uncharacterized protein LOC1114342295.0e-6878.24Show/hide
Query:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY
        MKKLYR+RGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN SGH GKA HQKP   AAAK GSDHPPVFSC CFRCYTSY
Subjt:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY

Query:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI
        WVRWDSSPNRQ+IHEIIDAYEE LA++K GKN+KKERKKRNTGSGSG  S  G+GKGSE     EESR TEME A    GGE EAEKGSVR I
Subjt:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI

A0A6J1JYL3 uncharacterized protein LOC1114900452.5e-5971.81Show/hide
Query:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSAVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSYWVRWDS
        MKK YR+RGTVHPSPP ISDHLSFLPAAILTLAAAL+ EDREVLAYLISSA  N SGH GK++ QK   T +AK GSDHPP FSCGCFRCYT YWVRWDS
Subjt:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSAVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSYWVRWDS

Query:  SPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGSGEGKGSEPGSNEEESRPTEMETAGD---GGGGEEEAEKGSVRRI
        SPNR++IHEII+AYEEKLA+TKTGK +KKERKKRN          EG  SE    EE+ R TE E AGD   GGGGEE AEKG+VR+I
Subjt:  SPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGSGEGKGSEPGSNEEESRPTEMETAGD---GGGGEEEAEKGSVRRI

A0A6J1KLN6 uncharacterized protein LOC1114956873.8e-6877.2Show/hide
Query:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY
        M KLYR+RGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN SGH GKA  QKP   AAAK GSDHPP FSC CFRCYTSY
Subjt:  MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSY

Query:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI
        WVRWDSSPNRQ+IHEIIDAYEE LA++K GKN+KKERKKRNTGSGSG  S  G+GKGSE     EESR TEME A  GGGGE EAEKG+VR I
Subjt:  WVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGS--GEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein4.1e-3046.51Show/hide
Query:  MKKLYRKRGTVHPSPPII--SDH-LSFLPAAILTLAAALSPEDREVLAYLISSAVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSYWVR
        MKKLYRK GTVHPSPP I  +DH L+ LP AI +LAA LSPEDREVLAYLIS+A  + SG     +              +H P+F C CF CYTSYWVR
Subjt:  MKKLYRKRGTVHPSPPII--SDH-LSFLPAAILTLAAALSPEDREVLAYLISSAVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEEKLAQTKTGKNS---KKERKKRNTGS-----GSGFGSGEGK-GSEPGSNEEESRP--TEMETAGDGGG--------------
        WDSSP+RQLIHEIIDA+E+ L + K  K +   KK+R+KR+  S      S F + + +  S  G +   S P  +  E   DGGG              
Subjt:  WDSSPNRQLIHEIIDAYEEKLAQTKTGKNS---KKERKKRNTGS-----GSGFGSGEGK-GSEPGSNEEESRP--TEMETAGDGGG--------------

Query:  ---GEEEAEKGSVRR
            + E EKG+VRR
Subjt:  ---GEEEAEKGSVRR

AT1G24270.1 unknown protein6.6e-2042.25Show/hide
Query:  KRGTVHPSPPIIS-------DHLS---FLPAAILTLAAALSPEDREVLAYLISSAVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSYWV
        K+G VHPSPP+ S       D LS    L +AIL L + LS ED EVLAYLI+ ++N  +              +  K  S   P+  C CF CYTSYW 
Subjt:  KRGTVHPSPPIIS-------DHLS---FLPAAILTLAAALSPEDREVLAYLISSAVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSYWV

Query:  RWDSSPNRQLIHEIIDAYEEKLAQ-----TKTGKNSKKERKK
        +WDSS NR+LI++II+A+E+ L +     + T K +KK  KK
Subjt:  RWDSSPNRQLIHEIIDAYEEKLAQ-----TKTGKNSKKERKK

AT1G62422.1 unknown protein8.8e-3352.63Show/hide
Query:  MKKLYRKRGTVHPSPP--IISDH--LSFLPAAILTLAAALSPEDREVLAYLISSAVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSYWV
        MKKL RK GTVHPSPP  I +D   LS LP AIL+L AALS EDREVLAYLIS+     SG S + +  K       K  + H P+F C CF CYTSYWV
Subjt:  MKKLYRKRGTVHPSPP--IISDH--LSFLPAAILTLAAALSPEDREVLAYLISSAVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSYWV

Query:  RWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRN-TGSGSGFGSGEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI
        RWD+SP RQLIHEIIDAYE+ L      K  KK+R+KR+   SG     G  + SE GS+  E    + E  G+ GG E E EKGSV ++
Subjt:  RWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRN-TGSGSGFGSGEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRI

AT5G13090.1 unknown protein1.0e-2037.43Show/hide
Query:  RKRGTVHPSPP---------IISDHLS-----------FLPAAILTLAAALSPEDREVLAYLISSAVN-NCSGHSGKATHQKPAATAAAKGGSDHPPVFS
        +K+G V+PSPP           S+HL+            LPA IL L + LS E+REVLAYLI+     +  G+S      K  +  ++K  +  PPVF 
Subjt:  RKRGTVHPSPP---------IISDHLS-----------FLPAAILTLAAALSPEDREVLAYLISSAVN-NCSGHSGKATHQKPAATAAAKGGSDHPPVFS

Query:  CGCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGSGEGKGS--EPGSNEEESRP
        C CF CYT+YW RWDSSPNR+LIHEII+A+E    +  +   SK +R K+    G      + K +     + +++S+P
Subjt:  CGCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGSGEGKGS--EPGSNEEESRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGCTCTACCGAAAAAGAGGAACGGTCCACCCATCGCCGCCGATCATCTCCGACCACCTCTCGTTCCTCCCCGCCGCCATCCTCACCCTCGCGGCGGCGCTCTC
GCCGGAGGACCGAGAGGTCTTGGCCTACCTCATCTCCTCCGCCGTCAACAACTGCTCCGGCCACAGCGGCAAGGCGACCCATCAGAAACCCGCCGCCACCGCCGCCGCAA
AGGGTGGTTCCGATCACCCCCCGGTTTTCTCCTGCGGCTGCTTCCGGTGCTACACGAGCTACTGGGTCAGATGGGACTCGTCTCCGAACCGGCAACTCATACACGAAATC
ATCGACGCTTATGAAGAAAAATTGGCTCAGACCAAAACCGGCAAGAACAGTAAGAAAGAGAGGAAGAAGAGAAACACCGGGTCAGGGTCAGGGTTCGGGTCAGGTGAAGG
AAAGGGGTCCGAACCGGGCTCGAATGAAGAAGAGTCGAGGCCGACGGAGATGGAGACGGCGGGAGACGGCGGCGGCGGTGAGGAAGAGGCGGAGAAAGGGTCGGTGAGGA
GGATAAAGGGCGGAAGTAAGACCAGGAGGTCTAGGCGTTTCACCATTACATTTCACTACAATGGGAGAATACAATATAGTCCTAGGGTTGAGTATTTTGGGGGAAAGGTA
TCTGTGGTGAAAGACGTAGAACCCAGTGGTCTGACGATCCTAGATTTATCTGTCCTCACAGCATACCTAGGTGTATGTAATGCTGCTTGGTTTGGTTACGTGGTGCCAGG
GAAAACATTGAGTACAGGATATAAGGGGATTCTGTCCGATGACGATATTTTGGATATGGTTGAGATGCTCCCAGAGGATAGGATGATCCATATGTATGTGGAGCACAACC
CGAATAGAGAAATTATAGATTTTACGGTACCTATCGCTGAGGTTAAACCACTGTTTTTGGAGTGGTACCCTGAGGAAGCGAGTGTAGAGTCCCTTAATAAGGAAGGAACT
GACATGGAAGGAATAAATTCTATTTCTCAGAGGGAAATGACAAAAAATGTGGAAGGAACAGACAAGGAAGAAATGGATTATATGGTTGAGAAGGAAATGGAAAAACATGT
GGATGGAAGTGATAAGGAAGAAATAACATCTATGGAAGGCACTGACCAAAACCATCTGGAAGTTGTAGAGGAAGACCCATTTGAGTATTGTGATGAAGAATGGGATGGAG
ATGAGTCAGAAGATGATGGTCACGATGATAACGACAAAGATGTCGGGGGTACAAGTACAAATGAGACTTGTGAAGAGAATGAAGTTCAATCTGGCAATGTCGAATGTGAC
TCTGAAGATGATTATGAAGAAGGAGAAGATTCTAGCGATGAGGGTTCTGTGGAGGCGAATGAACCATTCGATGCACACATTTCTATTGACGTAGATGTGGAATCGGATTA
TCGATCATCGAGTGATTTGAATTCTCTAGTAAATTCAAGTGGAAGAAGAGTGAATGTTGATCCTGAGTTTAGAGAAGACACAGATATGGGAAAGATTGAATTTGTTATCG
GAATGAAGTTTAGCACTTCTAACGTAATGAAAGATGCTATAAAAGAGTATGCAGTCAGAGGTGGGTACAATATTCGATTGATAAAGAATGACAAGCAACGAGTCACAGCT
ACTTGTGATGGAGGATGTACTTGGAGACTACATGCTAGTGTGGCTAAGGGGGAGGCCACTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGCTCTACCGAAAAAGAGGAACGGTCCACCCATCGCCGCCGATCATCTCCGACCACCTCTCGTTCCTCCCCGCCGCCATCCTCACCCTCGCGGCGGCGCTCTC
GCCGGAGGACCGAGAGGTCTTGGCCTACCTCATCTCCTCCGCCGTCAACAACTGCTCCGGCCACAGCGGCAAGGCGACCCATCAGAAACCCGCCGCCACCGCCGCCGCAA
AGGGTGGTTCCGATCACCCCCCGGTTTTCTCCTGCGGCTGCTTCCGGTGCTACACGAGCTACTGGGTCAGATGGGACTCGTCTCCGAACCGGCAACTCATACACGAAATC
ATCGACGCTTATGAAGAAAAATTGGCTCAGACCAAAACCGGCAAGAACAGTAAGAAAGAGAGGAAGAAGAGAAACACCGGGTCAGGGTCAGGGTTCGGGTCAGGTGAAGG
AAAGGGGTCCGAACCGGGCTCGAATGAAGAAGAGTCGAGGCCGACGGAGATGGAGACGGCGGGAGACGGCGGCGGCGGTGAGGAAGAGGCGGAGAAAGGGTCGGTGAGGA
GGATAAAGGGCGGAAGTAAGACCAGGAGGTCTAGGCGTTTCACCATTACATTTCACTACAATGGGAGAATACAATATAGTCCTAGGGTTGAGTATTTTGGGGGAAAGGTA
TCTGTGGTGAAAGACGTAGAACCCAGTGGTCTGACGATCCTAGATTTATCTGTCCTCACAGCATACCTAGGTGTATGTAATGCTGCTTGGTTTGGTTACGTGGTGCCAGG
GAAAACATTGAGTACAGGATATAAGGGGATTCTGTCCGATGACGATATTTTGGATATGGTTGAGATGCTCCCAGAGGATAGGATGATCCATATGTATGTGGAGCACAACC
CGAATAGAGAAATTATAGATTTTACGGTACCTATCGCTGAGGTTAAACCACTGTTTTTGGAGTGGTACCCTGAGGAAGCGAGTGTAGAGTCCCTTAATAAGGAAGGAACT
GACATGGAAGGAATAAATTCTATTTCTCAGAGGGAAATGACAAAAAATGTGGAAGGAACAGACAAGGAAGAAATGGATTATATGGTTGAGAAGGAAATGGAAAAACATGT
GGATGGAAGTGATAAGGAAGAAATAACATCTATGGAAGGCACTGACCAAAACCATCTGGAAGTTGTAGAGGAAGACCCATTTGAGTATTGTGATGAAGAATGGGATGGAG
ATGAGTCAGAAGATGATGGTCACGATGATAACGACAAAGATGTCGGGGGTACAAGTACAAATGAGACTTGTGAAGAGAATGAAGTTCAATCTGGCAATGTCGAATGTGAC
TCTGAAGATGATTATGAAGAAGGAGAAGATTCTAGCGATGAGGGTTCTGTGGAGGCGAATGAACCATTCGATGCACACATTTCTATTGACGTAGATGTGGAATCGGATTA
TCGATCATCGAGTGATTTGAATTCTCTAGTAAATTCAAGTGGAAGAAGAGTGAATGTTGATCCTGAGTTTAGAGAAGACACAGATATGGGAAAGATTGAATTTGTTATCG
GAATGAAGTTTAGCACTTCTAACGTAATGAAAGATGCTATAAAAGAGTATGCAGTCAGAGGTGGGTACAATATTCGATTGATAAAGAATGACAAGCAACGAGTCACAGCT
ACTTGTGATGGAGGATGTACTTGGAGACTACATGCTAGTGTGGCTAAGGGGGAGGCCACTTTTTAG
Protein sequenceShow/hide protein sequence
MKKLYRKRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSAVNNCSGHSGKATHQKPAATAAAKGGSDHPPVFSCGCFRCYTSYWVRWDSSPNRQLIHEI
IDAYEEKLAQTKTGKNSKKERKKRNTGSGSGFGSGEGKGSEPGSNEEESRPTEMETAGDGGGGEEEAEKGSVRRIKGGSKTRRSRRFTITFHYNGRIQYSPRVEYFGGKV
SVVKDVEPSGLTILDLSVLTAYLGVCNAAWFGYVVPGKTLSTGYKGILSDDDILDMVEMLPEDRMIHMYVEHNPNREIIDFTVPIAEVKPLFLEWYPEEASVESLNKEGT
DMEGINSISQREMTKNVEGTDKEEMDYMVEKEMEKHVDGSDKEEITSMEGTDQNHLEVVEEDPFEYCDEEWDGDESEDDGHDDNDKDVGGTSTNETCEENEVQSGNVECD
SEDDYEEGEDSSDEGSVEANEPFDAHISIDVDVESDYRSSSDLNSLVNSSGRRVNVDPEFREDTDMGKIEFVIGMKFSTSNVMKDAIKEYAVRGGYNIRLIKNDKQRVTA
TCDGGCTWRLHASVAKGEATF