; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037559 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037559
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionABC transporter E family member 2
Genome locationchr2:7234960..7241837
RNA-Seq ExpressionLag0037559
SyntenyLag0037559
Gene Ontology termsGO:0019430 - removal of superoxide radicals (biological process)
GO:0016020 - membrane (cellular component)
GO:0042644 - chloroplast nucleoid (cellular component)
GO:0004784 - superoxide dismutase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051536 - iron-sulfur cluster binding (molecular function)
InterPro domainsIPR007209 - RNase L inhibitor RLI-like, possible metal-binding domain
IPR013283 - RLI1
IPR017896 - 4Fe-4S ferredoxin-type, iron-sulphur binding domain
IPR017900 - 4Fe-4S ferredoxin, iron-sulphur binding, conserved site
IPR019831 - Manganese/iron superoxide dismutase, N-terminal
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR036324 - Manganese/iron superoxide dismutase, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
APO14281.1 superoxide dismutase 3 [Luffa aegyptiaca]1.3e-4271.09Show/hide
Query:  QRSNIPYMFMHHSQLHKRSSDVTTRGLKVTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDI
        Q  +  + ++H SQLHKRSSDVTTRG+K TAYYGLKTPPYD                      L AL+PYMSRRTLEVHWGKHH NYVEGLNKQLSQNDI
Subjt:  QRSNIPYMFMHHSQLHKRSSDVTTRGLKVTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDI

Query:  LYGYTLDELLKVTYNNGNPLPEFNNAAQ
        LYGYTLDELLKVTYNNGNPLPEFNNAAQ
Subjt:  LYGYTLDELLKVTYNNGNPLPEFNNAAQ

KAG9143596.1 hypothetical protein Leryth_020802 [Lithospermum erythrorhizon]9.1e-4447.89Show/hide
Query:  RLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVPRPR
        RL RIAIVS D+CK KK  QECK+SCPV  TGKLCIEVT ASKIAFISEELCIGCGICVKKCP EAIQIINL KDL+KD THRY PNTFKLHRLPVPRP 
Subjt:  RLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVPRPR

Query:  KVF------------EPSRLVGDIEDNL-------------------------------------------KSCDYL-ACLRLTPYNSCINCLTLAAMKS
        +V                 L G ++ NL                                           +   YL    RL     C        ++ 
Subjt:  KVF------------EPSRLVGDIEDNL-------------------------------------------KSCDYL-ACLRLTPYNSCINCLTLAAMKS

Query:  SCDLSVLDYLSDSICCLYGKP--------------------------ENLRFIDESLTFKV
          DLSVLDYLSD ICCLYGKP                          ENLRF DESLTFKV
Subjt:  SCDLSVLDYLSDSICCLYGKP--------------------------ENLRFIDESLTFKV

THU60628.1 hypothetical protein C4D60_Mb07t14790 [Musa balbisiana]7.0e-4446.64Show/hide
Query:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP
        MADRL RIAIVS DRCK KK  QECK+SCPV  TGKLCIEVT ASKIAFISEELCIGCGICVKKCP EAIQIINL KDLDKD THRY PNTFKLHRLPVP
Subjt:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP

Query:  RPRKV-----------------------------------------FEPSRL----VGDIEDNLKS----------------------------CDYLAC
        RP +V                                         F  S L       +EDNLK+                              YL  
Subjt:  RPRKV-----------------------------------------FEPSRL----VGDIEDNLKS----------------------------CDYLAC

Query:  LRLTPYNSCINCL---TLAAMKSSCDLSVLDYLSDSICCLYGKP--------------------------ENLRFIDESLTFK
         +       I  L       +    DLSVLDYLSD ICCLYGKP                          ENLRF DESLTFK
Subjt:  LRLTPYNSCINCL---TLAAMKSSCDLSVLDYLSDSICCLYGKP--------------------------ENLRFIDESLTFK

XP_021911986.1 ABC transporter E family member 2 [Carica papaya]8.8e-4750.19Show/hide
Query:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP
        MADRL RIAIVS DRCK KK  QECK+SCPV  TGKLCIEVT ASKIAFISEELCIGCGICVKKCP EAIQIINL +DLDKD THRY PNTFKLHRLPVP
Subjt:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP

Query:  RPRKV-----------------------------------------FEPSRL----VGDIEDNLKSCDYLACLRLTPYNSCINCLTLAAMKSSC------
        RP +V                                         F  S L       +EDNLK  +  + L +         +      +S       
Subjt:  RPRKV-----------------------------------------FEPSRL----VGDIEDNLKSCDYLACLRLTPYNSCINCLTLAAMKSSC------

Query:  DLSVLDYLSDSICCLYGKP--------------------------ENLRFIDESLTFKV
        DLSVLDYLSD ICCLYGKP                          ENLRF DESLTFKV
Subjt:  DLSVLDYLSDSICCLYGKP--------------------------ENLRFIDESLTFKV

XP_023523987.1 superoxide dismutase [Fe] 3, chloroplastic [Cucurbita pepo subsp. pepo]1.0e-4270.31Show/hide
Query:  QRSNIPYMFMHHSQLHKRSSDVTTRGLKVTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDI
        Q  ++ +  +H S+LHKRSSDVTTRG+KVTAYYGLKTPPY+                      LDAL+PYMSR+TLEVHWGKHHRNYVEGLNKQLSQNDI
Subjt:  QRSNIPYMFMHHSQLHKRSSDVTTRGLKVTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDI

Query:  LYGYTLDELLKVTYNNGNPLPEFNNAAQ
        LYG+TLDELLKVTYNNGNPLPEFNNAAQ
Subjt:  LYGYTLDELLKVTYNNGNPLPEFNNAAQ

TrEMBL top hitse value%identityAlignment
A0A1L5JHX4 Superoxide dismutase6.4e-4371.09Show/hide
Query:  QRSNIPYMFMHHSQLHKRSSDVTTRGLKVTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDI
        Q  +  + ++H SQLHKRSSDVTTRG+K TAYYGLKTPPYD                      L AL+PYMSRRTLEVHWGKHH NYVEGLNKQLSQNDI
Subjt:  QRSNIPYMFMHHSQLHKRSSDVTTRGLKVTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDI

Query:  LYGYTLDELLKVTYNNGNPLPEFNNAAQ
        LYGYTLDELLKVTYNNGNPLPEFNNAAQ
Subjt:  LYGYTLDELLKVTYNNGNPLPEFNNAAQ

A0A3S3MT95 ABC transporter E family member 24.1e-4242.91Show/hide
Query:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP
        MADRL RIAIVS DRCK KK  QECK+SCPV +TGKLCIEVT AS+IAFISEELCIGCGICVK+CP +AIQIINL KDLDKD THRY PNTFKLHRLPVP
Subjt:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP

Query:  RPRKVFE-------------------------PSRLVGDIEDNLKSCDYLACLRLTPYNSCIN------------------CLTLAAMKSS---------
        RP +V                           P  + G++   L   D          N  +N                   + + A++++         
Subjt:  RPRKVFE-------------------------PSRLVGDIEDNLKSCDYLACLRLTPYNSCIN------------------CLTLAAMKSS---------

Query:  -----------------------------CDLSVLDYLSDSICCLYGKP--------------------------ENLRFIDESLTFKV
                                      DLSVLDYLSD ICCLYGKP                          ENLRF +ESLTFKV
Subjt:  -----------------------------CDLSVLDYLSDSICCLYGKP--------------------------ENLRFIDESLTFKV

A0A4S8JFC2 Uncharacterized protein3.4e-4446.64Show/hide
Query:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP
        MADRL RIAIVS DRCK KK  QECK+SCPV  TGKLCIEVT ASKIAFISEELCIGCGICVKKCP EAIQIINL KDLDKD THRY PNTFKLHRLPVP
Subjt:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP

Query:  RPRKV-----------------------------------------FEPSRL----VGDIEDNLKS----------------------------CDYLAC
        RP +V                                         F  S L       +EDNLK+                              YL  
Subjt:  RPRKV-----------------------------------------FEPSRL----VGDIEDNLKS----------------------------CDYLAC

Query:  LRLTPYNSCINCL---TLAAMKSSCDLSVLDYLSDSICCLYGKP--------------------------ENLRFIDESLTFK
         +       I  L       +    DLSVLDYLSD ICCLYGKP                          ENLRF DESLTFK
Subjt:  LRLTPYNSCINCL---TLAAMKSSCDLSVLDYLSDSICCLYGKP--------------------------ENLRFIDESLTFK

A0A6J1H3S8 Superoxide dismutase1.9e-4270.31Show/hide
Query:  QRSNIPYMFMHHSQLHKRSSDVTTRGLKVTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDI
        Q  +  Y  +H S+LHKRSSDV TRG+KVTAYYGLKTPPY+                      LDAL+PYMSR+TLEVHWGKHHRNYVEGLNKQLSQNDI
Subjt:  QRSNIPYMFMHHSQLHKRSSDVTTRGLKVTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDI

Query:  LYGYTLDELLKVTYNNGNPLPEFNNAAQ
        LYG+TLDELLKVTYNNGNPLPEFNNAAQ
Subjt:  LYGYTLDELLKVTYNNGNPLPEFNNAAQ

D7TFC0 Uncharacterized protein5.4e-4284.4Show/hide
Query:  EGDSMADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHR
        EG +MADRL RIAIVS DRCK KK  QECK+SCPV  TGKLCIEVT ASKIAFISEELCIGCGICVKKCP EAIQIINL KDLDKD THRY PNTFKLHR
Subjt:  EGDSMADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHR

Query:  LPVPRPRKV
        LPVPRP +V
Subjt:  LPVPRPRKV

SwissProt top hitse value%identityAlignment
P61221 ATP-binding cassette sub-family E member 13.1e-3467.62Show/hide
Query:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP
        MAD+L RIAIV+ D+CK KK  QECK+SCPV   GKLCIEVT  SKIA+ISE LCIGCGIC+KKCP  A+ I+NL  +L+K+ THRY  N FKLHRLP+P
Subjt:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP

Query:  RPRKV
        RP +V
Subjt:  RPRKV

P61222 ATP-binding cassette sub-family E member 13.1e-3467.62Show/hide
Query:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP
        MAD+L RIAIV+ D+CK KK  QECK+SCPV   GKLCIEVT  SKIA+ISE LCIGCGIC+KKCP  A+ I+NL  +L+K+ THRY  N FKLHRLP+P
Subjt:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP

Query:  RPRKV
        RP +V
Subjt:  RPRKV

Q03195 Translation initiation factor RLI11.0e-3471.43Show/hide
Query:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP
        M+D+  RIAIVS D+CK KK  QECK SCPV  TGKLCIEVT  SKIAFISE LCIGCGICVKKCP +AIQIINL  +L+   THRY  N+FKLHRLP P
Subjt:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP

Query:  RPRKV
        RP +V
Subjt:  RPRKV

Q8LPJ4 ABC transporter E family member 27.5e-4180.95Show/hide
Query:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP
        MADRL RIAIVS DRCK KK  QECK+SCPV  TGKLCIEVT  SK+AFISEELCIGCGICVKKCP EAIQIINL +DL+KD THRY  NTFKLHRLPVP
Subjt:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP

Query:  RPRKV
        RP +V
Subjt:  RPRKV

Q8LPJ4 ABC transporter E family member 21.1e-0252Show/hide
Query:  DLSVLDYLSDSICCLYGKPENLRFIDESLTFKV-----VILLTFVPELNI
        DLSVLDYLSD ICCLYGKP     +  +L F V     + L  FVP  N+
Subjt:  DLSVLDYLSDSICCLYGKPENLRFIDESLTFKV-----VILLTFVPELNI

Q9LID6 ABC transporter E family member 12.0e-3878.1Show/hide
Query:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEV-TASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP
        M+DRL RIAIVS DRCK KK  QECK+SCPV  TGKLCIEV + SK AFISEELCIGCGICVKKCP EAIQIINL KDL KD THRY  N FKLHRLP+P
Subjt:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEV-TASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP

Query:  RPRKV
        RP +V
Subjt:  RPRKV

Arabidopsis top hitse value%identityAlignment
AT3G13640.1 RNAse l inhibitor protein 11.5e-3978.1Show/hide
Query:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEV-TASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP
        M+DRL RIAIVS DRCK KK  QECK+SCPV  TGKLCIEV + SK AFISEELCIGCGICVKKCP EAIQIINL KDL KD THRY  N FKLHRLP+P
Subjt:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEV-TASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP

Query:  RPRKV
        RP +V
Subjt:  RPRKV

AT3G13640.1 RNAse l inhibitor protein 19.8e-0450Show/hide
Query:  DLSVLDYLSDSICCLYGKPENLRFIDESLTFKV-----VILLTFVPELNI
        DLSVLDYLSD +CCLYGKP     +  +L F V     V L  F+P  N+
Subjt:  DLSVLDYLSDSICCLYGKPENLRFIDESLTFKV-----VILLTFVPELNI

AT4G19210.1 RNAse l inhibitor protein 25.3e-4280.95Show/hide
Query:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP
        MADRL RIAIVS DRCK KK  QECK+SCPV  TGKLCIEVT  SK+AFISEELCIGCGICVKKCP EAIQIINL +DL+KD THRY  NTFKLHRLPVP
Subjt:  MADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEVT-ASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVP

Query:  RPRKV
        RP +V
Subjt:  RPRKV

AT4G19210.1 RNAse l inhibitor protein 27.5e-0452Show/hide
Query:  DLSVLDYLSDSICCLYGKPENLRFIDESLTFKV-----VILLTFVPELNI
        DLSVLDYLSD ICCLYGKP     +  +L F V     + L  FVP  N+
Subjt:  DLSVLDYLSDSICCLYGKPENLRFIDESLTFKV-----VILLTFVPELNI

AT4G25100.1 Fe superoxide dismutase 11.3e-1646.53Show/hide
Query:  VTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDILYGYTLDELLKVTYNNGNPLPEFNNAAQ
        VTA Y LK PP+                       LDAL+P+MS++TLE HWGKHHR YV+ L KQ+   + L G  L+ ++  TYNNG+ LP FNNAAQ
Subjt:  VTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDILYGYTLDELLKVTYNNGNPLPEFNNAAQ

Query:  A
        A
Subjt:  A

AT5G23310.1 Fe superoxide dismutase 33.5e-3360.34Show/hide
Query:  SQLHKRSSDVTTRGLKVTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDILYGYTLDELLKV
        S   +R S  +  GLKV AYYGLKTPPY                       LDAL+PYMSRRTLEVHWGKHHR YV+ LNKQL ++D LYGYT++EL+K 
Subjt:  SQLHKRSSDVTTRGLKVTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDILYGYTLDELLKV

Query:  TYNNGNPLPEFNNAAQ
        TYNNGNPLPEFNNAAQ
Subjt:  TYNNGNPLPEFNNAAQ

AT5G51100.1 Fe superoxide dismutase 21.6e-1746.53Show/hide
Query:  VTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDILYGYTLDELLKVTYNNGNPLPEFNNAAQ
        +TA + LK PPY                       LDAL+P+MSR TL+ HWGKHH+ YVE LNKQ+   D L   +L+E++ ++YN GN LP FNNAAQ
Subjt:  VTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSRRTLEVHWGKHHRNYVEGLNKQLSQNDILYGYTLDELLKVTYNNGNPLPEFNNAAQ

Query:  A
        A
Subjt:  A


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTTCGATCCCTCATCCTTTTCCATTTTTTTCGCTTTCTCACCTTCGCTGTCCGACCTCTCAGATCTTCAGGATACGGTATGTTCTGATCTCCGACCTCTCAGATC
TTCAGGATACGATCCCCACCCATGCCGCGTCCGTCGACGTTGCGAGAGTTTGATGGGTCCGCATGGTCGGTTTTTGGAGCATGGTGAGGGTGATTCGATGGCGGATCGAT
TGATGCGTATAGCTATTGTGAGTTTGGATAGGTGCAAGGCTAAAAAGTGGTGTCAGGAATGCAAGGAAAGCTGTCCAGTTGGTATGACGGGTAAACTGTGTATTGAGGTT
ACAGCCTCTAAGATCGCTTTCATCTCAGAAGAGCTATGTATTGGATGCGGTATATGTGTCAAGAAATGCCCATCTGAAGCAATTCAAATCATCAATCTGCGAAAGGATTT
GGATAAAGATGCAACACACCGATATGTCCCCAACACCTTCAAATTGCACAGGTTGCCAGTTCCTCGGCCTAGGAAAGTTTTTGAACCCTCCAGATTGGTAGGAGATATTG
AAGATAATCTAAAGTCCTGTGATTACCTCGCTTGCTTGCGTTTGACTCCCTACAATTCCTGCATTAATTGTCTGACTCTTGCAGCTATGAAATCGTCGTGCGATCTTAGT
GTCTTGGATTACTTGTCCGACTCTATTTGCTGTCTTTATGGGAAGCCGGAAAATCTACGATTTATAGACGAATCCCTTACCTTCAAGGTTGTGATTCTGCTGACTTTTGT
CCCAGAATTGAACATTGATGTCATTGCAACTTGTCCTTTCTTTTTCATTCTTTTTATAGAGCTGATTTTGTGCTGGACTTCTCGATTGCTTAAGGTTGTAATTGATACAG
CCAAAAAGCCTACCACCAAGACCTTAGGCATAGCTGAACAACTATCGGAAGAAGCTGGTAGAATGAAATTAGAAAAAGTAGTGTTATCCAAAGGTTTTACAGCAGCTTTG
GGAGTGGAAAAACGGGAAGAAGTGGGAAAGGCACCATTGGGTAGAGGTTTTACAGCAAAACTGAAGCAGATAGTCAAGGAAATTTGTTCCGGATACTGCTGGATTTGGGA
TCAACGTTCAAATATACCATACATGTTCATGCATCATAGTCAATTGCATAAGAGAAGCTCTGATGTAACCACAAGAGGATTGAAAGTCACCGCTTATTATGGCTTGAAGA
CACCCCCCTATGACCTTATTCATAGTAAGCCTACAAATTTCTACCAAACTGATTTTAAATTTGTCACAATTTCAGCATTACTGGATGCTTTAGATCCATATATGAGTCGG
AGGACATTGGAGGTTCACTGGGGTAAACATCACCGTAACTACGTTGAAGGCTTGAACAAACAACTGAGCCAAAATGATATTCTCTATGGCTACACTTTGGATGAACTTCT
CAAAGTAACATATAACAATGGGAATCCTTTGCCTGAGTTTAACAATGCTGCCCAGGCACTTCAATGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCTTCGATCCCTCATCCTTTTCCATTTTTTTCGCTTTCTCACCTTCGCTGTCCGACCTCTCAGATCTTCAGGATACGGTATGTTCTGATCTCCGACCTCTCAGATC
TTCAGGATACGATCCCCACCCATGCCGCGTCCGTCGACGTTGCGAGAGTTTGATGGGTCCGCATGGTCGGTTTTTGGAGCATGGTGAGGGTGATTCGATGGCGGATCGAT
TGATGCGTATAGCTATTGTGAGTTTGGATAGGTGCAAGGCTAAAAAGTGGTGTCAGGAATGCAAGGAAAGCTGTCCAGTTGGTATGACGGGTAAACTGTGTATTGAGGTT
ACAGCCTCTAAGATCGCTTTCATCTCAGAAGAGCTATGTATTGGATGCGGTATATGTGTCAAGAAATGCCCATCTGAAGCAATTCAAATCATCAATCTGCGAAAGGATTT
GGATAAAGATGCAACACACCGATATGTCCCCAACACCTTCAAATTGCACAGGTTGCCAGTTCCTCGGCCTAGGAAAGTTTTTGAACCCTCCAGATTGGTAGGAGATATTG
AAGATAATCTAAAGTCCTGTGATTACCTCGCTTGCTTGCGTTTGACTCCCTACAATTCCTGCATTAATTGTCTGACTCTTGCAGCTATGAAATCGTCGTGCGATCTTAGT
GTCTTGGATTACTTGTCCGACTCTATTTGCTGTCTTTATGGGAAGCCGGAAAATCTACGATTTATAGACGAATCCCTTACCTTCAAGGTTGTGATTCTGCTGACTTTTGT
CCCAGAATTGAACATTGATGTCATTGCAACTTGTCCTTTCTTTTTCATTCTTTTTATAGAGCTGATTTTGTGCTGGACTTCTCGATTGCTTAAGGTTGTAATTGATACAG
CCAAAAAGCCTACCACCAAGACCTTAGGCATAGCTGAACAACTATCGGAAGAAGCTGGTAGAATGAAATTAGAAAAAGTAGTGTTATCCAAAGGTTTTACAGCAGCTTTG
GGAGTGGAAAAACGGGAAGAAGTGGGAAAGGCACCATTGGGTAGAGGTTTTACAGCAAAACTGAAGCAGATAGTCAAGGAAATTTGTTCCGGATACTGCTGGATTTGGGA
TCAACGTTCAAATATACCATACATGTTCATGCATCATAGTCAATTGCATAAGAGAAGCTCTGATGTAACCACAAGAGGATTGAAAGTCACCGCTTATTATGGCTTGAAGA
CACCCCCCTATGACCTTATTCATAGTAAGCCTACAAATTTCTACCAAACTGATTTTAAATTTGTCACAATTTCAGCATTACTGGATGCTTTAGATCCATATATGAGTCGG
AGGACATTGGAGGTTCACTGGGGTAAACATCACCGTAACTACGTTGAAGGCTTGAACAAACAACTGAGCCAAAATGATATTCTCTATGGCTACACTTTGGATGAACTTCT
CAAAGTAACATATAACAATGGGAATCCTTTGCCTGAGTTTAACAATGCTGCCCAGGCACTTCAATGTTAA
Protein sequenceShow/hide protein sequence
MFFDPSSFSIFFAFSPSLSDLSDLQDTVCSDLRPLRSSGYDPHPCRVRRRCESLMGPHGRFLEHGEGDSMADRLMRIAIVSLDRCKAKKWCQECKESCPVGMTGKLCIEV
TASKIAFISEELCIGCGICVKKCPSEAIQIINLRKDLDKDATHRYVPNTFKLHRLPVPRPRKVFEPSRLVGDIEDNLKSCDYLACLRLTPYNSCINCLTLAAMKSSCDLS
VLDYLSDSICCLYGKPENLRFIDESLTFKVVILLTFVPELNIDVIATCPFFFILFIELILCWTSRLLKVVIDTAKKPTTKTLGIAEQLSEEAGRMKLEKVVLSKGFTAAL
GVEKREEVGKAPLGRGFTAKLKQIVKEICSGYCWIWDQRSNIPYMFMHHSQLHKRSSDVTTRGLKVTAYYGLKTPPYDLIHSKPTNFYQTDFKFVTISALLDALDPYMSR
RTLEVHWGKHHRNYVEGLNKQLSQNDILYGYTLDELLKVTYNNGNPLPEFNNAAQALQC