CuGenDBv2

Lag0022502 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022502
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr7:31051815..31055703
RNA-Seq Expression: Lag0022502
Synteny: Lag0022502
Gene Ontology terms:
  GO:0004497 - monooxygenase activity (molecular function)
  GO:0005506 - iron ion binding (molecular function)
  GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
  GO:0020037 - heme binding (molecular function)
InterPro domains: IPR036396 - Cytochrome P450 superfamily
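
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch, it could be split into its parts as follows; the helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration; not part of any database API.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr7:31051815..31055703")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # chr7 31051815 31055703 3889
```

Under that assumption the gene spans 3889 bp on chr7.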


Homology
GenBank top hits (e value, % identity, alignment)
KAG7627772.1 Zinc finger CCHC-type [Arabidopsis thaliana x Arabidopsis arenosa] | e value: 1.5e-55 | % identity: 34.5
Query:  MSNPGKSHWEASKWILR-------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIWLK
        MS PGK HW A KW+LR       L    T + +  ++G+ DSDYAADLD+RRS TGY+FT+ GN +SWKS+LQS+VALS+TE EY+AL E+VKEA+W++
Subjt:  MSNPGKSHWEASKWILR-------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIWLK

Query:  GSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLEVNTAEVAASMASEAFLFQLAE
        G + E+  +  KV + CDSQS ISL+KN TFH+RTKHI ++F+FIR+V++EG V++ KI +  NPADMLTK      + V   E A        L     
Subjt:  GSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLEVNTAEVAASMASEAFLFQLAE

Query:  AEVARAYYLILTTL----------------RRRA------RTRGHISSSALSNSVPGELVQQL---ADRSST---------HAFHV-----------CLA
        AE  + +  +L  L                R+ A      R+    +   L + V   LV  L   AD  +T          AF V           CL 
Subjt:  AEVARAYYLILTTL----------------RRRA------RTRGHISSSALSNSVPGELVQQL---ADRSST---------HAFHV-----------CLA

Query:  SSRHQFPHFVS-DQSGLIRTHFNRDPSLLISGSRQNFSHNDMALLATLPSSQH----------PESIRIGISS------LLLILSSLHLHSLSLLRSLSP
         +R   P   + D +  I      +P   +  +++  +      L     + H           +S+ IG  +      L   L++ H +  ++   +  
Subjt:  SSRHQFPHFVS-DQSGLIRTHFNRDPSLLISGSRQNFSHNDMALLATLPSSQH----------PESIRIGISS------LLLILSSLHLHSLSLLRSLSP

Query:  FIFVGKTSVFA--------------------------VGIALLRWSLRPPRTVMHCLCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMR
        FI  G+ +  A                          V + L    L+       CLCE MR+Y PV WDSKHA  +D LP+ T V+  D+VTYF YGM 
Subjt:  FIFVGKTSVFA--------------------------VGIALLRWSLRPPRTVMHCLCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMR

Query:  QMEALWEKDCFEFKPN
        +ME LW  D  EF PN
Subjt:  QMEALWEKDCFEFKPN

KAG7962752.1 hypothetical protein I3843_09G081300 [Carya illinoinensis] | e value: 6.5e-51 | % identity: 60.82
Query:  MSNPGKSHWEASKWILR--LGYT----KTDQLED-YIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIWLK
        M NPGK+HW A+KWILR  LG      K ++ ED  + GYVDSDYA DLDKRRS TGY+FT+ G  +SW+S+LQS +ALSSTE EY+A+ E+VKEAIWL+
Subjt:  MSNPGKSHWEASKWILR--LGYT----KTDQLED-YIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIWLK

Query:  GSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTK
        G + ++  +  +V ++CDSQSAI L+KN  +H RTKHIDVRFHF+RE+L+EG + LQKI +E NPADMLTK
Subjt:  GSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTK

KYP57718.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] | e value: 1.2e-49 | % identity: 58.01
Query:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW
        M NPG++HWE  KW+LR         L Y K  Q    IEG+VD+DYA  +D R+SL+GY+FTL+G  +SWK++LQSVVALS+TE EYIALAE VKE +W
Subjt:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW

Query:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLE
        LKG + E+      V+IHCDSQSAI L+ +  +H+RTKHIDV+ HFIREV++ G V++ KIASE NPADMLTKSLP  + E
Subjt:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLE

PNX61902.1 retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] | e value: 2.7e-49 | % identity: 53.59
Query:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW
        M+NPG+ HW+A KW+LR         L Y K D  +D +EGYVD+DYA ++D R+SL+G++FTL+G  ++WK++ QSVVALS+T+ EYIAL E VKEAIW
Subjt:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW

Query:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLE
        LKG I E+      V+IHCDSQSAI L+ +  +H+RTKHID+R HF+R++++  ++ ++K+ASE NPADM TKSLP +R +
Subjt:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLE

XP_042012165.1 secreted RxLR effector protein 161-like [Salvia splendens] | e value: 2.5e-50 | % identity: 54.4
Query:  MSNPGKSHWEASKWILRLGYTK------------TDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKE
        MSNPGKSHW+A KW+LR  Y K             DQ ++ +EG+ DSDYA++ D R+S TGY+FT+YG+ +SWKS+LQSVVALS+TE EYIAL ++V E
Subjt:  MSNPGKSHWEASKWILRLGYTK------------TDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKE

Query:  AIWLKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANR
        + W++G +E++  +  KV ++CDS SAI LSK+ TFH+R+KHIDVR HFIR+ +++G ++++KIA+EHNPAD LTK +PA++
Subjt:  AIWLKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANR

TrEMBL top hits (e value, % identity, alignment)
A0A151SSH9 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 5.9e-50 | % identity: 58.01
Query:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW
        M NPG++HWE  KW+LR         L Y K  Q    IEG+VD+DYA  +D R+SL+GY+FTL+G  +SWK++LQSVVALS+TE EYIALAE VKE +W
Subjt:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW

Query:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLE
        LKG + E+      V+IHCDSQSAI L+ +  +H+RTKHIDV+ HFIREV++ G V++ KIASE NPADMLTKSLP  + E
Subjt:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLE

A0A176W7N9 Reverse transcriptase Ty1/copia-type domain-containing protein | e value: 2.3e-49 | % identity: 55
Query:  MSNPGKSHWEASKWILRLGYTKTD----------QLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAI
        MSNPGK+HW+A KW+LR      D            E  +EG+VDSDYA  LD RRSLTGYLFT+ G +ISWKS+LQ VV LS+TE EYIA+ E+VKEAI
Subjt:  MSNPGKSHWEASKWILRLGYTKTD----------QLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAI

Query:  WLKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANR
        WLKG + E+     +V IHCD+ SAI LSK+  FHD++KHID++ HF+R+++  G V ++KI++E NP+DMLTKS+P ++
Subjt:  WLKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANR

A0A2K3K6K0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) | e value: 1.3e-49 | % identity: 53.59
Query:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW
        M+NPG+ HW+A KW+LR         L Y K D  +D +EGYVD+DYA ++D R+SL+G++FTL+G  ++WK++ QSVVALS+T+ EYIAL E VKEAIW
Subjt:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW

Query:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLE
        LKG I E+      V+IHCDSQSAI L+ +  +H+RTKHID+R HF+R++++  ++ ++K+ASE NPADM TKSLP +R +
Subjt:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLE

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class | e value: 2.9e-49 | % identity: 55.08
Query:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW
        MSNPGK HW+A KW+LR         L Y++       +EG+ D+DYAADLDKRRSL+G++F LYGN++SWK +LQ VVALS+TE EYI+L E+VKEA+W
Subjt:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW

Query:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLEVNTAEV
        LK  + E+ S+ F   IHCDSQSAI L+KNP+ H+R+KHIDV+FH+IR V+ +  V+L K+ +  N +DMLTK+L A+R  V TA++
Subjt:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLEVNTAEV

A0A6D2LC14 Uncharacterized protein | e value: 2.9e-49 | % identity: 55.25
Query:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW
        M+NPG+ HW A KWILR         L +TK+++ E  IEG+ DSDY+ADLDKRRS++GY+F + GN +SW+S LQ VVALS+TE EY+AL+E+ KE IW
Subjt:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW

Query:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLE
        LKG   ++  +  + ++HCDSQSAI L+KN   HDRTKHID R+HFIR+++ EG V++ K+ + +NPADMLTK LP N  E
Subjt:  LKGSIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLE

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 2.8e-28 | % identity: 43.95
Query:  LRLGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYG-NIISWKSSLQSVVALSSTEVEYIALAESVKEAIWLKGSIEEI-FSRGFKVRIHCDSQ
        ++L + K    E+ I GYVDSD+A     R+S TGYLF ++  N+I W +  Q+ VA SSTE EY+AL E+V+EA+WLK  +  I       ++I+ D+Q
Subjt:  LRLGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYG-NIISWKSSLQSVVALSSTEVEYIALAESVKEAIWLKGSIEEI-FSRGFKVRIHCDSQ

Query:  SAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANR
          IS++ NP+ H R KHID+++HF RE +Q   + L+ I +E+  AD+ TK LPA R
Subjt:  SAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.9e-45 | % identity: 52.51
Query:  MSNPGKSHWEASKWILR-LGYTKTDQL-----EDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIWLKG
        + NPGK HWEA KWILR L  T  D L     +  ++GY D+D A D+D R+S TGYLFT  G  ISW+S LQ  VALS+TE EYIA  E+ KE IWLK 
Subjt:  MSNPGKSHWEASKWILR-LGYTKTDQL-----EDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIWLKG

Query:  SIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLEV
         ++E+     +  ++CDSQSAI LSKN  +H RTKHIDVR+H+IRE++ +  +++ KI++  NPADMLTK +P N+ E+
Subjt:  SIEEIFSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLEV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 5.8e-18 | % identity: 35.23
Query:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW
        M  P + H +A K ILR         +   K + L   +  Y D+D+A D D   S  GY+  L  + ISW S  Q  V  SSTE EY ++A +  E  W
Subjt:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW

Query:  LKGSIEEIFSRGFKVR-IHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSL
        +   + E+  R  +   I+CD+  A  L  NP FH R KHI + +HFIR  +Q G +++  +++    AD LTK L
Subjt:  LKGSIEEIFSRGFKVR-IHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSL

Q9FMV7 Cytochrome P450 94B1 | e value: 1.4e-16 | % identity: 62.3
Query:  CLCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKPN
        CLCE MR+Y PV WDSKHA  +D LP+ TP++  D+VTYF YGM +ME +W KD  EFKPN
Subjt:  CLCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKPN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.2e-18 | % identity: 34.66
Query:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW
        M  P   HW A K +LR         +   K + L   +  Y D+D+A D D   S  GY+  L  + ISW S  Q  V  SSTE EY ++A +  E  W
Subjt:  MSNPGKSHWEASKWILR---------LGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIW

Query:  LKGSIEEI-FSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSL
        +   + E+         I+CD+  A  L  NP FH R KHI + +HFIR  +Q G +++  +++    AD LTK L
Subjt:  LKGSIEEI-FSRGFKVRIHCDSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSL

Arabidopsis top hits (e value, % identity, alignment)
AT1G13140.1 cytochrome P450, family 86, subfamily C, polypeptide 3 | e value: 4.5e-10 | % identity: 45.76
Query:  LCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKP
        L ETMR+Y P+P + K AI +D  P+ T ++   RV + +Y M +ME++W KDC  FKP
Subjt:  LCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKP

AT1G13140.2 cytochrome P450, family 86, subfamily C, polypeptide 3 | e value: 4.5e-10 | % identity: 45.76
Query:  LCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKP
        L ETMR+Y P+P + K AI +D  P+ T ++   RV + +Y M +ME++W KDC  FKP
Subjt:  LCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKP

AT3G01900.1 cytochrome P450, family 94, subfamily B, polypeptide 2 | e value: 1.4e-19 | % identity: 60.56
Query:  SLRPPRTVMHCLCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKPN
        SL+    +  CLCE MR+Y PVPWDSKHA+ +D LP+ T V+A DRVTYF YGM +ME LW +D  EFKPN
Subjt:  SLRPPRTVMHCLCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKPN

AT3G48520.1 cytochrome P450, family 94, subfamily B, polypeptide 3 | e value: 2.5e-16 | % identity: 60.66
Query:  CLCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKPN
        CLCE MR+Y PV WDSKHA  +D LP+ T V+  D+VTYF YGM +ME LW  D  EF PN
Subjt:  CLCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKPN

AT5G63450.1 cytochrome P450, family 94, subfamily B, polypeptide 1 | e value: 1.0e-17 | % identity: 62.3
Query:  CLCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKPN
        CLCE MR+Y PV WDSKHA  +D LP+ TP++  D+VTYF YGM +ME +W KD  EFKPN
Subjt:  CLCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKPN


Sequences
CDS sequence
ATGAGTAACCCAGGAAAGTCTCATTGGGAGGCTTCAAAGTGGATTCTCAGGCTTGGTTACACTAAAACAGACCAACTTGAAGATTATATTGAAGGGTATGTAGATTCGGA
TTATGCAGCGGATCTAGACAAACGAAGGTCACTAACAGGATACCTATTTACCCTCTATGGAAACATTATTAGTTGGAAGTCCTCTTTACAGTCGGTTGTGGCTCTTTCCT
CCACAGAAGTTGAATATATAGCTTTAGCAGAATCGGTCAAAGAAGCAATATGGCTTAAAGGAAGTATTGAAGAAATATTTTCAAGAGGGTTCAAAGTGAGAATACACTGT
GACAGCCAAAGTGCAATCTCTTTGTCTAAGAATCCCACCTTTCATGATCGAACCAAACACATCGACGTCCGCTTCCATTTCATAAGAGAAGTTCTTCAAGAAGGCAAGGT
TCAACTTCAAAAGATTGCATCAGAACATAACCCAGCTGATATGCTAACTAAATCCTTACCAGCAAATCGATTGGAAGTCAATACCGCTGAGGTGGCAGCCAGCATGGCAA
GTGAAGCCTTCCTTTTTCAGTTGGCTGAAGCTGAAGTAGCACGTGCCTATTACCTCATCTTAACAACCCTACGACGGCGTGCACGAACTCGAGGCCACATCAGCTCTAGT
GCCCTCTCCAACTCCGTCCCTGGCGAACTAGTTCAGCAACTCGCGGATCGATCCTCGACGCACGCATTCCATGTGTGTTTAGCAAGCAGCAGGCATCAATTTCCTCATTT
TGTTTCGGACCAATCTGGACTCATTCGAACACATTTCAATAGAGATCCAAGTCTTTTGATAAGTGGGTCGAGACAAAACTTCAGCCACAATGACATGGCTCTTTTGGCTA
CTCTCCCATCATCCCAACATCCAGAATCAATTCGTATAGGGATTTCTTCTTTACTTCTCATTCTCTCTTCTCTTCATCTTCATTCTCTTTCTCTCCTCCGTTCACTCTCT
CCTTTTATCTTTGTTGGCAAAACCAGCGTCTTCGCCGTCGGAATTGCGCTTCTTCGCTGGAGCTTAAGGCCGCCACGGACTGTTATGCATTGCCTATGCGAAACCATGAG
AATCTACTCACCTGTACCTTGGGATTCCAAGCATGCCATTGTCAACGACTACTTGCCCAACAGGACTCCAGTCCAAGCCGAAGACAGAGTGACATATTTCTCGTATGGGA
TGAGGCAAATGGAGGCTCTATGGGAAAAGGATTGCTTTGAGTTCAAGCCGAACTGA
mRNA sequence
ATGAGTAACCCAGGAAAGTCTCATTGGGAGGCTTCAAAGTGGATTCTCAGGCTTGGTTACACTAAAACAGACCAACTTGAAGATTATATTGAAGGGTATGTAGATTCGGA
TTATGCAGCGGATCTAGACAAACGAAGGTCACTAACAGGATACCTATTTACCCTCTATGGAAACATTATTAGTTGGAAGTCCTCTTTACAGTCGGTTGTGGCTCTTTCCT
CCACAGAAGTTGAATATATAGCTTTAGCAGAATCGGTCAAAGAAGCAATATGGCTTAAAGGAAGTATTGAAGAAATATTTTCAAGAGGGTTCAAAGTGAGAATACACTGT
GACAGCCAAAGTGCAATCTCTTTGTCTAAGAATCCCACCTTTCATGATCGAACCAAACACATCGACGTCCGCTTCCATTTCATAAGAGAAGTTCTTCAAGAAGGCAAGGT
TCAACTTCAAAAGATTGCATCAGAACATAACCCAGCTGATATGCTAACTAAATCCTTACCAGCAAATCGATTGGAAGTCAATACCGCTGAGGTGGCAGCCAGCATGGCAA
GTGAAGCCTTCCTTTTTCAGTTGGCTGAAGCTGAAGTAGCACGTGCCTATTACCTCATCTTAACAACCCTACGACGGCGTGCACGAACTCGAGGCCACATCAGCTCTAGT
GCCCTCTCCAACTCCGTCCCTGGCGAACTAGTTCAGCAACTCGCGGATCGATCCTCGACGCACGCATTCCATGTGTGTTTAGCAAGCAGCAGGCATCAATTTCCTCATTT
TGTTTCGGACCAATCTGGACTCATTCGAACACATTTCAATAGAGATCCAAGTCTTTTGATAAGTGGGTCGAGACAAAACTTCAGCCACAATGACATGGCTCTTTTGGCTA
CTCTCCCATCATCCCAACATCCAGAATCAATTCGTATAGGGATTTCTTCTTTACTTCTCATTCTCTCTTCTCTTCATCTTCATTCTCTTTCTCTCCTCCGTTCACTCTCT
CCTTTTATCTTTGTTGGCAAAACCAGCGTCTTCGCCGTCGGAATTGCGCTTCTTCGCTGGAGCTTAAGGCCGCCACGGACTGTTATGCATTGCCTATGCGAAACCATGAG
AATCTACTCACCTGTACCTTGGGATTCCAAGCATGCCATTGTCAACGACTACTTGCCCAACAGGACTCCAGTCCAAGCCGAAGACAGAGTGACATATTTCTCGTATGGGA
TGAGGCAAATGGAGGCTCTATGGGAAAAGGATTGCTTTGAGTTCAAGCCGAACTGA
Protein sequence
MSNPGKSHWEASKWILRLGYTKTDQLEDYIEGYVDSDYAADLDKRRSLTGYLFTLYGNIISWKSSLQSVVALSSTEVEYIALAESVKEAIWLKGSIEEIFSRGFKVRIHC
DSQSAISLSKNPTFHDRTKHIDVRFHFIREVLQEGKVQLQKIASEHNPADMLTKSLPANRLEVNTAEVAASMASEAFLFQLAEAEVARAYYLILTTLRRRARTRGHISSS
ALSNSVPGELVQQLADRSSTHAFHVCLASSRHQFPHFVSDQSGLIRTHFNRDPSLLISGSRQNFSHNDMALLATLPSSQHPESIRIGISSLLLILSSLHLHSLSLLRSLS
PFIFVGKTSVFAVGIALLRWSLRPPRTVMHCLCETMRIYSPVPWDSKHAIVNDYLPNRTPVQAEDRVTYFSYGMRQMEALWEKDCFEFKPN
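
The CDS and protein records above can be cross-checked by translating the nucleotide sequence. The sketch below is one hedged way to do this with the standard genetic code; the `translate` function and the table-building shortcut are illustrative assumptions, not the database's own tooling, and only the first ten and last four codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS string; '*' marks a stop codon, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped record
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )

# First ten codons and last four codons of the CDS record.
print(translate("ATGAGTAACCCAGGAAAGTCTCATTGGGAG"))  # MSNPGKSHWE
print(translate("AAGCCGAACTGA"))                    # KPN*
```

Both fragments agree with the start (MSNPGKSHWE) and end (KPN) of the protein record; pasting the full wrapped CDS into `translate` would extend the same check to the whole sequence.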