CuGenDBv2

Lag0025117 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025117
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr10:8701810..8707809
RNA-Seq Expression: Lag0025117
Synteny: Lag0025117
Gene Ontology terms: NA
InterPro domains: NA
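
The genome location above can be split into its parts programmatically. The following is a minimal sketch, assuming the `chrom:start..end` string uses 1-based, inclusive coordinates (the page does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("chr10:8701810..8707809")
print(chrom, span_length(start, end))  # chr10 6000
```

Under that assumption the locus spans 6000 bp on chr10.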


Homology
GenBank top hits    e value    % identity    Alignment
KAG6573361.1 hypothetical protein SDJN03_27248, partial [Cucurbita argyrosperma subsp. sororia]    9.2e-70    43.17
Query:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK
        MES AL RG  PSKLPL KPS S PM  M S L F+++ P F RLSNGS    VSIG T+NPNI DK LA CG    GET YS+A+ED+LK+L+SLVQPK
Subjt:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK

Query:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG
        + NS + +I   KNEALKLV++EKY  A C  K LC  +   A+V YEAR+A++QILI LDEY+KA +FLEEKDNFP+  S + R SLYKAVVHTMLGNG
Subjt:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG

Query:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTKNPIPSVSLLRRHDDATRIHGGTKLVGFSLPSDW
        D AE+WWN YL TLGNG    +LK H  NTNS+ FL + K +LK LLSLK   V   S+L +IIP K  +    ++    DA + H              
Subjt:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTKNPIPSVSLLRRHDDATRIHGGTKLVGFSLPSDW

Query:  NEASSSPSLCSSSSLSLSVFISFLTPPPPATCFIRHLFISKSKGKSPSIRSETKPRSGFSFQIWNEKDFPFVSDPFQLEIQSGSAEGTELVLYFTSLEDL
               +LC+                                                                                         
Subjt:  NEASSSPSLCSSSSLSLSVFISFLTPPPPATCFIRHLFISKSKGKSPSIRSETKPRSGFSFQIWNEKDFPFVSDPFQLEIQSGSAEGTELVLYFTSLEDL

Query:  KIKNLEEEVLEAQVAYVQILIYLDKYEEAL-TLIEKESHFPKSD-ARPCLYKGDAAKSQGN
        K+++  EE LEAQ+AY+ ILIYL KYEEAL  L+  E  FP S+ A PCLYK     + GN
Subjt:  KIKNLEEEVLEAQVAYVQILIYLDKYEEAL-TLIEKESHFPKSD-ARPCLYKGDAAKSQGN

KAG7012525.1 hypothetical protein SDJN02_25277, partial [Cucurbita argyrosperma subsp. argyrosperma]    1.4e-57    63.59
Query:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK
        MES ALLRG  PSKLPL KPS S PM  M S L F+++ P F RLSNGS    VSIG T+NPNI DK LA CG    GET YS+A+ED+LK+L+SLVQPK
Subjt:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK

Query:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG
        + NS + +I   KNEALKLV++EKY  A C  K LC  +   A+V YEAR+A++QILI LDEY+KA +FLEEKDNFP+  S + R SLYKAVVHTMLGNG
Subjt:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG

Query:  D-AEKWWNKYLATLGNG
        D AE+WWN YL TLGNG
Subjt:  D-AEKWWNKYLATLGNG

XP_022955326.1 uncharacterized protein LOC111457322 isoform X1 [Cucurbita moschata]    1.9e-67    43.17
Query:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK
        MES ALLRG  PSKLPL KPS S PM  M S L F+++ P F RLSNGS    VSIG T+NPNIHDK LA CG    G T YS+A+ED+LK+LLSLVQPK
Subjt:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK

Query:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG
        + NS + +I   KNEALKLV++ KY  A C  K LC  +   A+V YEAR+A++QILI LDEY+KA +FLEE DNFP+  S + R SLYKAVVHTMLGNG
Subjt:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG

Query:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTKNPIPSVSLLRRHDDATRIHGGTKLVGFSLPSDW
        D AE+WWN YL TLG+G    +LK H  NTNS+ FL + K +LK LLSLK   V   S+L NIIP K  +    ++    DA + H              
Subjt:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTKNPIPSVSLLRRHDDATRIHGGTKLVGFSLPSDW

Query:  NEASSSPSLCSSSSLSLSVFISFLTPPPPATCFIRHLFISKSKGKSPSIRSETKPRSGFSFQIWNEKDFPFVSDPFQLEIQSGSAEGTELVLYFTSLEDL
               +LC+                                                                                         
Subjt:  NEASSSPSLCSSSSLSLSVFISFLTPPPPATCFIRHLFISKSKGKSPSIRSETKPRSGFSFQIWNEKDFPFVSDPFQLEIQSGSAEGTELVLYFTSLEDL

Query:  KIKNLEEEVLEAQVAYVQILIYLDKYEEAL-TLIEKESHFPKSD-ARPCLYKGDAAKSQGN
        K+++  EE LEAQ+AY+ ILIYL KYEEAL  L+  E  F  S+ ARPCLYK     + GN
Subjt:  KIKNLEEEVLEAQVAYVQILIYLDKYEEAL-TLIEKESHFPKSD-ARPCLYKGDAAKSQGN

XP_022955329.1 uncharacterized protein LOC111457322 isoform X2 [Cucurbita moschata]    2.3e-65    60.3
Query:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK
        MES ALLRG  PSKLPL KPS S PM  M S L F+++ P F RLSNGS    VSIG T+NPNIHDK LA CG    G T YS+A+ED+LK+LLSLVQPK
Subjt:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK

Query:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG
        + NS + +I   KNEALKLV++ KY  A C  K LC  +   A+V YEAR+A++QILI LDEY+KA +FLEE DNFP+  S + R SLYKAVVHTMLGNG
Subjt:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG

Query:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTK
        D AE+WWN YL TLG+G    +LK H  NTNS+ FL + K +LK LLSLK   V   S+L NIIP K
Subjt:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTK

XP_023542161.1 uncharacterized protein LOC111802127 [Cucurbita pepo subsp. pepo]    2.1e-66    57.75
Query:  MESIALLRGASPSKLPLYKPSSSIP--MMASSLRFDLKTPSFSRLSNGSAG--VSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK
        MES ALLRG  PSKLPL++PS SIP  MM S LRF+++     RLSNGS    VSIG T NPNI DK LA CG    GET YS+A+ D+LKSLLSLVQ K
Subjt:  MESIALLRGASPSKLPLYKPSSSIP--MMASSLRFDLKTPSFSRLSNGSAG--VSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK

Query:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG
        + NSP+ +I   KNEALKLV++ KY  A C  K LC  +   A+V YEAR+ ++QILI LDEY+KA +FLEEKDNFP+  S + R SLYKAVVHTMLGNG
Subjt:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG

Query:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTKNPIPSVSLLRRHDDATR
        D AE+ WN YL TLGNG    +LK H  NTNS+ FL + K +LK LLSLK   VE  S+L NII TK      ++   +D A R
Subjt:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTKNPIPSVSLLRRHDDATR

TrEMBL top hits    e value    % identity    Alignment
A0A1S3BA48 uncharacterized protein LOC103487673    1.7e-53    39.01
Query:  MESIALLRGASPSKLPLYKPSSSIPMMASS--LRFDLK--TPSFSRLSNGSAGV-SIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQP
        MESI L R  S  KLP   PS +IP M+SS  L F+L+  TP FS+LSN S  + +IG   + N  +K +A CG   GGE H     ++ LKSLLSLV+P
Subjt:  MESIALLRGASPSKLPLYKPSSSIPMMASS--LRFDLK--TPSFSRLSNGSAGV-SIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQP

Query:  KQKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGN
         + NS +S I   K+EALKLVMD KY  AE  M+ L    + + +V+YEARVA++QILI+LD+YEKA  FLEE+ NFP S   + R  LYKAVV+TML  
Subjt:  KQKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGN

Query:  GD-AEKWWNKYLATLG----NGKLKIH-SENTNSESFLT-DTKGILKSLLSLK-PAIVEEGSVLSNIIPTKNPIPSVSLLRRHDDATRIHGGTKLVGFSL
         D AEKWWNKYL TLG    NGK+KI+   NTNSE  +  + K +LK LLSLK PA VEE S  S+II TKN      +   ++ A              
Subjt:  GD-AEKWWNKYLATLG----NGKLKIH-SENTNSESFLT-DTKGILKSLLSLK-PAIVEEGSVLSNIIPTKNPIPSVSLLRRHDDATRIHGGTKLVGFSL

Query:  PSDWNEASSSPSLCSSSSLSLSVFISFLTPPPPATCFIRHLFISKSKGKSPSIRSETKPRSGFSFQIWNEKDFPFVSDPFQLEIQSGSAEGTELVLYFTS
                                                 F+ KSK                            + DP                     
Subjt:  PSDWNEASSSPSLCSSSSLSLSVFISFLTPPPPATCFIRHLFISKSKGKSPSIRSETKPRSGFSFQIWNEKDFPFVSDPFQLEIQSGSAEGTELVLYFTS

Query:  LEDLKIKNLEEEVLEAQVAYVQILIYLDKYEEALTLIEK-ESHFPKSDARPCLYKGDAAKSQGN
                   E LEAQ+ Y+ ILIYLD+YEEAL ++   ++HF  SD RPCLYK       GN
Subjt:  LEDLKIKNLEEEVLEAQVAYVQILIYLDKYEEALTLIEK-ESHFPKSDARPCLYKGDAAKSQGN

A0A6J1CF48 uncharacterized protein LOC111010943    1.5e-57    59.17
Query:  MASSLRFDLKTPSFSRLSNGSAGVSIGRTQNPNIHDKFLAHC-GISCGGETHYSDAKEDALKSLLSLVQPK-QKNSPSSMIIPIKNEALKLVMDEKYLAA
        MA  L FDL    F RL NGS G+SIG T+NP    KFLAHC  I+   ET YS+AK+D  KSLLSLV  K   NS  S I+ IKNEALKLV+D+KY   
Subjt:  MASSLRFDLKTPSFSRLSNGSAGVSIGRTQNPNIHDKFLAHC-GISCGGETHYSDAKEDALKSLLSLVQPK-QKNSPSSMIIPIKNEALKLVMDEKYLAA

Query:  ECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNGDAEKWWNKYLATLGNGKLK----IHSEN
        ECLM+ L +R  TN DV YEARVAYVQ LIYLDEY KA +FLEE   FP+S SSD RP LYKAVVHTMLG  DAEK WN YL TL NG +      HS N
Subjt:  ECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNGDAEKWWNKYLATLGNGKLK----IHSEN

Query:  TNSES--FLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTK
               FLTD K  LKSLLSL+    E+ S L+ IIP K
Subjt:  TNSES--FLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTK

A0A6J1GT93 uncharacterized protein LOC111457322 isoform X1    9.3e-68    43.17
Query:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK
        MES ALLRG  PSKLPL KPS S PM  M S L F+++ P F RLSNGS    VSIG T+NPNIHDK LA CG    G T YS+A+ED+LK+LLSLVQPK
Subjt:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK

Query:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG
        + NS + +I   KNEALKLV++ KY  A C  K LC  +   A+V YEAR+A++QILI LDEY+KA +FLEE DNFP+  S + R SLYKAVVHTMLGNG
Subjt:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG

Query:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTKNPIPSVSLLRRHDDATRIHGGTKLVGFSLPSDW
        D AE+WWN YL TLG+G    +LK H  NTNS+ FL + K +LK LLSLK   V   S+L NIIP K  +    ++    DA + H              
Subjt:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTKNPIPSVSLLRRHDDATRIHGGTKLVGFSLPSDW

Query:  NEASSSPSLCSSSSLSLSVFISFLTPPPPATCFIRHLFISKSKGKSPSIRSETKPRSGFSFQIWNEKDFPFVSDPFQLEIQSGSAEGTELVLYFTSLEDL
               +LC+                                                                                         
Subjt:  NEASSSPSLCSSSSLSLSVFISFLTPPPPATCFIRHLFISKSKGKSPSIRSETKPRSGFSFQIWNEKDFPFVSDPFQLEIQSGSAEGTELVLYFTSLEDL

Query:  KIKNLEEEVLEAQVAYVQILIYLDKYEEAL-TLIEKESHFPKSD-ARPCLYKGDAAKSQGN
        K+++  EE LEAQ+AY+ ILIYL KYEEAL  L+  E  F  S+ ARPCLYK     + GN
Subjt:  KIKNLEEEVLEAQVAYVQILIYLDKYEEAL-TLIEKESHFPKSD-ARPCLYKGDAAKSQGN

A0A6J1GVX9 uncharacterized protein LOC111457322 isoform X2    1.1e-65    60.3
Query:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK
        MES ALLRG  PSKLPL KPS S PM  M S L F+++ P F RLSNGS    VSIG T+NPNIHDK LA CG    G T YS+A+ED+LK+LLSLVQPK
Subjt:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNGS--AGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK

Query:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG
        + NS + +I   KNEALKLV++ KY  A C  K LC  +   A+V YEAR+A++QILI LDEY+KA +FLEE DNFP+  S + R SLYKAVVHTMLGNG
Subjt:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNG

Query:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTK
        D AE+WWN YL TLG+G    +LK H  NTNS+ FL + K +LK LLSLK   V   S+L NIIP K
Subjt:  D-AEKWWNKYLATLGNG----KLKIHSENTNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTK

A0A6J1K442 uncharacterized protein LOC111490475    2.1e-51    57.99
Query:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNG--SAGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK
        MES ALL G  PSKLPL KPS S PM  + S L F+++ P FSRLSNG  S  VSI  T NPN  D  LA CG    GET YS+A+ED+LK+LLSLVQPK
Subjt:  MESIALLRGASPSKLPLYKPSSSIPM--MASSLRFDLKTPSFSRLSNG--SAGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPK

Query:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLG-N
        + NS + +I   KNEALKLV++EKY    C  K LC  +    +V YEAR+A++QILI LDEY KA +FLEEKDNFP+S + +   SLYKAVVHTML  N
Subjt:  QKNSPSSMIIPIKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLG-N

Query:  GDAEKWWNKYLATLGNGKL
        G AE+WWN YL  + + KL
Subjt:  GDAEKWWNKYLATLGNGKL

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT2G34540.2 unknown protein    7.6e-06    29.25
Query:  IKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLG-NGDAEKWWNKYL
        IK EA++ + + K   A  L++    R +   +  +  ++A V+ILI L+ Y++A ++    D    +  SD R  LYKA+++TML  + +A++ W ++ 
Subjt:  IKNEALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLG-NGDAEKWWNKYL

Query:  ATLGNG
         ++G G
Subjt:  ATLGNG
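
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for one midline string. `midline_stats` is a hypothetical helper and the input is a short made-up example, not a hit from this page. It also assumes the midline has been padded to the full block width, since trailing spaces are easily lost when text is copied.

```python
def midline_stats(midline: str):
    """Return (identities, positives, length) for a BLAST-style midline."""
    # Letters mark identical residues; '+' marks conservative substitutions.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


ident, pos, length = midline_stats("MES AL+G")
print(f"{ident}/{length} identical ({100 * ident / length:.1f}%), "
      f"{pos}/{length} positive")
# 6/8 identical (75.0%), 7/8 positive
```

Summing the counts over every block of a hit and dividing by the total alignment length gives a percent identity comparable in spirit to the reported column, although the database may compute its figure differently.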


Sequences
CDS sequence
ATGGAATCCATTGCCTTGCTTCGTGGTGCCTCGCCATCCAAACTTCCATTATACAAACCTTCTTCTTCCATTCCGATGATGGCTTCATCGCTCCGTTTCGACCTTAAGAC
GCCATCTTTTTCTCGGCTCTCTAATGGTTCAGCTGGGGTCTCAATTGGGCGGACTCAAAATCCCAATATACATGACAAATTTTTAGCTCATTGTGGAATATCATGTGGTG
GAGAGACACATTACTCGGATGCTAAAGAAGATGCATTGAAGTCGCTGCTGAGCTTAGTACAACCAAAACAAAAGAACTCACCATCTTCAATGATCATTCCCATTAAGAAT
GAGGCATTGAAGCTGGTAATGGATGAGAAGTATCTTGCAGCAGAGTGCCTTATGAAAGGCTTATGCGATAGAAATCAGACAAATGCCGACGTGGCATACGAGGCTCGAGT
GGCATATGTCCAAATTCTTATATATCTTGATGAATACGAAAAAGCTCGAAAATTTCTTGAGGAGAAGGACAACTTTCCTCGATCTGATTCATCCGATGGAAGACCTAGTC
TTTACAAGGCTGTGGTGCATACCATGTTGGGCAATGGTGATGCTGAAAAATGGTGGAATAAGTACCTAGCAACCCTTGGCAATGGGAAGTTAAAAATTCATAGTGAAAAT
ACAAATTCAGAGAGCTTCTTGACAGATACAAAAGGCATATTGAAGTCACTGTTGAGCTTAAAACCAGCAATAGTGGAAGAAGGCAGTGTGTTATCAAATATTATTCCCAC
TAAGAATCCAATTCCCTCCGTTTCTCTTCTTAGACGGCACGACGACGCAACGCGAATCCACGGCGGCACGAAGCTTGTTGGGTTTTCCCTTCCTTCAGACTGGAACGAAG
CTTCTTCTTCACCGTCCCTCTGTTCTTCTTCTTCACTGTCCCTCTCTGTTTTCATTTCCTTTCTCACTCCCCCGCCTCCAGCTACCTGCTTCATTCGGCACCTCTTCATC
TCAAAATCGAAGGGGAAATCCCCTTCGATTCGGTCTGAAACGAAACCCAGATCTGGGTTTTCGTTCCAGATCTGGAACGAAAAGGATTTTCCCTTCGTTTCAGACCCATT
CCAGCTTGAAATTCAAAGCGGCAGCGCGGAGGGAACGGAGCTAGTTCTCTATTTCACGAGTTTAGAAGATTTGAAGATCAAAAACCTCGAGGAGGAGGTATTAGAGGCAC
AAGTGGCATATGTCCAGATTCTTATCTATCTCGATAAATATGAAGAAGCTCTAACACTCATTGAGAAGGAGAGTCACTTCCCTAAATCTGATGCAAGACCTTGCCTTTAT
AAGGGTGATGCGGCCAAGTCGCAGGGGAATGTCGGGGCTGTTAGGTGCCGAATTCGGATTCCGAATCCTGAGCCCAGGGCGTTACAGATTGCCTCATTTGATTGGTAA
mRNA sequence
ATGGAATCCATTGCCTTGCTTCGTGGTGCCTCGCCATCCAAACTTCCATTATACAAACCTTCTTCTTCCATTCCGATGATGGCTTCATCGCTCCGTTTCGACCTTAAGAC
GCCATCTTTTTCTCGGCTCTCTAATGGTTCAGCTGGGGTCTCAATTGGGCGGACTCAAAATCCCAATATACATGACAAATTTTTAGCTCATTGTGGAATATCATGTGGTG
GAGAGACACATTACTCGGATGCTAAAGAAGATGCATTGAAGTCGCTGCTGAGCTTAGTACAACCAAAACAAAAGAACTCACCATCTTCAATGATCATTCCCATTAAGAAT
GAGGCATTGAAGCTGGTAATGGATGAGAAGTATCTTGCAGCAGAGTGCCTTATGAAAGGCTTATGCGATAGAAATCAGACAAATGCCGACGTGGCATACGAGGCTCGAGT
GGCATATGTCCAAATTCTTATATATCTTGATGAATACGAAAAAGCTCGAAAATTTCTTGAGGAGAAGGACAACTTTCCTCGATCTGATTCATCCGATGGAAGACCTAGTC
TTTACAAGGCTGTGGTGCATACCATGTTGGGCAATGGTGATGCTGAAAAATGGTGGAATAAGTACCTAGCAACCCTTGGCAATGGGAAGTTAAAAATTCATAGTGAAAAT
ACAAATTCAGAGAGCTTCTTGACAGATACAAAAGGCATATTGAAGTCACTGTTGAGCTTAAAACCAGCAATAGTGGAAGAAGGCAGTGTGTTATCAAATATTATTCCCAC
TAAGAATCCAATTCCCTCCGTTTCTCTTCTTAGACGGCACGACGACGCAACGCGAATCCACGGCGGCACGAAGCTTGTTGGGTTTTCCCTTCCTTCAGACTGGAACGAAG
CTTCTTCTTCACCGTCCCTCTGTTCTTCTTCTTCACTGTCCCTCTCTGTTTTCATTTCCTTTCTCACTCCCCCGCCTCCAGCTACCTGCTTCATTCGGCACCTCTTCATC
TCAAAATCGAAGGGGAAATCCCCTTCGATTCGGTCTGAAACGAAACCCAGATCTGGGTTTTCGTTCCAGATCTGGAACGAAAAGGATTTTCCCTTCGTTTCAGACCCATT
CCAGCTTGAAATTCAAAGCGGCAGCGCGGAGGGAACGGAGCTAGTTCTCTATTTCACGAGTTTAGAAGATTTGAAGATCAAAAACCTCGAGGAGGAGGTATTAGAGGCAC
AAGTGGCATATGTCCAGATTCTTATCTATCTCGATAAATATGAAGAAGCTCTAACACTCATTGAGAAGGAGAGTCACTTCCCTAAATCTGATGCAAGACCTTGCCTTTAT
AAGGGTGATGCGGCCAAGTCGCAGGGGAATGTCGGGGCTGTTAGGTGCCGAATTCGGATTCCGAATCCTGAGCCCAGGGCGTTACAGATTGCCTCATTTGATTGGTAA
Protein sequence
MESIALLRGASPSKLPLYKPSSSIPMMASSLRFDLKTPSFSRLSNGSAGVSIGRTQNPNIHDKFLAHCGISCGGETHYSDAKEDALKSLLSLVQPKQKNSPSSMIIPIKN
EALKLVMDEKYLAAECLMKGLCDRNQTNADVAYEARVAYVQILIYLDEYEKARKFLEEKDNFPRSDSSDGRPSLYKAVVHTMLGNGDAEKWWNKYLATLGNGKLKIHSEN
TNSESFLTDTKGILKSLLSLKPAIVEEGSVLSNIIPTKNPIPSVSLLRRHDDATRIHGGTKLVGFSLPSDWNEASSSPSLCSSSSLSLSVFISFLTPPPPATCFIRHLFI
SKSKGKSPSIRSETKPRSGFSFQIWNEKDFPFVSDPFQLEIQSGSAEGTELVLYFTSLEDLKIKNLEEEVLEAQVAYVQILIYLDKYEEALTLIEKESHFPKSDARPCLY
KGDAAKSQGNVGAVRCRIRIPNPEPRALQIASFDW
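
The protein sequence can be checked against the CDS by translating it with the standard genetic code. This is a minimal sketch, assuming the standard nuclear code applies and that the CDS is in frame from its first base; `translate` and `CODON_TABLE` are illustrative names. The two inputs are the first 30 and last 12 bases of the CDS shown above.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGAATCCATTGCCTTGCTTCGTGGTGCC"))  # MESIALLRGA
print(translate("TTTGATTGGTAA"))                    # FDW
```

Both results agree with the start (`MESIALLRGA`) and end (`FDW`) of the protein sequence listed above; passing the full CDS, with its line breaks, to `translate` would extend the check to the whole sequence.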