CuGenDBv2

Lag0022831 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022831
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein FAR1-RELATED SEQUENCE 4-like
Genome location: chr7:38999374..39002827
RNA-Seq Expression: Lag0022831
Synteny: Lag0022831
Gene Ontology terms: NA
InterPro domains: IPR018289 - MULE transposase domain
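
The genome location field can be split into chromosome, start and end to get the span of the gene on the chromosome. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); the names `Locus` and `parse_location` are illustrative and not part of the database.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a string of the form 'chr7:38999374..39002827'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Locus(chrom, start, end)


locus = parse_location("chr7:38999374..39002827")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under that assumption the gene spans 3454 bp on chr7.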


Homology
GenBank top hits | e value | % identity
KAA0059897.1 protein FAR1-RELATED SEQUENCE 4-like [Cucumis melo var. makuwa] | 9.6e-84 | 53.97
Query:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE
        SS    G +NL S + SSS + DFQ++TD+H        DLKE  +F                                  C QD C W+VRAS YK  E
Subjt:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE

Query:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF
        LW LRKY++NH+CS+N IQT H+QAS+SLISDC+ ++  FSS D STP DI+  MRTKLGVNVSYYKAWR KE ++ SL G                   
Subjt:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF

Query:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV
           ++T++ETD +GHFKYC M +G+ IEGWKYCRPNISVD TFLK K+ GTLLTAST+DGNNQIFPLAF IVDSENDASW+WFFENIKNS GDRE LV++
Subjt:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV

Query:  FDRHLIIPKGALVVF
         DRHL IPK    VF
Subjt:  FDRHLIIPKGALVVF

KAA0065296.1 protein FAR1-RELATED SEQUENCE 4-like [Cucumis melo var. makuwa] | 2.5e-84 | 54.29
Query:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE
        SS    G +NL S + SSS + DFQ++TD+H        DLKEN +F                                  C QD C W+VRAS YK  +
Subjt:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE

Query:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF
        LW LRKY++NH+CS+N IQT H+QASSSLISDC+ ++  FSS D STP DI+  MRTKLGVNVSYYKAWR KE ++ SL G                   
Subjt:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF

Query:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV
           ++T++ETD +GHFKYC M +G+ IEGWKYCRPNISVD TFLK K+ GTLLTAST+DGNNQIFPLAF IVDSENDASW+WFFENIKNS GDRE LV++
Subjt:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV

Query:  FDRHLIIPKGALVVF
         DRHL IPK    VF
Subjt:  FDRHLIIPKGALVVF

TYK23827.1 MuDR family transposase [Cucumis melo var. makuwa] | 1.9e-84 | 54.6
Query:  SSSAIVGYENLSSVDSSSS-NVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE
        SS    G +NL S+ SSSS + DFQ++TD+H        DLKE  +F                                  C QD C W+VRAS YK  E
Subjt:  SSSAIVGYENLSSVDSSSS-NVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE

Query:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF
        LW LRKY++NH+CS+N IQT H+QASSSLISDC+ ++  FSS D STP DI+  MRTKLGVNVSYYKAWR KE ++ SL G                   
Subjt:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF

Query:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV
           ++T++ETD +GHFKYC M +G+ IEGWKYCRPNISVD TFLK K+ GTLLTAST+DGNNQIFPLAF IVDSENDASW+WFFENIKNS GDRE LV++
Subjt:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV

Query:  FDRHLIIPKGALVVF
         DRHL IPK    VF
Subjt:  FDRHLIIPKGALVVF

XP_038896605.1 uncharacterized protein LOC120084863 [Benincasa hispida] | 3.0e-85 | 54.37
Query:  HRLIAGSSSAIVGY--ENLSSVDSSSSNVDFQVITDVHFDGDLKENGVFG---------------------------------CVQDGCQWFVRASLYKE
        H  + GS S I G   + L  + SSS + + QVI +V F+GDLKE  VFG                                 C+Q+GCQW+VRAS YK+
Subjt:  HRLIAGSSSAIVGY--ENLSSVDSSSSNVDFQVITDVHFDGDLKENGVFG---------------------------------CVQDGCQWFVRASLYKE

Query:  SELWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG----TYT----------
        SELWMLRKY+S+H+C MN+ Q+CHRQASSS+I D LKE+FRF S DHS P+DIV+K RTKLGVN+SY KAWR KE I++SL G    +Y+          
Subjt:  SELWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG----TYT----------

Query:  SFETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLV
             T+T+ + D +GHFK C MV+G+SIEGW+Y  P ISVD TFLK KF GTLL+ASTLDGNN IFPLAF IVDSEND SWKWFFE+I+ S GDRE LV
Subjt:  SFETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLV

Query:  IVFDRHLIIPKGALVVFQQL
        IV  RHL IPK  L VF  +
Subjt:  IVFDRHLIIPKGALVVFQQL

XP_038907134.1 uncharacterized protein LOC120092945 [Benincasa hispida] | 3.1e-90 | 57.14
Query:  IAGSSSAIVG-YENLSSVDSSSSNV---DFQVITDVHFDGDLKENGVFG---------------------------------CVQDGCQWFVRASLYKES
        + GSSS +    EN  S+   S  +   DFQVI D+H   DLKE  VF                                  CVQDGCQW+VRAS YK S
Subjt:  IAGSSSAIVG-YENLSSVDSSSSNV---DFQVITDVHFDGDLKENGVFG---------------------------------CVQDGCQWFVRASLYKES

Query:  ELWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTS
        +LWMLRK++  HDCSMN++QT HRQAS+SLI DCLK EFR SS D  TPKDIV+K+R +LGVN+SYYKAWR KE I+KSLKG                  
Subjt:  ELWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTS

Query:  FETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVI
            T+T++ETD DGHFKYC M +GSSIEGWK+CRPNI VD TFLK K+ GTLLTAST+DGNN+ FPLAF IVDSENDASWKWFFENIKNSFG+REGLVI
Subjt:  FETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVI

Query:  VFDRHLIIPKGALVV
        + +RHL IP+G + V
Subjt:  VFDRHLIIPKGALVV

TrEMBL top hits | e value | % identity
A0A5A7T3G5 Protein FAR1-RELATED SEQUENCE 4-like | 4.6e-84 | 53.97
Query:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE
        SS    G +NL S + SSS + DFQ++TD+H        DLKE  +F                                  C QD C W+VRAS YK  E
Subjt:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE

Query:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF
        LW LRKY++NH+CS+N IQT H+QAS+SLISDC+ ++  FSS D STP DI+  MRTKLGVNVSYYKAWR KE ++ SL G                   
Subjt:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF

Query:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV
           ++T++ETD +GHFKYC M +G+ IEGWKYCRPNISVD TFLK K+ GTLLTAST+DGNNQIFPLAF IVDSENDASW+WFFENIKNS GDRE LV++
Subjt:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV

Query:  FDRHLIIPKGALVVF
         DRHL IPK    VF
Subjt:  FDRHLIIPKGALVVF

A0A5A7UZ18 Protein FAR1-RELATED SEQUENCE 4-like | 4.6e-84 | 53.97
Query:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE
        SS    G +NL S + SSS + DFQ++TD+H        DLKE  +F                                  C QD C W+VRAS YK  E
Subjt:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE

Query:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF
        LW LRKY++NH+CS+N IQT H+QAS+SLISDC+ ++  FSS D STP DI+  MRTKLGVNVSYYKAWR KE ++ SL G                   
Subjt:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF

Query:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV
           ++T++ETD +GHFKYC M +G+ IEGWKYCRPNISVD TFLK K+ GTLLTAST+DGNNQIFPLAF IVDSENDASW+WFFENIKNS GDRE LV++
Subjt:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV

Query:  FDRHLIIPKGALVVF
         DRHL IPK    VF
Subjt:  FDRHLIIPKGALVVF

A0A5A7VG38 Protein FAR1-RELATED SEQUENCE 4-like | 1.2e-84 | 54.29
Query:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE
        SS    G +NL S + SSS + DFQ++TD+H        DLKEN +F                                  C QD C W+VRAS YK  +
Subjt:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE

Query:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF
        LW LRKY++NH+CS+N IQT H+QASSSLISDC+ ++  FSS D STP DI+  MRTKLGVNVSYYKAWR KE ++ SL G                   
Subjt:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF

Query:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV
           ++T++ETD +GHFKYC M +G+ IEGWKYCRPNISVD TFLK K+ GTLLTAST+DGNNQIFPLAF IVDSENDASW+WFFENIKNS GDRE LV++
Subjt:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV

Query:  FDRHLIIPKGALVVF
         DRHL IPK    VF
Subjt:  FDRHLIIPKGALVVF

A0A5D3DAW8 Protein FAR1-RELATED SEQUENCE 4-like | 4.6e-84 | 53.97
Query:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE
        SS    G +NL S + SSS + DFQ++TD+H        DLKE  +F                                  C QD C W+VRAS YK  E
Subjt:  SSSAIVGYENLSS-VDSSSSNVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE

Query:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF
        LW LRKY++NH+CS+N IQT H+QAS+SLISDC+ ++  FSS D STP DI+  MRTKLGVNVSYYKAWR KE ++ SL G                   
Subjt:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF

Query:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV
           ++T++ETD +GHFKYC M +G+ IEGWKYCRPNISVD TFLK K+ GTLLTAST+DGNNQIFPLAF IVDSENDASW+WFFENIKNS GDRE LV++
Subjt:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV

Query:  FDRHLIIPKGALVVF
         DRHL IPK    VF
Subjt:  FDRHLIIPKGALVVF

A0A5D3DJR8 MuDR family transposase | 9.4e-85 | 54.6
Query:  SSSAIVGYENLSSVDSSSS-NVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE
        SS    G +NL S+ SSSS + DFQ++TD+H        DLKE  +F                                  C QD C W+VRAS YK  E
Subjt:  SSSAIVGYENLSSVDSSSS-NVDFQVITDVHFDG-----DLKENGVFG---------------------------------CVQDGCQWFVRASLYKESE

Query:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF
        LW LRKY++NH+CS+N IQT H+QASSSLISDC+ ++  FSS D STP DI+  MRTKLGVNVSYYKAWR KE ++ SL G                   
Subjt:  LWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKG--------------TYTSF

Query:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV
           ++T++ETD +GHFKYC M +G+ IEGWKYCRPNISVD TFLK K+ GTLLTAST+DGNNQIFPLAF IVDSENDASW+WFFENIKNS GDRE LV++
Subjt:  ETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIV

Query:  FDRHLIIPKGALVVF
         DRHL IPK    VF
Subjt:  FDRHLIIPKGALVVF

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G49920.1 MuDR family transposase | 1.6e-12 | 31.25
Query:  IIKSLKGTYTSFETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIK
        ++ S  G    ++ D+ T         F+        SI+G+++CRP I VD+  L GK+   L+ AS  D  NQ FPLAF +    +  SW+WF   I+
Subjt:  IIKSLKGTYTSFETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIK

Query:  NSFGDREGLVIV
             R+G+ ++
Subjt:  NSFGDREGLVIV

AT1G64255.1 MuDR family transposase | 4.5e-15 | 24.9
Query:  KENGVFGCVQDGCQWFVRASLYKESELWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHS-------TPKDIVHKMRTKLGVNVSYYK
        K+  +F C++  C+W + A+  K+  L  + KY   H        TCH      ++ +  K EF    ++ +       T  ++    + K+G  +    
Subjt:  KENGVFGCVQDGCQWFVRASLYKESELWMLRKYVSNHDCSMNSIQTCHRQASSSLISDCLKEEFRFSSLDHS-------TPKDIVHKMRTKLGVNVSYYK

Query:  AWREKEQIIKSLKGTY-TSFETD-----------------TYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQ
            KE+ IK + G +  SFE                    Y  F       F         SIEG+++CRP I VD+  L  ++   L+ AS +D  N+
Subjt:  AWREKEQIIKSLKGTY-TSFETD-----------------TYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQ

Query:  IFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIVFDRH
         FPLAF +    +   W+WF   I+     R+GL ++   H
Subjt:  IFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIVFDRH

AT1G64260.1 MuDR family transposase | 1.8e-16 | 26.79
Query:  KENGVFGCVQDGCQWFVRASLYKESELWMLRKYVSNHDCS------------MNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVN
        KE   F CV+  C+W +RA+  +E  L  + KY   H CS             + I+   R   +  I++  K     +  +  T K    K+     V 
Subjt:  KENGVFGCVQDGCQWFVRASLYKESELWMLRKYVSNHDCS------------MNSIQTCHRQASSSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVN

Query:  VSYYKAWREKEQIIKSLKGTYTSFETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSEN
            +++R   ++I +   +        Y  F       F+        SIEG+++CRP I VD+  L GK+   L+ AS +D  N+ FPLAF +    +
Subjt:  VSYYKAWREKEQIIKSLKGTYTSFETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVGTLLTASTLDGNNQIFPLAFGIVDSEN

Query:  DASWKWFFENIKNSFGDREGLVIV
          SW+WFF  I+     R+ L ++
Subjt:  DASWKWFFENIKNSFGDREGLVIV
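
In the alignments above, the unlabeled row between Query and Subjt is the usual BLAST-style match line: a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The sketch below estimates percent identity from those rows, assuming identity is counted as identical columns divided by total alignment columns (gaps included) and that the "Query:  " prefix and the matching leading indent of the match line have already been stripped. The function name `percent_identity` is illustrative. Values computed this way from the listing may differ from the reported % identity if the displayed alignment is not the full one.

```python
from typing import Iterable, Tuple


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Percent identity over one or more (query_row, match_row) blocks.

    Each match_row is padded or trimmed to the width of its query_row,
    because trailing spaces in the match line are often lost in plain text.
    """
    identical = 0
    columns = 0
    for query_row, match_row in blocks:
        width = len(query_row)
        match_row = match_row.ljust(width)[:width]
        # A letter in the match line marks an identical residue;
        # '+' (similar) and ' ' (mismatch or gap) do not count.
        identical += sum(1 for ch in match_row if ch.isalpha())
        columns += width
    if columns == 0:
        raise ValueError("no alignment columns supplied")
    return 100.0 * identical / columns


# Last block of the first GenBank hit (KAA0059897.1), prefixes removed.
last_block = [("FDRHLIIPKGALVVF", " DRHL IPK    VF")]
print(percent_identity(last_block))
```

For a full hit, pass every (query row, match row) pair of that hit in order so the identity is taken over the whole displayed alignment rather than a single block.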


Sequences
CDS sequence
ATGACCACGGTGCTAAATCGTTTACACCGGTTGCTCCCCAAGGTTAACACAACACCCTCCTCGGTCGTGCACTCGAGAGAGAATGATTGGAATCAAGGAATTACTGTGCT
CAGAGTGAATGGAGAGAAGATGAAAATTCACTTCTCCACCGTCGACGCCTCTTCCGTTCGAAAATCTCAGATATCGAACAAAACCTCAATTCGCTCTCTTAACTTCACAA
ATATCCCCGGTCGCCTCTTCCGGTCGTCGACGCCTCTTCCGGTCGCCGGATTCCAGGGATCTCGCCGGAAAGAAAATTCGAAAGAGACTGTAACAGTGATTGTCGGTTTG
GATTTTATCGAAAGGATCTTTCTTAGATTTCACAGTTTAGATCTCAGATTTCACCGATTAATTGCAGGGTCATCTAGTGCTATTGTTGGATACGAGAATTTGTCGTCTGT
TGATTCTTCCTCCAGTAATGTTGATTTTCAAGTTATTACTGATGTGCATTTTGATGGGGATTTAAAGGAGAATGGTGTTTTTGGGTGTGTTCAAGATGGTTGCCAATGGT
TTGTTAGGGCATCTCTGTATAAAGAGAGTGAATTGTGGATGCTTAGGAAGTATGTATCTAATCACGATTGCTCTATGAATAGTATCCAAACTTGTCATAGGCAAGCCTCT
TCATCTCTTATTAGTGATTGTTTGAAAGAAGAATTTAGATTTAGTTCTTTGGACCATTCGACTCCGAAAGATATTGTGCATAAGATGCGTACAAAACTTGGAGTCAATGT
TAGTTATTACAAAGCTTGGAGAGAAAAAGAACAGATTATAAAGTCGTTGAAAGGTACATACACATCATTTGAAACTGATACATACACATCATTTGAAACCGATGGAGATG
GACATTTTAAGTACTGCTTAATGGTTGTTGGTTCATCTATTGAGGGTTGGAAGTATTGTAGGCCCAACATATCTGTGGATAGCACATTCTTGAAGGGTAAATTTGTTGGA
ACTCTTTTGACAGCCTCAACTCTTGATGGTAACAATCAAATTTTTCCTCTTGCTTTCGGTATTGTAGATTCTGAAAACGATGCATCATGGAAATGGTTTTTTGAGAATAT
AAAGAATAGTTTTGGAGATCGAGAAGGTTTAGTTATTGTCTTTGATAGACATTTGATTATTCCCAAGGGTGCTTTGGTTGTTTTCCAACAGTTGAGTATTGTGTTTGCAT
TCAACATCTATTGA
mRNA sequence
ATGACCACGGTGCTAAATCGTTTACACCGGTTGCTCCCCAAGGTTAACACAACACCCTCCTCGGTCGTGCACTCGAGAGAGAATGATTGGAATCAAGGAATTACTGTGCT
CAGAGTGAATGGAGAGAAGATGAAAATTCACTTCTCCACCGTCGACGCCTCTTCCGTTCGAAAATCTCAGATATCGAACAAAACCTCAATTCGCTCTCTTAACTTCACAA
ATATCCCCGGTCGCCTCTTCCGGTCGTCGACGCCTCTTCCGGTCGCCGGATTCCAGGGATCTCGCCGGAAAGAAAATTCGAAAGAGACTGTAACAGTGATTGTCGGTTTG
GATTTTATCGAAAGGATCTTTCTTAGATTTCACAGTTTAGATCTCAGATTTCACCGATTAATTGCAGGGTCATCTAGTGCTATTGTTGGATACGAGAATTTGTCGTCTGT
TGATTCTTCCTCCAGTAATGTTGATTTTCAAGTTATTACTGATGTGCATTTTGATGGGGATTTAAAGGAGAATGGTGTTTTTGGGTGTGTTCAAGATGGTTGCCAATGGT
TTGTTAGGGCATCTCTGTATAAAGAGAGTGAATTGTGGATGCTTAGGAAGTATGTATCTAATCACGATTGCTCTATGAATAGTATCCAAACTTGTCATAGGCAAGCCTCT
TCATCTCTTATTAGTGATTGTTTGAAAGAAGAATTTAGATTTAGTTCTTTGGACCATTCGACTCCGAAAGATATTGTGCATAAGATGCGTACAAAACTTGGAGTCAATGT
TAGTTATTACAAAGCTTGGAGAGAAAAAGAACAGATTATAAAGTCGTTGAAAGGTACATACACATCATTTGAAACTGATACATACACATCATTTGAAACCGATGGAGATG
GACATTTTAAGTACTGCTTAATGGTTGTTGGTTCATCTATTGAGGGTTGGAAGTATTGTAGGCCCAACATATCTGTGGATAGCACATTCTTGAAGGGTAAATTTGTTGGA
ACTCTTTTGACAGCCTCAACTCTTGATGGTAACAATCAAATTTTTCCTCTTGCTTTCGGTATTGTAGATTCTGAAAACGATGCATCATGGAAATGGTTTTTTGAGAATAT
AAAGAATAGTTTTGGAGATCGAGAAGGTTTAGTTATTGTCTTTGATAGACATTTGATTATTCCCAAGGGTGCTTTGGTTGTTTTCCAACAGTTGAGTATTGTGTTTGCAT
TCAACATCTATTGA
Protein sequence
MTTVLNRLHRLLPKVNTTPSSVVHSRENDWNQGITVLRVNGEKMKIHFSTVDASSVRKSQISNKTSIRSLNFTNIPGRLFRSSTPLPVAGFQGSRRKENSKETVTVIVGL
DFIERIFLRFHSLDLRFHRLIAGSSSAIVGYENLSSVDSSSSNVDFQVITDVHFDGDLKENGVFGCVQDGCQWFVRASLYKESELWMLRKYVSNHDCSMNSIQTCHRQAS
SSLISDCLKEEFRFSSLDHSTPKDIVHKMRTKLGVNVSYYKAWREKEQIIKSLKGTYTSFETDTYTSFETDGDGHFKYCLMVVGSSIEGWKYCRPNISVDSTFLKGKFVG
TLLTASTLDGNNQIFPLAFGIVDSENDASWKWFFENIKNSFGDREGLVIVFDRHLIIPKGALVVFQQLSIVFAFNIY
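
The protein sequence can be checked against the CDS by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; `translate` is an illustrative helper, not a database function. It joins the wrapped lines, reads codons in order and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # remove line breaks from wrapped text
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the CDS listed above.
print(translate("ATGACCACGGTGCTAAATCGTTTACACCGG"))  # MTTVLNRLHR
print(translate("TTCAACATCTATTGA"))                 # FNIY
```

Both fragments agree with the start (MTTVLNRLHR) and end (FNIY) of the protein sequence listed above.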