; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005772 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005772
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr6:29432244..29433122
RNA-Seq ExpressionLag0005772
SyntenyLag0005772
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]4.2e-7364.61Show/hide
Query:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA
        + S  +D  S IFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T INA
Subjt:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNT +TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT

Query:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ +TFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]4.2e-7363.79Show/hide
Query:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA
        + S  +D  S IFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKLFGF+DG+ P P +S  TT        S+   Q NP YEDWIAKD AL T INA
Subjt:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNT +TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT

Query:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ +TFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]4.2e-7364.61Show/hide
Query:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA
        + S  +D  S IFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T INA
Subjt:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNT +TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT

Query:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ +TFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]4.2e-7364.61Show/hide
Query:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA
        + S  +D  S IFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T INA
Subjt:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNT +TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT

Query:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ +TFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]1.4e-7361.94Show/hide
Query:  STARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQP--NPQYEDWIAKDHALQTQINA
        +T +DL+S IFLLSNICNLVSIRLDST+F+LWK QL +ILKAHKLFGFIDGS   P S    ++S + +  ++T S P  NP +EDWIAKD AL T INA
Subjt:  STARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQP--NPQYEDWIAKDHALQTQINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT
        TLS  ALAY+V   T++Q+WE LEKHYSS SRTNVVNLKS+LQ+I KK+ ESI  Y++RIKE+KDK ANVS+TI+ E L+IY ++GL  EYNT  TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT

Query:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSSNRNQ
        R+QS++FEELHV + SEESA+EKQ+KR++   QP AL A +  ++N+
Subjt:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSSNRNQ

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X22.0e-7364.61Show/hide
Query:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA
        + S  +D  S IFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T INA
Subjt:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNT +TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT

Query:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ +TFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X32.0e-7364.61Show/hide
Query:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA
        + S  +D  S IFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T INA
Subjt:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNT +TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT

Query:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ +TFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X12.0e-7364.61Show/hide
Query:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA
        + S  +D  S IFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T INA
Subjt:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNT +TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT

Query:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ +TFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

A0A5D3CLI6 T4.52.0e-7364.61Show/hide
Query:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA
        + S  +D  S IFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T INA
Subjt:  ADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT
        TLSP ALAY+VG  +++Q+W+ L K YSS SR+NVVNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNT +TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT

Query:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ +TFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

A0A6J1D9L6 uncharacterized protein LOC1110188927.0e-7461.94Show/hide
Query:  STARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQP--NPQYEDWIAKDHALQTQINA
        +T +DL+S IFLLSNICNLVSIRLDST+F+LWK QL +ILKAHKLFGFIDGS   P S    ++S + +  ++T S P  NP +EDWIAKD AL T INA
Subjt:  STARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQP--NPQYEDWIAKDHALQTQINA

Query:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT
        TLS  ALAY+V   T++Q+WE LEKHYSS SRTNVVNLKS+LQ+I KK+ ESI  Y++RIKE+KDK ANVS+TI+ E L+IY ++GL  EYNT  TS+RT
Subjt:  TLSPTALAYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRT

Query:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSSNRNQ
        R+QS++FEELHV + SEESA+EKQ+KR++   QP AL A +  ++N+
Subjt:  RSQSITFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSSNRNQ

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-0723.5Show/hide
Query:  FVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINATLSPTALAYIVGCETAQQMWETLEKHYSST
        F  W+ ++R +L    L   +D    KP +                      + EDW   D    + I   LS   +  I+  +TA+ +W  LE  Y S 
Subjt:  FVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINATLSPTALAYIVGCETAQQMWETLEKHYSST

Query:  SRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRTRSQSITFEELHVLLVSEESALEKQIKRDEA
        + TN + LK +L  +    G +   ++     L  +LAN+ V I  ED  I  ++ LP+ Y+   T++     +I  +++   L+  E   +K   + +A
Subjt:  SRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRTRSQSITFEELHVLLVSEESALEKQIKRDEA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.2e-1526.91Show/hide
Query:  LNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINATLSPTAL
        LN++  L  N+ N+   +L STN+++W  Q+ ++   ++L GF+DGS   P +++        TD++  V   NP Y  W  +D  + + +   +S +  
Subjt:  LNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINATLSPTAL

Query:  AYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRTRSQSITF
          +    TA Q+WETL K Y++ S  +V  L+++L+   K + ++I  Y+Q +    D+LA +   +  ++ V   +  LP EY      +  +    T 
Subjt:  AYIVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRTRSQSITF

Query:  EELHVLLVSEESALEKQIKRDEAFAQPTALLA-----QTSSNRNQNYNQ
         E+H  L++ ES   K +    A   P    A      T++N N N N+
Subjt:  EELHVLLVSEESALEKQIKRDEAFAQPTALLA-----QTSSNRNQNYNQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-1225.55Show/hide
Query:  RLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINATLSPTALAYIVGCETAQQMWETLE
        +L STN+++W  Q+ ++   ++L GF+DGS P P +++ T            V + NP Y  W  +D  + + I   +S +    +    TA Q+WETL 
Subjt:  RLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINATLSPTALAYIVGCETAQQMWETLE

Query:  KHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRTRSQSITFEELHVLLVSEESALEKQ
        K Y++ S  +V  L+                +I R     D+LA +   +  ++ V   +  LP +Y      +  +    +  E+H  L++ ES   K 
Subjt:  KHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRTRSQSITFEELHVLLVSEESALEKQ

Query:  IKRDEAFAQPTALLAQTSSNRNQNYNQ
        +  + A   P      T  N N N NQ
Subjt:  IKRDEAFAQPTALLAQTSSNRNQNYNQ

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.3e-0825.55Show/hide
Query:  DSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINATLSPTALAYIVGCETAQQMWETLEKH
        D  N+V WK + RS L+  K FGFIDG+ PKP                      +P Y+ W   +  +   +  +++   L  ++  ETA +MWE L + 
Subjt:  DSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINATLSPTALAYIVGCETAQQMWETLEKH

Query:  YSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKEL
        +       +  L+  L T+ ++ G+S+ +Y  ++ ++
Subjt:  YSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGATTCTACTGCCAGAGATCTCAATTCTTCTATTTTTCTTCTTTCAAATATTTGTAATCTGGTCTCAATTCGGCTAGATTCTACAAATTTTGTTCTATGGAAGCA
TCAGTTGCGATCAATCTTGAAGGCGCACAAATTGTTTGGATTCATTGATGGATCCTTTCCGAAACCGGCTTCATCTGTACGTACTACCACTTCTTCTTCGTCTACTGATT
CTTCATCTACTGTTTCTCAGCCAAATCCACAATATGAGGACTGGATTGCCAAAGATCACGCACTACAAACTCAGATTAATGCAACTCTTTCTCCCACTGCATTGGCCTAT
ATTGTTGGTTGTGAGACTGCACAACAAATGTGGGAAACTCTAGAGAAACACTACTCTTCTACTTCGCGAACAAATGTCGTTAACCTAAAATCAGAGTTGCAAACTATTGC
GAAGAAATCTGGTGAAAGCATCACACAATACATCCAACGCATCAAGGAGTTGAAAGATAAATTGGCCAATGTTTCCGTCACGATTCATGTAGAGGATCTGGTGATCTACA
CGATGAGCGGCTTACCTGCTGAATACAACACTTCCAAGACGTCTCTGCGTACTAGATCTCAGTCGATCACCTTCGAAGAACTTCATGTTCTTCTTGTTTCTGAAGAGTCT
GCCCTTGAGAAGCAGATTAAACGTGATGAGGCCTTTGCTCAGCCTACTGCTTTACTTGCACAAACTTCTTCAAATCGCAATCAGAACTACAATCAAATATACCTCGAGGT
CGTGGAAATTCTGGAAATTCTCGAGGCCGTGGAAGAGGTTTTGGTCGTCCTCCTTTTTGTAATTCTGGTCGTGGACGTATACCTGGACATCCTGTTGGTAATACTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGATTCTACTGCCAGAGATCTCAATTCTTCTATTTTTCTTCTTTCAAATATTTGTAATCTGGTCTCAATTCGGCTAGATTCTACAAATTTTGTTCTATGGAAGCA
TCAGTTGCGATCAATCTTGAAGGCGCACAAATTGTTTGGATTCATTGATGGATCCTTTCCGAAACCGGCTTCATCTGTACGTACTACCACTTCTTCTTCGTCTACTGATT
CTTCATCTACTGTTTCTCAGCCAAATCCACAATATGAGGACTGGATTGCCAAAGATCACGCACTACAAACTCAGATTAATGCAACTCTTTCTCCCACTGCATTGGCCTAT
ATTGTTGGTTGTGAGACTGCACAACAAATGTGGGAAACTCTAGAGAAACACTACTCTTCTACTTCGCGAACAAATGTCGTTAACCTAAAATCAGAGTTGCAAACTATTGC
GAAGAAATCTGGTGAAAGCATCACACAATACATCCAACGCATCAAGGAGTTGAAAGATAAATTGGCCAATGTTTCCGTCACGATTCATGTAGAGGATCTGGTGATCTACA
CGATGAGCGGCTTACCTGCTGAATACAACACTTCCAAGACGTCTCTGCGTACTAGATCTCAGTCGATCACCTTCGAAGAACTTCATGTTCTTCTTGTTTCTGAAGAGTCT
GCCCTTGAGAAGCAGATTAAACGTGATGAGGCCTTTGCTCAGCCTACTGCTTTACTTGCACAAACTTCTTCAAATCGCAATCAGAACTACAATCAAATATACCTCGAGGT
CGTGGAAATTCTGGAAATTCTCGAGGCCGTGGAAGAGGTTTTGGTCGTCCTCCTTTTTGTAATTCTGGTCGTGGACGTATACCTGGACATCCTGTTGGTAATACTCTGA
Protein sequenceShow/hide protein sequence
MADSTARDLNSSIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSTVSQPNPQYEDWIAKDHALQTQINATLSPTALAY
IVGCETAQQMWETLEKHYSSTSRTNVVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIHVEDLVIYTMSGLPAEYNTSKTSLRTRSQSITFEELHVLLVSEES
ALEKQIKRDEAFAQPTALLAQTSSNRNQNYNQIYLEVVEILEILEAVEEVLVVLLFVILVVDVYLDILLVIL