; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009771 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009771
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr9:42185408..42186235
RNA-Seq ExpressionLag0009771
SyntenyLag0009771
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK17989.1 uncharacterized protein E5676_scaffold306G002980 [Cucumis melo var. makuwa]7.2e-7559.84Show/hide
Query:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSS-----TDSSSAVSQPNPQYEDWIAKDHALQ
        + ST+++L+SP+FLL+NICNL+SIRLDSTN+ LWK Q   +LKAHKL+GFID S P P  ++   T++SS     T SSS     NP YEDW AKD A  
Subjt:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSS-----TDSSSAVSQPNPQYEDWIAKDHALQ

Query:  TLINATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFK
         LINATLS  ALTY+VGC+++ Q+W+TLE+HYSS +RTNIVNLKS+LQ I+KK  E I  YI++IKE+KDKLAN +  ++ EDLVIY ++GLP EYN F+
Subjt:  TLINATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFK

Query:  TSLRTRSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSSN
        TS++TRSQ V+F ELH+LL SEESALEKQ KR++   QPTA+LA  ++N
Subjt:  TSLRTRSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSSN

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]2.7e-7464.61Show/hide
Query:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLS  AL Y+VG  +++Q+W+ L K YSS SR+N+VNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ VTFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]2.7e-7464.61Show/hide
Query:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLS  AL Y+VG  +++Q+W+ L K YSS SR+N+VNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ VTFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]2.2e-7662.02Show/hide
Query:  STARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKP----ASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLI
        +T +DL+SPIFLLSNICNLVSIRLDST+F+LWK QL +ILKAHKLFGFIDGS   P    ASS  T +  ++T S   +   NP +EDWIAKD AL TLI
Subjt:  STARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKP----ASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLI

Query:  NATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSL
        NATLS+ AL Y+V   T++Q+WE LEKHYSS SRTN+VNLKS+LQ+I KK+ ESI  Y++RIKE+KDK ANVS+TI+ E L+IY ++GL  EYNT  TS+
Subjt:  NATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSL

Query:  RTRSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQT--SSNRNQNYNPNIS
        RTR+QSV+FEELHV + SEESA+EKQ+KR++   QP AL A +  S NR   ++PN S
Subjt:  RTRSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQT--SSNRNQNYNPNIS

XP_022158689.1 uncharacterized protein LOC111025150 [Momordica charantia]2.5e-7567.26Show/hide
Query:  MADST--ARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTL
        MADS+   +DL+SPIFLLSNICNLVS+RLDS+NFVLWK QL +ILKAHKL+GFIDGS PKPA  + +    SS+   +A    NP + +WIAKDHAL TL
Subjt:  MADST--ARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTL

Query:  INATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTS
        +NA LSS+AL Y+VGC+++QQ+W+TL KHYSS+SRTN+VNLKS+LQ+I+KK G SI  Y+QRIKELKDKLANV V +D EDL+IYT++ LP E+N F+TS
Subjt:  INATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTS

Query:  LRTRSQSVTFEELHVLLVSEESALEK
        +RTRSQSV+FEELHVLLVSEE+A++K
Subjt:  LRTRSQSVTFEELHVLLVSEESALEK

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.3e-7464.61Show/hide
Query:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLS  AL Y+VG  +++Q+W+ L K YSS SR+N+VNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ VTFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

A0A5D3CLI6 T4.51.3e-7464.61Show/hide
Query:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA
        + S  +D  SPIFLLSNICNL+S+RLDSTNFVLWK QL +ILKAHKL+GFIDG+ P P    RT  SSS   +S+   Q NP YEDWIAKD AL T+INA
Subjt:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINA

Query:  TLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT
        TLS  AL Y+VG  +++Q+W+ L K YSS SR+N+VNLKS+LQTI KK  ESI  YI+RIKE+KDKLANVS  I+ EDL+IY ++GLP EYNTF+TS+RT
Subjt:  TLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRT

Query:  RSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS
        RSQ VTFEELHVLL +EESAL KQ K D+++ QPT LL+ + S
Subjt:  RSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSS

A0A5D3D3T6 Retrotran_gag_3 domain-containing protein3.5e-7559.84Show/hide
Query:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSS-----TDSSSAVSQPNPQYEDWIAKDHALQ
        + ST+++L+SP+FLL+NICNL+SIRLDSTN+ LWK Q   +LKAHKL+GFID S P P  ++   T++SS     T SSS     NP YEDW AKD A  
Subjt:  ADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSS-----TDSSSAVSQPNPQYEDWIAKDHALQ

Query:  TLINATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFK
         LINATLS  ALTY+VGC+++ Q+W+TLE+HYSS +RTNIVNLKS+LQ I+KK  E I  YI++IKE+KDKLAN +  ++ EDLVIY ++GLP EYN F+
Subjt:  TLINATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFK

Query:  TSLRTRSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSSN
        TS++TRSQ V+F ELH+LL SEESALEKQ KR++   QPTA+LA  ++N
Subjt:  TSLRTRSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQTSSN

A0A6J1D9L6 uncharacterized protein LOC1110188921.1e-7662.02Show/hide
Query:  STARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKP----ASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLI
        +T +DL+SPIFLLSNICNLVSIRLDST+F+LWK QL +ILKAHKLFGFIDGS   P    ASS  T +  ++T S   +   NP +EDWIAKD AL TLI
Subjt:  STARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKP----ASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLI

Query:  NATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSL
        NATLS+ AL Y+V   T++Q+WE LEKHYSS SRTN+VNLKS+LQ+I KK+ ESI  Y++RIKE+KDK ANVS+TI+ E L+IY ++GL  EYNT  TS+
Subjt:  NATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSL

Query:  RTRSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQT--SSNRNQNYNPNIS
        RTR+QSV+FEELHV + SEESA+EKQ+KR++   QP AL A +  S NR   ++PN S
Subjt:  RTRSQSVTFEELHVLLVSEESALEKQIKRDEAFAQPTALLAQT--SSNRNQNYNPNIS

A0A6J1E049 uncharacterized protein LOC1110251501.2e-7567.26Show/hide
Query:  MADST--ARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTL
        MADS+   +DL+SPIFLLSNICNLVS+RLDS+NFVLWK QL +ILKAHKL+GFIDGS PKPA  + +    SS+   +A    NP + +WIAKDHAL TL
Subjt:  MADST--ARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTL

Query:  INATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTS
        +NA LSS+AL Y+VGC+++QQ+W+TL KHYSS+SRTN+VNLKS+LQ+I+KK G SI  Y+QRIKELKDKLANV V +D EDL+IYT++ LP E+N F+TS
Subjt:  INATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTS

Query:  LRTRSQSVTFEELHVLLVSEESALEK
        +RTRSQSV+FEELHVLLVSEE+A++K
Subjt:  LRTRSQSVTFEELHVLLVSEESALEK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.0e-0824Show/hide
Query:  FVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSSTALTYIVGCETAQQMWETLEKHYSST
        F  W+ ++R +L    L   +D    KP             D+  A        EDW   D    + I   LS   +  I+  +TA+ +W  LE  Y S 
Subjt:  FVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSSTALTYIVGCETAQQMWETLEKHYSST

Query:  SRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTFEELHVLLVSEESALEKQIKRDEA
        + TN + LK +L  +    G +   ++     L  +LAN+ V I+ ED  I  ++ LP+ Y+   T++     ++  +++   L+  E   +K   + +A
Subjt:  SRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTFEELHVLLVSEESALEKQIKRDEA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.8e-1627.2Show/hide
Query:  LNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSSTAL
        LN+   L  N+ N+   +L STN+++W  Q+ ++   ++L GF+DGS          TT   +T  + A  + NP Y  W  +D  + + +   +S +  
Subjt:  LNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSSTAL

Query:  TYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTF
          +    TA Q+WETL K Y++ S  ++  L+++L+   K + ++I  Y+Q +    D+LA +   +D ++ V   +  LP EY      +  +    T 
Subjt:  TYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTF

Query:  EELHVLLVSEESALEKQIKRDEAFAQPTALLA-----QTSSNRNQNYNPN
         E+H  L++ ES   K +    A   P    A      T++N N N N N
Subjt:  EELHVLLVSEESALEKQIKRDEAFAQPTALLA-----QTSSNRNQNYNPN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.3e-1425.88Show/hide
Query:  RLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSSTALTYIVGCETAQQMWETLE
        +L STN+++W  Q+ ++   ++L GF+DGS P P +++ T           AV + NP Y  W  +D  + + I   +S +    +    TA Q+WETL 
Subjt:  RLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSSTALTYIVGCETAQQMWETLE

Query:  KHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTFEELHVLLVSEESALEKQ
        K Y++ S  ++  L+                +I R     D+LA +   +D ++ V   +  LP +Y      +  +    +  E+H  L++ ES   K 
Subjt:  KHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTFEELHVLLVSEESALEKQ

Query:  IKRDEAFAQPTALLAQTSSNRNQNYNPN
        +  + A   P      T  N N N N N
Subjt:  IKRDEAFAQPTALLAQTSSNRNQNYNPN

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).3.3e-0925.3Show/hide
Query:  STARDLNSPIFLLSNI-----CNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTL
        S   D +SP +L  +I      ++  +  D  N+V WK + RS L+  K FGFIDG+ PKP                      +P Y+ W   +  +   
Subjt:  STARDLNSPIFLLSNI-----CNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTL

Query:  INATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKEL
        +  +++   L  ++  ETA +MWE L + +       I  L+  L T+ ++ G+S+ +Y  ++ ++
Subjt:  INATLSSTALTYIVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGATTCTACTGCCAGAGATCTCAATTCTCCTATTTTTCTTCTTTCAAATATTTGTAATCTGGTCTCAATTCGGCTAGATTCTACAAATTTTGTTCTATGGAAGCA
TCAGTTGCGATCAATCTTGAAGGCGCACAAATTGTTTGGATTCATTGATGGATCCTTTCCGAAACCGGCTTCATCTGTACGTACTACCACTTCTTCTTCGTCTACTGATT
CTTCATCTGCTGTTTCTCAGCCAAATCCACAATATGAGGACTGGATTGCCAAAGATCACGCACTACAAACTCTGATTAATGCAACTCTTTCTTCCACTGCCTTGACCTAT
ATTGTTGGTTGTGAGACTGCACAACAAATGTGGGAAACTCTAGAGAAACACTACTCTTCTACTTCGCGAACAAATATCGTTAACCTAAAATCAGAGTTGCAAACTATTGC
GAAGAAATCTGGTGAAAGCATCACACAATACATCCAACGCATCAAGGAGTTGAAAGATAAATTGGCCAATGTTTCCGTCACGATTGATGTAGAGGATCTGGTGATCTACA
CGATGAGCGGCTTACCTGCTGAATACAACACTTTCAAGACGTCTCTGCGTACTAGATCTCAGTCGGTCACCTTCGAAGAACTTCATGTTCTTCTTGTTTCTGAAGAGTCT
GCCCTTGAGAAACAGATTAAACGTGATGAGGCCTTTGCTCAGCCTACTGCTTTACTTGCACAAACTTCTTCAAATCGCAATCAGAACTACAATCCAAATATATCTCCTAT
TTGTCAATTGTTTCAGTTTCCACATTTTGCTATGTGCAATAGGGATTCAATACATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGATTCTACTGCCAGAGATCTCAATTCTCCTATTTTTCTTCTTTCAAATATTTGTAATCTGGTCTCAATTCGGCTAGATTCTACAAATTTTGTTCTATGGAAGCA
TCAGTTGCGATCAATCTTGAAGGCGCACAAATTGTTTGGATTCATTGATGGATCCTTTCCGAAACCGGCTTCATCTGTACGTACTACCACTTCTTCTTCGTCTACTGATT
CTTCATCTGCTGTTTCTCAGCCAAATCCACAATATGAGGACTGGATTGCCAAAGATCACGCACTACAAACTCTGATTAATGCAACTCTTTCTTCCACTGCCTTGACCTAT
ATTGTTGGTTGTGAGACTGCACAACAAATGTGGGAAACTCTAGAGAAACACTACTCTTCTACTTCGCGAACAAATATCGTTAACCTAAAATCAGAGTTGCAAACTATTGC
GAAGAAATCTGGTGAAAGCATCACACAATACATCCAACGCATCAAGGAGTTGAAAGATAAATTGGCCAATGTTTCCGTCACGATTGATGTAGAGGATCTGGTGATCTACA
CGATGAGCGGCTTACCTGCTGAATACAACACTTTCAAGACGTCTCTGCGTACTAGATCTCAGTCGGTCACCTTCGAAGAACTTCATGTTCTTCTTGTTTCTGAAGAGTCT
GCCCTTGAGAAACAGATTAAACGTGATGAGGCCTTTGCTCAGCCTACTGCTTTACTTGCACAAACTTCTTCAAATCGCAATCAGAACTACAATCCAAATATATCTCCTAT
TTGTCAATTGTTTCAGTTTCCACATTTTGCTATGTGCAATAGGGATTCAATACATTAA
Protein sequenceShow/hide protein sequence
MADSTARDLNSPIFLLSNICNLVSIRLDSTNFVLWKHQLRSILKAHKLFGFIDGSFPKPASSVRTTTSSSSTDSSSAVSQPNPQYEDWIAKDHALQTLINATLSSTALTY
IVGCETAQQMWETLEKHYSSTSRTNIVNLKSELQTIAKKSGESITQYIQRIKELKDKLANVSVTIDVEDLVIYTMSGLPAEYNTFKTSLRTRSQSVTFEELHVLLVSEES
ALEKQIKRDEAFAQPTALLAQTSSNRNQNYNPNISPICQLFQFPHFAMCNRDSIH