; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025427 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025427
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr10:12814647..12820408
RNA-Seq ExpressionLag0025427
SyntenyLag0025427
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.1e-6247.8Show/hide
Query:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL
        M++  T  KS SK K   +  +H        TL+ERQKK+Y FP++D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI + VE CF LK+L
Subjt:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL

Query:  IIKLAMEGKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNFHKK--EKK
        I KLA E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF ++  ++  E  
Subjt:  IIKLAMEGKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNFHKK--EKK

Query:  NFATSYCIDV-------EEVDNSEKGEQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPK
           T+  ++V       EE+DNS + +QRTSVFDCIKP TTR SVFQR+SM   +EENQC   T+ + SAF+RLS+S SKK +PST  FDRLK+T+DQ +
Subjt:  NFATSYCIDV-------EEVDNSEKGEQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPK

Query:  RKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLINTEGS
        R+M  L+ K F E   D K+ S +PSRMKRK SV INTEGS
Subjt:  RKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLINTEGS

KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa]4.7e-6242.08Show/hide
Query:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL
        M++  T  KS SK K + +  +H        TL+ERQKK+Y FP++D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI + VE CF LK+L
Subjt:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL

Query:  IIKLAMEGKIELDLDEVAQSN-------------------------------------------------------------------------------
        I+KL  E KIELD+DEVAQ+N                                                                               
Subjt:  IIKLAMEGKIELDLDEVAQSN-------------------------------------------------------------------------------

Query:  -----LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDV----------EEVDNSEKGEQRTSVFDCIK
                I  K K +R K   K +P + + + F QP+Q + L + F ++F   H KE       +   +          EEVDNS + +QRTSVFD IK
Subjt:  -----LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDV----------EEVDNSEKGEQRTSVFDCIK

Query:  PPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLIN
        P TTR SVFQR+S+   EEENQC  ST TR SAF+ LS+STSKK +PSTS FDRLK+ +DQ +R+M +L+VK F E   D K+ S +PSRMKRK SV IN
Subjt:  PPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLIN

Query:  TEGS
        TEGS
Subjt:  TEGS

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-6251.43Show/hide
Query:  MVINTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL
        MV++ T  KS SK K     R+ +G      TLKERQ+K+Y FP++D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI +PVE CF LK+L
Subjt:  MVINTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL

Query:  IIKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKG
        I+KLA E KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +  + +  N+ +S     +EV+NS + 
Subjt:  IIKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKG

Query:  EQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPS
         QRTSVFD IKP TTR SVFQR+S+   EEENQC    +TR S  +RLS+ST KK +PSTS FDRLK+T+DQ +R+M + + K F E   D K+ S +PS
Subjt:  EQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPS

Query:  RMKRKFSVLINTEGS
        RMKRK  V INTEGS
Subjt:  RMKRKFSVLINTEGS

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.7e-5950.16Show/hide
Query:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL
        MV+  T  KS SK K   +  +H        TL+ERQKK+Y FP++D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI +PVE CF LK+L
Subjt:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL

Query:  IIKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKG
        I+KLA E KI+LD+DE  +               KD   LQP+R         RS     P++++ +    + +    E  N   SY    EEVDNS + 
Subjt:  IIKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKG

Query:  EQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPS
        +QRTSVFD IKP TTR  VFQR+SM   EEENQC  ST+TR SAF+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ K F E   D K+ S IPS
Subjt:  EQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPS

Query:  RMKRKFSVLINTEGS
           RK SV IN EGS
Subjt:  RMKRKFSVLINTEGS

TYK20938.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.7e-5940.14Show/hide
Query:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQ-KKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKD
        M++  T  KS SK K   +  +H        TL+ERQ KK+Y FP++D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHR+I +PV+ CF LK+
Subjt:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQ-KKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKD

Query:  LIIKLAMEGKIELDLDEVAQSN---------------------------------------------------------------------LATIKEKS-
        LI+KLA E KIELD+DEVAQ+N                                                                      ++I+ KS 
Subjt:  LIIKLAMEGKIELDLDEVAQSN---------------------------------------------------------------------LATIKEKS-

Query:  --------------KHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDV----------EEVDNSEKGEQRTSVFDCI
                      K +R K   K +P + + + F QP++ + L K   +NF   H +E       +   +          EEVDNS + +QRTSVF+ I
Subjt:  --------------KHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDV----------EEVDNSEKGEQRTSVFDCI

Query:  KPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLI
        KP TTR SVFQR+SM   EEENQC  ST+ R SA +RLS+STSKK +PST  FDRLK+T+DQ +R+M +L+ K F E   D K+   IPSR+KRKFSV I
Subjt:  KPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLI

Query:  NTEGSSLYPVTL----FLLQVRGFSVVRLLR
        NTE S ++   L    F  + +GF V R  R
Subjt:  NTEGSSLYPVTL----FLLQVRGFSVVRLLR

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein1.0e-6247.8Show/hide
Query:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL
        M++  T  KS SK K   +  +H        TL+ERQKK+Y FP++D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP YCKYHRVI + VE CF LK+L
Subjt:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL

Query:  IIKLAMEGKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNFHKK--EKK
        I KLA E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF ++  ++  E  
Subjt:  IIKLAMEGKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNFHKK--EKK

Query:  NFATSYCIDV-------EEVDNSEKGEQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPK
           T+  ++V       EE+DNS + +QRTSVFDCIKP TTR SVFQR+SM   +EENQC   T+ + SAF+RLS+S SKK +PST  FDRLK+T+DQ +
Subjt:  NFATSYCIDV-------EEVDNSEKGEQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPK

Query:  RKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLINTEGS
        R+M  L+ K F E   D K+ S +PSRMKRK SV INTEGS
Subjt:  RKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLINTEGS

A0A5A7TGM1 Retrotransposon gag protein2.3e-6242.08Show/hide
Query:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL
        M++  T  KS SK K + +  +H        TL+ERQKK+Y FP++D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI + VE CF LK+L
Subjt:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL

Query:  IIKLAMEGKIELDLDEVAQSN-------------------------------------------------------------------------------
        I+KL  E KIELD+DEVAQ+N                                                                               
Subjt:  IIKLAMEGKIELDLDEVAQSN-------------------------------------------------------------------------------

Query:  -----LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDV----------EEVDNSEKGEQRTSVFDCIK
                I  K K +R K   K +P + + + F QP+Q + L + F ++F   H KE       +   +          EEVDNS + +QRTSVFD IK
Subjt:  -----LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDV----------EEVDNSEKGEQRTSVFDCIK

Query:  PPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLIN
        P TTR SVFQR+S+   EEENQC  ST TR SAF+ LS+STSKK +PSTS FDRLK+ +DQ +R+M +L+VK F E   D K+ S +PSRMKRK SV IN
Subjt:  PPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLIN

Query:  TEGS
        TEGS
Subjt:  TEGS

A0A5A7URH1 Ty3-gypsy retrotransposon protein6.0e-6351.43Show/hide
Query:  MVINTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL
        MV++ T  KS SK K     R+ +G      TLKERQ+K+Y FP++D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI +PVE CF LK+L
Subjt:  MVINTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL

Query:  IIKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKG
        I+KLA E KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + +  + +  N+ +S     +EV+NS + 
Subjt:  IIKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKG

Query:  EQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPS
         QRTSVFD IKP TTR SVFQR+S+   EEENQC    +TR S  +RLS+ST KK +PSTS FDRLK+T+DQ +R+M + + K F E   D K+ S +PS
Subjt:  EQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPS

Query:  RMKRKFSVLINTEGS
        RMKRK  V INTEGS
Subjt:  RMKRKFSVLINTEGS

A0A5A7VFA5 Ty3-gypsy retrotransposon protein8.2e-6050.16Show/hide
Query:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL
        MV+  T  KS SK K   +  +H        TL+ERQKK+Y FP++D+ DMLEQL+E QLI+L  CKRP +  KVDDP YCKYHRVI +PVE CF LK+L
Subjt:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDL

Query:  IIKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKG
        I+KLA E KI+LD+DE  +               KD   LQP+R         RS     P++++ +    + +    E  N   SY    EEVDNS + 
Subjt:  IIKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKG

Query:  EQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPS
        +QRTSVFD IKP TTR  VFQR+SM   EEENQC  ST+TR SAF+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ K F E   D K+ S IPS
Subjt:  EQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPS

Query:  RMKRKFSVLINTEGS
           RK SV IN EGS
Subjt:  RMKRKFSVLINTEGS

A0A5D3DBR8 Retrotransposon gag protein8.2e-6040.14Show/hide
Query:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQ-KKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKD
        M++  T  KS SK K   +  +H        TL+ERQ KK+Y FP++D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHR+I +PV+ CF LK+
Subjt:  MVINTTLPKSSSKEKRQTNGAHH-------LTLKERQ-KKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKD

Query:  LIIKLAMEGKIELDLDEVAQSN---------------------------------------------------------------------LATIKEKS-
        LI+KLA E KIELD+DEVAQ+N                                                                      ++I+ KS 
Subjt:  LIIKLAMEGKIELDLDEVAQSN---------------------------------------------------------------------LATIKEKS-

Query:  --------------KHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDV----------EEVDNSEKGEQRTSVFDCI
                      K +R K   K +P + + + F QP++ + L K   +NF   H +E       +   +          EEVDNS + +QRTSVF+ I
Subjt:  --------------KHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNF---HKKEKKNFATSYCIDV----------EEVDNSEKGEQRTSVFDCI

Query:  KPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLI
        KP TTR SVFQR+SM   EEENQC  ST+ R SA +RLS+STSKK +PST  FDRLK+T+DQ +R+M +L+ K F E   D K+   IPSR+KRKFSV I
Subjt:  KPPTTRPSVFQRMSMVATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLI

Query:  NTEGSSLYPVTL----FLLQVRGFSVVRLLR
        NTE S ++   L    F  + +GF V R  R
Subjt:  NTEGSSLYPVTL----FLLQVRGFSVVRLLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTATCAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACAAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCATTTCCC
TAATGCCGACATCCCTGATATGCTGGAACAATTACTGGAAGCACAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATT
GCAAGTATCATCGAGTTATTGGTTATCCAGTGGAAACATGTTTTGCCCTAAAGGACTTAATTATAAAACTAGCTATGGAAGGAAAAATTGAGCTCGACCTTGATGAAGTA
GCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCA
ACAACTGGTGATGTTGAATAAATCCTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACTTTGCGACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCTGAGA
AGGGTGAACAAAGGACCTCCGTCTTTGATTGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGTCGCGACAGAAGAAGAAAATCAATGTTCG
GTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCA
ACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAAACGCGACAAGAAGCTTCAAAGTAGCATCCCGTCACGTATGAAAAGGAAGTTCTCTGTTC
TCATAAATACAGAAGGTTCTTCGTTGTATCCTGTTACGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGA
TCTTATGTGGTGCGCTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCCGGAGTTCCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTTCG
TTGCAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCTCCCCAAGCTCGAAGGTTCACGCACTTCGTTGTAGTTCTTTCTCCCCAAGTTC
GAAGGTTCACGCACTTCGCTACAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCATTTCGCTGCAATTCCTTCCTCCAAGTTCGAAGGCTCTCACATCCCTTCGCTGAC
GCCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACATCCCTTCGGAGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACATCCCTTCGGAGCAGTTCCTTC
CTCCAAGTTCGAAGGTTCTCACGCCGCTTCGCTGCAGTTTCTTCCTCCAAGTTCGAAAGTCCTCACGCCGCTTCGCTACAGTTCCTTCCTCCAAGTTCAAGGGTTCTCAC
GCGCTTTGCTGCGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGTTGCA
GTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCATTGCAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGTTGCAGTTCCTCCTCCAAGTTCGAGGGTTCT
CACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTCATGTTTCACACGTCGCTTCGCTGCGTT
CCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCAACAGTTCATTCTCTCCAAATTCGAGGTTCGAAGGTTCTCAAGTGTTACACTTCCTTCTTTAAGTTCGAAGGTT
CCCACGTTGCGCTGTTGTGTTGCTTCCTTCTCCAAGTTCAAAGGTTCTAACGTTGCGGTGCTTCCTTCATCAAGTTCGAAGTTCCTTATCTCCAATTTCGAAGGATCTCG
CACATTTCGCTTTCCTTCTCCAAGTTCAAAGGTTCTCACGTTGCTTCGCTGCAGTTCTTTCCTCCAAGTTCAAAGGGGTTCTCATGCAACGCCTTCCTCCATGTTCGAAG
GATCTCACGCATTTCGTTGTAGTTCCTTCTCTCAGAGTTTAGAGTTCAAAGTTTCTTCTCTCCAAGTTCGAAGGTCCTCACGTTGCATCGCTGCAGTCCCTTCTCTCAAG
TTTGAAAGTTCTCACACTGCTTCGAAGGTTCTCACGTGCTTCGCTACAGTTCCTTCAGGTCCTTACATTGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGT
GACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAAACAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACACCCCTGCAGGAA
ACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGAC
TGGTCTAGACAGGGGGGATTCACAGAGCAGAGAGGCAGAGTCCAGAGCATTCTCCAAGATCCAGAGTCGTTAGAATCCAAGAGTCCAGAGAATTCAGAGATCCGAGATTC
AGAATTCAAAGGATTCAAGACTCAGAAGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTATCAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACAAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCATTTCCC
TAATGCCGACATCCCTGATATGCTGGAACAATTACTGGAAGCACAACTGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATT
GCAAGTATCATCGAGTTATTGGTTATCCAGTGGAAACATGTTTTGCCCTAAAGGACTTAATTATAAAACTAGCTATGGAAGGAAAAATTGAGCTCGACCTTGATGAAGTA
GCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCA
ACAACTGGTGATGTTGAATAAATCCTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACTTTGCGACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCTGAGA
AGGGTGAACAAAGGACCTCCGTCTTTGATTGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGTCGCGACAGAAGAAGAAAATCAATGTTCG
GTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCA
ACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAAACGCGACAAGAAGCTTCAAAGTAGCATCCCGTCACGTATGAAAAGGAAGTTCTCTGTTC
TCATAAATACAGAAGGTTCTTCGTTGTATCCTGTTACGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGA
TCTTATGTGGTGCGCTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCCGGAGTTCCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTTCG
TTGCAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCTCCCCAAGCTCGAAGGTTCACGCACTTCGTTGTAGTTCTTTCTCCCCAAGTTC
GAAGGTTCACGCACTTCGCTACAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCATTTCGCTGCAATTCCTTCCTCCAAGTTCGAAGGCTCTCACATCCCTTCGCTGAC
GCCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACATCCCTTCGGAGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACATCCCTTCGGAGCAGTTCCTTC
CTCCAAGTTCGAAGGTTCTCACGCCGCTTCGCTGCAGTTTCTTCCTCCAAGTTCGAAAGTCCTCACGCCGCTTCGCTACAGTTCCTTCCTCCAAGTTCAAGGGTTCTCAC
GCGCTTTGCTGCGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGTTGCA
GTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCATTGCAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGTTGCAGTTCCTCCTCCAAGTTCGAGGGTTCT
CACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTCATGTTTCACACGTCGCTTCGCTGCGTT
CCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCAACAGTTCATTCTCTCCAAATTCGAGGTTCGAAGGTTCTCAAGTGTTACACTTCCTTCTTTAAGTTCGAAGGTT
CCCACGTTGCGCTGTTGTGTTGCTTCCTTCTCCAAGTTCAAAGGTTCTAACGTTGCGGTGCTTCCTTCATCAAGTTCGAAGTTCCTTATCTCCAATTTCGAAGGATCTCG
CACATTTCGCTTTCCTTCTCCAAGTTCAAAGGTTCTCACGTTGCTTCGCTGCAGTTCTTTCCTCCAAGTTCAAAGGGGTTCTCATGCAACGCCTTCCTCCATGTTCGAAG
GATCTCACGCATTTCGTTGTAGTTCCTTCTCTCAGAGTTTAGAGTTCAAAGTTTCTTCTCTCCAAGTTCGAAGGTCCTCACGTTGCATCGCTGCAGTCCCTTCTCTCAAG
TTTGAAAGTTCTCACACTGCTTCGAAGGTTCTCACGTGCTTCGCTACAGTTCCTTCAGGTCCTTACATTGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGT
GACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAAACAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACACCCCTGCAGGAA
ACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGAC
TGGTCTAGACAGGGGGGATTCACAGAGCAGAGAGGCAGAGTCCAGAGCATTCTCCAAGATCCAGAGTCGTTAGAATCCAAGAGTCCAGAGAATTCAGAGATCCGAGATTC
AGAATTCAAAGGATTCAAGACTCAGAAGATTTGA
Protein sequenceShow/hide protein sequence
MVINTTLPKSSSKEKRQTNGAHHLTLKERQKKIYHFPNADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGYPVETCFALKDLIIKLAMEGKIELDLDEV
AQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNFHKKEKKNFATSYCIDVEEVDNSEKGEQRTSVFDCIKPPTTRPSVFQRMSMVATEEENQCS
VSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVKRDKKLQSSIPSRMKRKFSVLINTEGSSLYPVTLFLLQVRGFSVVRLLRCSSSKCEG
SYVVRCCIVPSSLKFDGSHAALPEFLLPKFEGSHALRCSSFSPSSKVLTRFVAVPSPQARRFTHFVVVLSPQVRRFTHFATVPSPKFEGSHAFRCNSFLQVRRLSHPFAD
AASLQFLPPSSKVLTSLRSSSFLQVRRFSHPFGAVPSSKFEGSHAASLQFLPPSSKVLTPLRYSSFLQVQGFSRALLRASLQFLPPSSRVLTRFAAVPPPSSRVLTRFVA
VPPPSSRVLTRFIAVPPPSSRVLTRFVAVPPPSSRVLTRFAAVPSSKFEVLTRFAAVPSSKFEGHVSHVASLRSFLQVRRFSCASQQFILSKFEVRRFSSVTLPSLSSKV
PTLRCCVASFSKFKGSNVAVLPSSSSKFLISNFEGSRTFRFPSPSSKVLTLLRCSSFLQVQRGSHATPSSMFEGSHAFRCSSFSQSLEFKVSSLQVRRSSRCIAAVPSLK
FESSHTASKVLTCFATVPSGPYIVVKSLQVKLMTTVVTTPAGNYSHQSDWSKQTGGEITASEADDDRGDTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSD
WSRQGGFTEQRGRVQSILQDPESLESKSPENSEIRDSEFKGFKTQKI