CuGenDBv2

Lag0022711 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022711
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr7:36100117..36108349
RNA-Seq Expression: Lag0022711
Synteny: Lag0022711
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
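The genome location above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The record itself does not confirm this.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr7:36100117..36108349")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the gene region spans 8233 bp on chr7.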


Homology
GenBank top hits    e value    % identity    Alignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]    4.2e-46    43.28
Query:  SGIVYAPINANNFELKTGLIQMARDVHIEDRP---PRIQILI---------LNHSWTFVGRTRL------------VAVYYPGSITTWDALVQAFLKKFF
        S I+  PINANNFELK  LI M +       P   P + + +         +N       R RL            +    PGSI +W  + + FL KFF
Subjt:  SGIVYAPINANNFELKTGLIQMARDVHIEDRP---PRIQILI---------LNHSWTFVGRTRL------------VAVYYPGSITTWDALVQAFLKKFF

Query:  PPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTP
        PPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  
Subjt:  PPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTP

Query:  KKIVAGVFEVDKDNQIEEAVIAINTTVNGHSAAIKNIETQ-LGQLVNVVSTMNKGKAPAEQEKPQMEY
        KK VAG+ E+       E + A++  V   S  I  + TQ + Q    V++ +      E  + Q++Y
Subjt:  KKIVAGVFEVDKDNQIEEAVIAINTTVNGHSAAIKNIETQ-LGQLVNVVSTMNKGKAPAEQEKPQMEY

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]    4.2e-46    43.28
Query:  SGIVYAPINANNFELKTGLIQMARDVHIEDRP---PRIQILI---------LNHSWTFVGRTRL------------VAVYYPGSITTWDALVQAFLKKFF
        S I+  PINANNFELK  LI M +       P   P + + +         +N       R RL            +    PGSI +W  + + FL KFF
Subjt:  SGIVYAPINANNFELKTGLIQMARDVHIEDRP---PRIQILI---------LNHSWTFVGRTRL------------VAVYYPGSITTWDALVQAFLKKFF

Query:  PPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTP
        PPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  
Subjt:  PPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTP

Query:  KKIVAGVFEVDKDNQIEEAVIAINTTVNGHSAAIKNIETQ-LGQLVNVVSTMNKGKAPAEQEKPQMEY
        KK VAG+ E+       E + A++  V   S  I  + TQ + Q    V++ +      E  + Q++Y
Subjt:  KKIVAGVFEVDKDNQIEEAVIAINTTVNGHSAAIKNIETQ-LGQLVNVVSTMNKGKAPAEQEKPQMEY

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]    1.6e-45    42.91
Query:  SGIVYAPINANNFELKTGLIQMARDVHIEDRP---PRIQILI---------LNHSWTFVGRTRL------------VAVYYPGSITTWDALVQAFLKKFF
        S I+  PINANNFELK  LI M +       P   P I + +         +N       R RL            +    PGSI +W  + + FL KFF
Subjt:  SGIVYAPINANNFELKTGLIQMARDVHIEDRP---PRIQILI---------LNHSWTFVGRTRL------------VAVYYPGSITTWDALVQAFLKKFF

Query:  PPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTP
        PPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  
Subjt:  PPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTP

Query:  KKIVAGVFEVDKDNQIEEAVIAINTTVNGHSAAIKNIETQ-LGQLVNVVSTMNKGKAPAEQEKPQMEY
        KK VAG+ ++       E + A++  V   S  I  + TQ + Q    +++ +      E  + Q++Y
Subjt:  KKIVAGVFEVDKDNQIEEAVIAINTTVNGHSAAIKNIETQ-LGQLVNVVSTMNKGKAPAEQEKPQMEY

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]    1.1e-51    48.26
Query:  QSGIVYAPINANNFELKTGLIQMARDVHIEDRPPRIQILILNHSWTFVGRTRLVAV------------------------YYPGSITTWDALVQAFLKKF
        Q GI+  PIN NNFELK GLIQMAR++    R        L       G  ++  V                          P SITTW+ L QAFL K+
Subjt:  QSGIVYAPINANNFELKTGLIQMARDVHIEDRPPRIQILILNHSWTFVGRTRLVAV------------------------YYPGSITTWDALVQAFLKKF

Query:  FPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERST
        FPPAK+ +LRTEIGTF+Q  DEQL+EAWER+K+LLR+CPQHGYPDWLQ+QLFYNGL  STK+I+DA AGG++ SK  + A T+LED+AT SY WP ER++
Subjt:  FPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERST

Query:  PK-KIVAGVFEVDKDNQIEEAVIAINTTVN
        P     AG++EVD+ N ++  + ++   ++
Subjt:  PK-KIVAGVFEVDKDNQIEEAVIAINTTVN

XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia]    2.8e-50    39.37
Query:  GIVYAPINANNFELKTGLIQMARDVHI-----EDRPPRIQILI-------LNHSWTFVGRTRLVAVYYPGSITTWDALVQAFLKKFFPPAKTVKLRTEIG
        GI+  PINANN ELK GLIQM R+        ED    + I +       +N       R RL    +P S+   + +VQAFL  FFPPAKT +LRTEI 
Subjt:  GIVYAPINANNFELKTGLIQMARDVHI-----EDRPPRIQILI-------LNHSWTFVGRTRLVAVYYPGSITTWDALVQAFLKKFFPPAKTVKLRTEIG

Query:  TFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--
        +F++   EQLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAAGGTLLS+T ENA  LL+DMA NS+QWPSERS  KK VAG++E+D  
Subjt:  TFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--

Query:  -----------------------------------------------------------------------KDNQIEEAVIAINTTVNGHSAAIKNIETQ
                                                                               + ++IE  V  +   + G++ +IKN+E Q
Subjt:  -----------------------------------------------------------------------KDNQIEEAVIAINTTVNGHSAAIKNIETQ

Query:  LGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYE
        +GQ+   ++TM KGK P++ E    E+CKA+T+   +  +EPE +  E
Subjt:  LGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYE

TrEMBL top hits    e value    % identity    Alignment
A0A2I4F4C8 uncharacterized protein LOC108995373    1.9e-44    36.83
Query:  SGIVYAPINANNFELKTGLIQMARDVHIEDRP---PRIQILI---------LNHSWTFVGRTRL------------VAVYYPGSITTWDALVQAFLKKFF
        SGI    INANNFELK  LI M +       P   P I + +         +N       R RL            +     GSIT+W  + + FL KFF
Subjt:  SGIVYAPINANNFELKTGLIQMARDVHIEDRP---PRIQILI---------LNHSWTFVGRTRL------------VAVYYPGSITTWDALVQAFLKKFF

Query:  PPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENART-LLEDMATNSYQWPSERST
        PPAKT +LR+EI  F+Q   E L+EAWER+K L+R CPQHG P+WLQVQ+FYNGL   T+TIVDAAAGGTL+SKT+E A T LLE+M +N+YQWP+E++ 
Subjt:  PPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENART-LLEDMATNSYQWPSERST

Query:  PKKI---------------VAGVFEVDKDNQ-----------------------------IEEAVIA------------------INTTVNGHSAAIKNI
         KK+               VA    +   N+                             +E+A+I+                  I+   +   AAIKNI
Subjt:  PKKI---------------VAGVFEVDKDNQ-----------------------------IEEAVIA------------------INTTVNGHSAAIKNI

Query:  ETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQ-EESEEEPESEDYET
        E Q+G+L  +++   +G  P+  E    E CKAIT+    E E  P  E   T
Subjt:  ETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQ-EESEEEPESEDYET

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945    2.0e-41    48.17
Query:  INANNFELKTGLIQMAR-DVHIEDRPP------RIQILILNHSWTFVG------RTRLVAVYY------------PGSITTWDALVQAFLKKFFPPAKTV
        INANNFE+K   IQM +  V     P        +  L +  ++ + G      R RL                  GSITTW+ L Q FL KFFPPAKT 
Subjt:  INANNFELKTGLIQMAR-DVHIEDRPP------RIQILILNHSWTFVG------RTRLVAVYY------------PGSITTWDALVQAFLKKFFPPAKTV

Query:  KLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVAG
        K+R +I +F Q   E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAGG L+SK   +A  LLE+MA+N+YQWPSERS  +K V G
Subjt:  KLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVAG

Query:  VFEVDKDNQIEEAVIAIN
         +E+D    +   V A++
Subjt:  VFEVDKDNQIEEAVIAIN

A0A6J0ZYV0 uncharacterized protein LOC110413413    2.0e-41    48.17
Query:  INANNFELKTGLIQMAR-DVHIEDRPP------RIQILILNHSWTFVG------RTRLVAVYY------------PGSITTWDALVQAFLKKFFPPAKTV
        INANNFE+K   IQM +  V     P        +  L +  ++ + G      R RL                  GSITTW+ L Q FL KFFPPAKT 
Subjt:  INANNFELKTGLIQMAR-DVHIEDRPP------RIQILILNHSWTFVG------RTRLVAVYY------------PGSITTWDALVQAFLKKFFPPAKTV

Query:  KLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVAG
        K+R +I +F Q   E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAGG L+SK   +A  LLE+MA+N+YQWPSERS  +K V G
Subjt:  KLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVAG

Query:  VFEVDKDNQIEEAVIAIN
         +E+D    +   V A++
Subjt:  VFEVDKDNQIEEAVIAIN

A0A6J1DU19 uncharacterized protein LOC111024361    1.4e-50    39.37
Query:  GIVYAPINANNFELKTGLIQMARDVHI-----EDRPPRIQILI-------LNHSWTFVGRTRLVAVYYPGSITTWDALVQAFLKKFFPPAKTVKLRTEIG
        GI+  PINANN ELK GLIQM R+        ED    + I +       +N       R RL    +P S+   + +VQAFL  FFPPAKT +LRTEI 
Subjt:  GIVYAPINANNFELKTGLIQMARDVHI-----EDRPPRIQILI-------LNHSWTFVGRTRLVAVYYPGSITTWDALVQAFLKKFFPPAKTVKLRTEIG

Query:  TFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--
        +F++   EQLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAAGGTLLS+T ENA  LL+DMA NS+QWPSERS  KK VAG++E+D  
Subjt:  TFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVD--

Query:  -----------------------------------------------------------------------KDNQIEEAVIAINTTVNGHSAAIKNIETQ
                                                                               + ++IE  V  +   + G++ +IKN+E Q
Subjt:  -----------------------------------------------------------------------KDNQIEEAVIAINTTVNGHSAAIKNIETQ

Query:  LGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYE
        +GQ+   ++TM KGK P++ E    E+CKA+T+   +  +EPE +  E
Subjt:  LGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYE

A0A6P6XAQ1 Reverse transcriptase    4.4e-41    42.37
Query:  LASRRWQQSGIVYAPINANNFELKTGLIQMARDVH-----IEDRPPRIQ-ILILNHSWTFVG------RTRL------------VAVYYPGSITTWDALV
        L   +  Q+ IV   +NANNFE+K  LIQM +         ED    +   L +  +  F G      + RL            +  + P + TTWD L 
Subjt:  LASRRWQQSGIVYAPINANNFELKTGLIQMARDVH-----IEDRPPRIQ-ILILNHSWTFVG------RTRL------------VAVYYPGSITTWDALV

Query:  QAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQ
        +AFL KFFPP KT KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAAGG L+ KT E A+ L+E+MA N+YQ
Subjt:  QAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQ

Query:  WPSERSTPKKIVAGVFEVDKDNQIEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKG
        W +ER   ++  AG+ EVD              T+N  SA + N+   L + V   S+ N+G
Subjt:  WPSERSTPKKIVAGVFEVDKDNQIEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKG

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGATGCAGGTGTACGACGAGCCCGATGATGGAGACGAGGAGTTAATTTTACACAATTTATGTGAACCCGACATGCTTCTTGGTAGATCGGATTTTGCATCCTCCTTACC
TGTGTTAACTGATAACTGCCCAAAAGGAGTGAGTTTGGTGCTTGAGCGATCGCCTGGGGTAAGGTTTGAGCTTGATCCAGAAATCGAGAGGACATTCAGGAGAAGAAGGA
GAGAGCAGCGAAAAACCAGATGGAGAACGTGCCGCTTCTTCCGCAGGACTAGAGCCATTCGAGCATATGCTTTTCCAATGTTTGATGAGTTGAATCTAGGGATTGCAGTC
CTCAAATTCAGCGGCAATTTTGAAATGAAACCGATAATGTTTCAGATGTTGCAAACCGTGGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTT
TAAGGAGCTTTTCCGAAAGTGTCCCCACCATGGTTTACCACATTGTATCCAAATGAAACATTTTAAATGGGTTAAACGGAGTAACCCAGGCACGAATAAAAAGGTTAAGA
GTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAATGATTAATCATCAGACCACCACTATGGAGTCT
GCTGCAGTGGTGAACCAAGTCGTAGAGAAGCATGTGTCTATTGTGGCTGGCGCAACCACCCCAACTTCTCGTGGGGAGGACAAGGAAGCATGTGCAAGCGCAACAAAGGT
GAACCAGTCGGGATTTGCTAAAGCGCAGGTATTGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGGAGTTCTCTTGAGGCGATGATGAAAGAATTTATGG
CTCGTACAGATGCCCAATTCAAAGTAAGGCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGAGT
GAGTTAGAGACTGGTCAGGGTGCTGGAGGCAGCAATAAAGATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACC
TCTACCTTTTCCACAAAGGCAAAGCCTAAGAATCAGGATGAAGCTAGGTATTGGTGAAGCTAGACCTACCACAGTCACACTCCAGCTAGCTGATAGGTCTATCACATATC
CAGAGGACGAAATGGAGGATTGCTCTTTCATTAGGATTCTGGAGAGCACAGTTGTTGAGACAGCAATACAGGATTCGACTGATAAGCACTTGGAAGATCATGGAGAGGAG
GTGATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCGACAGCAATTGGAAAAAGGTTGATTGGAGCCAAATGCAATAGTGGAGTTAATCGGGTGCTCGGGACGCG
AGAAGATGCGAAGGAAGGAAAAGAATCAAAATGGAAAAAAGTCAAATTCGGTCAAAAGGTGACTAGCGTCGAGATGCTAGCCCTTAGCGTCTCGACGCTAGCATTCCATA
TCAGAACATGCGCGAAATCGTCGCAGCGTCTCGACGCTGCAACCTTAGCGTCTCGACGCTGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAACTTTGAG
CTGAAGACCGGCCTCATTCAGATGGCTCGAGATGTGCATATAGAGGATCGCCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGACGCACGAG
ATTGGTTGCAGTCTATTACCCTGGGAGCATCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGA
TTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTA
CAGTTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTTTAGAGGA
TATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACCTAAAAAGATTGTTGCTGGAGTGTTTGAGGTTGATAAAGACAACCAAATAGAGGAGGCAGTCATTG
CTATCAACACCACGGTGAATGGCCACAGTGCAGCCATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCT
GAACAAGAGAAACCCCAGATGGAGTATTGTAAGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACCCTACAGGGGAAGCTGAG
GAGGACACATCATCTGATGAGGCTGAAAAGCCTAACCTGA
mRNA sequence
ATGATGCAGGTGTACGACGAGCCCGATGATGGAGACGAGGAGTTAATTTTACACAATTTATGTGAACCCGACATGCTTCTTGGTAGATCGGATTTTGCATCCTCCTTACC
TGTGTTAACTGATAACTGCCCAAAAGGAGTGAGTTTGGTGCTTGAGCGATCGCCTGGGGTAAGGTTTGAGCTTGATCCAGAAATCGAGAGGACATTCAGGAGAAGAAGGA
GAGAGCAGCGAAAAACCAGATGGAGAACGTGCCGCTTCTTCCGCAGGACTAGAGCCATTCGAGCATATGCTTTTCCAATGTTTGATGAGTTGAATCTAGGGATTGCAGTC
CTCAAATTCAGCGGCAATTTTGAAATGAAACCGATAATGTTTCAGATGTTGCAAACCGTGGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTT
TAAGGAGCTTTTCCGAAAGTGTCCCCACCATGGTTTACCACATTGTATCCAAATGAAACATTTTAAATGGGTTAAACGGAGTAACCCAGGCACGAATAAAAAGGTTAAGA
GTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAATGATTAATCATCAGACCACCACTATGGAGTCT
GCTGCAGTGGTGAACCAAGTCGTAGAGAAGCATGTGTCTATTGTGGCTGGCGCAACCACCCCAACTTCTCGTGGGGAGGACAAGGAAGCATGTGCAAGCGCAACAAAGGT
GAACCAGTCGGGATTTGCTAAAGCGCAGGTATTGCCCCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCGGGGAGTTCTCTTGAGGCGATGATGAAAGAATTTATGG
CTCGTACAGATGCCCAATTCAAAGTAAGGCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTCGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGAGT
GAGTTAGAGACTGGTCAGGGTGCTGGAGGCAGCAATAAAGATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATGTACCACC
TCTACCTTTTCCACAAAGGCAAAGCCTAAGAATCAGGATGAAGCTAGGTATTGGTGAAGCTAGACCTACCACAGTCACACTCCAGCTAGCTGATAGGTCTATCACATATC
CAGAGGACGAAATGGAGGATTGCTCTTTCATTAGGATTCTGGAGAGCACAGTTGTTGAGACAGCAATACAGGATTCGACTGATAAGCACTTGGAAGATCATGGAGAGGAG
GTGATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCGACAGCAATTGGAAAAAGGTTGATTGGAGCCAAATGCAATAGTGGAGTTAATCGGGTGCTCGGGACGCG
AGAAGATGCGAAGGAAGGAAAAGAATCAAAATGGAAAAAAGTCAAATTCGGTCAAAAGGTGACTAGCGTCGAGATGCTAGCCCTTAGCGTCTCGACGCTAGCATTCCATA
TCAGAACATGCGCGAAATCGTCGCAGCGTCTCGACGCTGCAACCTTAGCGTCTCGACGCTGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAACTTTGAG
CTGAAGACCGGCCTCATTCAGATGGCTCGAGATGTGCATATAGAGGATCGCCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGGGACGCACGAG
ATTGGTTGCAGTCTATTACCCTGGGAGCATCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGA
TTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTA
CAGTTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTTTAGAGGA
TATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACCTAAAAAGATTGTTGCTGGAGTGTTTGAGGTTGATAAAGACAACCAAATAGAGGAGGCAGTCATTG
CTATCAACACCACGGTGAATGGCCACAGTGCAGCCATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAATAAAGGTAAGGCCCCAGCT
GAACAAGAGAAACCCCAGATGGAGTATTGTAAGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACCCTACAGGGGAAGCTGAG
GAGGACACATCATCTGATGAGGCTGAAAAGCCTAACCTGA
Protein sequence
MMQVYDEPDDGDEELILHNLCEPDMLLGRSDFASSLPVLTDNCPKGVSLVLERSPGVRFELDPEIERTFRRRRREQRKTRWRTCRFFRRTRAIRAYAFPMFDELNLGIAV
LKFSGNFEMKPIMFQMLQTVGFRQLEDETFSEAWERFKELFRKCPHHGLPHCIQMKHFKWVKRSNPGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTMINHQTTTMES
AAVVNQVVEKHVSIVAGATTPTSRGEDKEACASATKVNQSGFAKAQVLPQQNKQALPQQNSGSSLEAMMKEFMARTDAQFKVRPQGKLPSDTEHPRREGKEQVKAVTLRS
ELETGQGAGGSNKDAGASGSVPDVEPPYVPPPPYVPPLPFPQRQSLRIRMKLGIGEARPTTVTLQLADRSITYPEDEMEDCSFIRILESTVVETAIQDSTDKHLEDHGEE
VIKWLDAGIIYPIATAIGKRLIGAKCNSGVNRVLGTREDAKEGKESKWKKVKFGQKVTSVEMLALSVSTLAFHIRTCAKSSQRLDAATLASRRWQQSGIVYAPINANNFE
LKTGLIQMARDVHIEDRPPRIQILILNHSWTFVGRTRLVAVYYPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQV
QLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIVAGVFEVDKDNQIEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPA
EQEKPQMEYCKAITVHQEESEEEPESEDYETLQGKLRRTHHLMRLKSLT
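
As a consistency check, the CDS listed above can be translated and compared with the protein sequence. The sketch below is illustrative only and is not the database's own pipeline: the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the standard nuclear genetic code with translation starting at the first base.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, to_stop=True):
    """Translate a CDS string in frame 0 (assumed standard genetic code).

    Whitespace and line breaks are ignored, unknown codons become 'X',
    and translation stops at the first stop codon when to_stop is True.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the CDS shown above.
print(translate("ATGATGCAGGTGTACGACGAGCCCGATGAT"))
```

The first ten codons translate to MMQVYDEPDD, which matches the start of the listed protein sequence. Passing the full CDS (line breaks included) to `translate` gives a string that can be compared against the full protein.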